Open
Conversation
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
实现了"albert/albert-base-v1"模型在"SetFit/20_newsgroups"数据集上的微调实验。
任务链接在https://gitee.com/mindspore/community/issues/IAUONP
transformers+pytorch+4060的benchmark是自己编写的,仓库位于https://github.com/outbreak-sen/albert_finetuned
更改代码位于llm/finetune/albert,只包含mindnlp+mindspore的
实验结果如下
Albert的20Newspaper微调
硬件
资源规格:NPU: 1*Ascend-D910B(显存: 64GB), CPU: 24, 内存: 192GB
智算中心:武汉智算中心
镜像:mindspore_2_5_py311_cann8
torch训练硬件资源规格:Nvidia 3090
模型与数据集
模型:"albert/albert-base-v1"
数据集:"SetFit/20_newsgroups"
训练与评估损失
由于训练的损失过长,只取最后十五个loss展示
mindspore+mindNLP
Pytorch+transformers