Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

git_6663_introduction #2877

Open
wants to merge 197 commits into
base: class
Choose a base branch
from
Open
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
197 commits
Select commit Hold shift + click to select a range
ed76a8b
Update README.md
vansin Sep 3, 2024
eab12e6
update
vansin Sep 3, 2024
937cf4a
update
vansin Sep 3, 2024
02dd725
Camp4 add L0-linux (#1984)
acwwt Oct 8, 2024
cf43fbe
Update README.md
vansin Oct 14, 2024
e95b3f9
Update README.md
vansin Oct 15, 2024
5c03c12
Update README.md
vansin Oct 15, 2024
19f6617
Update README.md
vansin Oct 15, 2024
ec6546f
Create readme.md
vansin Oct 15, 2024
6017373
Create task.md
vansin Oct 15, 2024
a3504fd
Create readme.md
vansin Oct 15, 2024
1240878
Create task.md
vansin Oct 15, 2024
034d0c4
Create readme.md
vansin Oct 15, 2024
0687193
Update README.md
vansin Oct 15, 2024
28b775d
add Prompt Tutorial (#2004)
yzfly Oct 15, 2024
0498147
Update README.md
vansin Oct 15, 2024
32dfbe9
Update README.md
vansin Oct 15, 2024
35885ca
Update README.md
vansin Oct 15, 2024
bc04d49
Camp4 llamaindex rag (#2015)
blankanswer Oct 18, 2024
5b2269e
Camp4 hf ms ml (#2016)
blankanswer Oct 20, 2024
6a788ff
Camp4 hf ms ml words (#2022)
blankanswer Oct 21, 2024
5639946
git 已修改,图像居中,加了永久保存 (#2023)
random-zhou Oct 21, 2024
8870d23
Create L0/python (#2006)
charleson98 Oct 21, 2024
cebe4cc
update
vansin Oct 21, 2024
fd07c82
Update README.md
vansin Oct 21, 2024
aa8f4ba
Update README.md
vansin Oct 21, 2024
c88dc9d
Update README.md
vansin Oct 21, 2024
33be64c
Update README.md
vansin Oct 21, 2024
216db2b
Update readme.md (#2024)
blankanswer Oct 21, 2024
9a30e0f
Update task.md
vansin Oct 21, 2024
74fbf90
optimze: Git readme.md (#2025)
blankanswer Oct 21, 2024
d3b2c7e
Update task.md
vansin Oct 22, 2024
03bcfb1
fix: project-apply link (#2036)
blankanswer Oct 22, 2024
88e8d64
Update README.md
vansin Oct 22, 2024
3423756
git (#2046)
random-zhou Oct 22, 2024
dd775aa
Update README.md
vansin Oct 22, 2024
63e3508
[doc] XTuner assistant (#1933)
JimmyMa99 Oct 22, 2024
57e2e20
Update README.md
vansin Oct 22, 2024
797e2b8
[doc]Camp4 (#2053)
JimmyMa99 Oct 22, 2024
dcb5c8e
Update readme.md (#2089)
blankanswer Oct 24, 2024
bc6ba92
Camp4 (#2098)
JimmyMa99 Oct 24, 2024
be92157
Camp4 (#2099)
JimmyMa99 Oct 24, 2024
ea0658d
Update readme.md (#2101)
blankanswer Oct 25, 2024
0dc4013
Camp4 (#2109)
JimmyMa99 Oct 25, 2024
2134980
Camp4 (#2112)
JimmyMa99 Oct 25, 2024
6a2009b
Camp4 (#2117)
JimmyMa99 Oct 26, 2024
bd091c9
Update README.md (#2125)
JimmyMa99 Oct 27, 2024
715bc8b
update camp4 cource(#2013)
1587causalai Oct 27, 2024
a6300ac
Update readme.md (#2147)
random-zhou Oct 28, 2024
bf30d76
Update README.md
vansin Oct 28, 2024
198bb50
Update python doc (#2151)
charleson98 Oct 28, 2024
368b540
Update README.md
vansin Oct 29, 2024
6e37d86
[doc] Camp4-InternVL (#2021)
Control-derek Oct 29, 2024
e00957c
Update README.md
vansin Oct 29, 2024
168b795
Update README.md (#2167)
JimmyMa99 Oct 29, 2024
d70af12
Update README.md
vansin Oct 30, 2024
ae652db
Update tasks.md
vansin Oct 30, 2024
28dfc28
Update task.md
vansin Oct 31, 2024
b0625b3
add prompt practice.md (#2205)
yzfly Oct 31, 2024
e1c3211
fix: llamaindex_torch (#2192)
blankanswer Oct 31, 2024
d779296
update video
vansin Oct 31, 2024
8070834
Update readme.md (#2281)
huangjinyuu Nov 1, 2024
54ded8b
Update get_data.py (#2294)
JimmyMa99 Nov 2, 2024
0646685
update L1/Prompt/README.md (#2303)
Mictorinox Nov 2, 2024
6bad683
Camp4 (#2168)
1587causalai Nov 2, 2024
1dae48f
Llamaindex RAG 增加浦语 API 任务 (#2297)
cgq0816 Nov 2, 2024
af4a4d6
Update readme.md (#2311)
1587causalai Nov 2, 2024
8c11de7
Update task.md
vansin Nov 2, 2024
ae25bef
Update task.md
vansin Nov 2, 2024
d14ca20
Update task.md
vansin Nov 2, 2024
aac4e72
Update task.md
vansin Nov 2, 2024
c35dd2e
Update task.md
vansin Nov 2, 2024
d120d1c
add excellent blog (#2318)
Alannikos Nov 2, 2024
4cd81d9
Update task.md
vansin Nov 2, 2024
f95a206
Update task.md
vansin Nov 2, 2024
3f1ee7d
Update task.md
vansin Nov 2, 2024
d8503af
Update readme_api.md (#2323)
cgq0816 Nov 2, 2024
3104014
Update readme_api.md
vansin Nov 2, 2024
156e644
Update readme_api.md
vansin Nov 2, 2024
6f167ea
Update readme_api.md
vansin Nov 2, 2024
8211610
Update readme_api.md
vansin Nov 2, 2024
106af6c
Update readme_api.md
vansin Nov 2, 2024
d873285
Update readme_api.md
vansin Nov 2, 2024
6f71dc0
Update readme_api.md
vansin Nov 2, 2024
31a9e16
Update readme_api.md
vansin Nov 2, 2024
08d17a1
Update readme_api.md
vansin Nov 2, 2024
b4d0ff1
Update readme_api.md
vansin Nov 2, 2024
2513518
Update readme_api.md
vansin Nov 2, 2024
8855805
Update readme_api.md
vansin Nov 2, 2024
c007749
Create readme_local.py
vansin Nov 2, 2024
ab052c2
update
vansin Nov 2, 2024
d891ceb
Update readme.md
vansin Nov 2, 2024
fec33bb
Update readme.md
vansin Nov 2, 2024
8a24879
Update readme.md
vansin Nov 2, 2024
8dbeac9
Update readme.md
vansin Nov 2, 2024
e78332f
Update README.md
vansin Nov 3, 2024
380b296
Create readme.md
vansin Nov 3, 2024
df81b28
Update README.md
vansin Nov 3, 2024
42031e7
Update readme_api.md
vansin Nov 4, 2024
d55be11
Update readme_api.md
vansin Nov 4, 2024
64c344a
Update readme_api.md
vansin Nov 4, 2024
efa9d69
Update readme_api.md
vansin Nov 4, 2024
c23d296
update
vansin Nov 4, 2024
73af528
Update readme.md
vansin Nov 4, 2024
b1cb99c
Update README.md
vansin Nov 4, 2024
d7f4750
Update README.md
vansin Nov 4, 2024
3a646fd
Create readme.md
vansin Nov 4, 2024
e12739b
Update task.md
vansin Nov 4, 2024
2a0cbae
Update task.md
vansin Nov 4, 2024
c571168
Update README.md
vansin Nov 5, 2024
67c71d4
Update tasks.md
vansin Nov 6, 2024
b1cf3b9
Update tasks.md
vansin Nov 6, 2024
4730a34
update prompt task (#2418)
yzfly Nov 6, 2024
92c9f39
Camp4Tasks_InternIntro (#2275)
Abelmx Nov 6, 2024
b0e2b31
Update readme.md
vansin Nov 6, 2024
3ba99b8
Update practice.md
vansin Nov 7, 2024
4ae792e
Update task.md
vansin Nov 7, 2024
b567b2f
Update task.md
vansin Nov 7, 2024
102f081
Update task.md
vansin Nov 7, 2024
61cec19
Update task.md
vansin Nov 7, 2024
33a364b
Update README.md
vansin Nov 7, 2024
a96e8b7
Update README.md
vansin Nov 7, 2024
36e6670
Update task.md
vansin Nov 7, 2024
bbfeab4
Update tasks.md
vansin Nov 8, 2024
74ca0ec
Update readme.md (#2502)
Jun-Howie Nov 9, 2024
d0e26a8
Update assistant_Tuner.jsonl (#2509)
JimmyMa99 Nov 9, 2024
e28dfdb
Update readme_api.md
vansin Nov 10, 2024
50c30ea
Update readme_api.md (#2508)
Alannikos Nov 10, 2024
afc0ee8
Update task.md
vansin Nov 11, 2024
2683b75
Update task.md
vansin Nov 11, 2024
2cab8d7
Update task.md
vansin Nov 11, 2024
b44a905
update readme (#2531)
Alannikos Nov 13, 2024
5b7e212
Update task.md
vansin Nov 15, 2024
2d193ed
Update task.md
vansin Nov 15, 2024
9074bba
fix: add " to docs maas (#2594)
blankanswer Nov 16, 2024
73fc0cc
fix the error caused by model's path (#2596)
Alannikos Nov 16, 2024
e0a0407
增加Lagent部分的Task (#2611)
colorfulandcjy0806 Nov 18, 2024
ec23385
Create readme.md
vansin Nov 18, 2024
e3ef677
Update readme.md
vansin Nov 18, 2024
9991fdd
Create task.md
vansin Nov 18, 2024
fe1c23d
Update README.md
vansin Nov 18, 2024
88c89f9
Update README.md
vansin Nov 18, 2024
a4ffd76
lmdeploy 部署量化出现错误的解决方法,测试有效 (#2625)
boshallen Nov 19, 2024
bd518fc
Update readme.md
vansin Nov 19, 2024
cf98a32
Update readme_api.md
vansin Nov 21, 2024
9c76dae
Update readme_api.md
vansin Nov 21, 2024
775ce8d
Update readme_api.md
vansin Nov 21, 2024
c4e8ad3
Camp4 (#2645)
charleson98 Nov 21, 2024
f09bd00
Update python task (#2651)
charleson98 Nov 21, 2024
67b7cee
Update README.md
vansin Nov 22, 2024
e78faa8
Update README.md (#2659)
JimmyMa99 Nov 22, 2024
f9dfbf8
Update change_script.py (#2660)
JimmyMa99 Nov 22, 2024
257d25a
更新Lagent的readme.md (#2643)
colorfulandcjy0806 Nov 22, 2024
6093a36
新增Lagent文档、Task的链接 (#2664)
colorfulandcjy0806 Nov 22, 2024
88d9ae5
更新标题Lagent:从零搭建你的 Multi-Agent ,同时修改了readme的第一张图 (#2669)
colorfulandcjy0806 Nov 23, 2024
a93259e
fix docs maas (#2670)
blankanswer Nov 24, 2024
12a43e9
显存计算公式存在错误 (#2627)
lzl-am Nov 24, 2024
9cd5476
更新了和风天气API的注册说明 (#2677)
colorfulandcjy0806 Nov 25, 2024
cdad0e6
Update readme.md (#2680)
colorfulandcjy0806 Nov 25, 2024
df61fda
Update README.md (#2682)
colorfulandcjy0806 Nov 25, 2024
5a0c78e
Update README.md
vansin Nov 29, 2024
1b2ef18
update Mindsearch docs (#2750)
blankanswer Dec 6, 2024
ed2a071
Update README.md (#2756)
blankanswer Dec 6, 2024
08de287
Update task.md
vansin Dec 6, 2024
455ac30
Update readme.md (#2759)
blankanswer Dec 6, 2024
6b08306
Update README.md
vansin Dec 7, 2024
0500cf8
Update get_data.py (#2762)
JimmyMa99 Dec 7, 2024
557d8b2
Update task.md
vansin Dec 7, 2024
6128562
Update task.md
vansin Dec 7, 2024
ab046ef
[Doc] Camp4 internvl update (#2772)
Control-derek Dec 8, 2024
00c12fa
Update readme.md
vansin Dec 8, 2024
167ed47
Update README.md
vansin Dec 8, 2024
a434e77
Update README.md (#2784)
JimmyMa99 Dec 9, 2024
097a652
Create requirements.txt (#2783)
JimmyMa99 Dec 9, 2024
311d3f6
Update README.md (#2786)
JimmyMa99 Dec 10, 2024
5ef5ae5
Update task.md
vansin Dec 13, 2024
611b262
Create readme.md
vansin Dec 16, 2024
aee9419
Create task.me
vansin Dec 16, 2024
7d0aaa7
Rename task.me to task.md
vansin Dec 16, 2024
a2334ee
Update README.md
vansin Dec 16, 2024
bc2ecbb
Camp4 (#2831)
Control-derek Dec 16, 2024
8aaa9ea
Update easy_README.md
vansin Dec 17, 2024
4c641a2
Update easy_README.md
vansin Dec 17, 2024
793d354
Update mass docs (#2839)
blankanswer Dec 19, 2024
4face0e
Update task.md
vansin Dec 23, 2024
e4d2408
Update task.md
vansin Dec 23, 2024
fa24fe9
Update task.md
vansin Dec 23, 2024
00ca461
Update task.md
vansin Dec 23, 2024
c35e0e5
Update README.md (#2855)
JimmyMa99 Dec 25, 2024
efcdf9b
Update readme.md (#2860)
fresh-little-lemon Dec 28, 2024
139c4cd
修改Opencompass中的模型名称错误以及解决本地评测环境报错问题 (#2854)
fresh-little-lemon Dec 28, 2024
c928327
Update lagent tutorial (#2868)
fresh-little-lemon Jan 5, 2025
173da3d
Create readme.md
vansin Jan 6, 2025
cbccccd
Update README.md
vansin Jan 6, 2025
6c38ffd
Update readme.md
vansin Jan 6, 2025
5c10a3d
1
2279460456 Jan 7, 2025
262d38c
1
2279460456 Jan 7, 2025
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
93 changes: 92 additions & 1 deletion README.md
Original file line number Diff line number Diff line change
@@ -1 +1,92 @@
# tutorial
# 书生大模型实战营(第四期)

![icamp4](https://github.com/user-attachments/assets/cf5eca85-bb13-4b96-aa54-b29fa40d36b7)


闯关手册:https://aicarrier.feishu.cn/wiki/QtJnweAW1iFl8LkoMKGcsUS9nld


## 1. 关卡

![update](https://github.com/user-attachments/assets/1f65a77a-3490-47ff-b408-be6d7a190802)


### 1.1. 入门关卡

||关卡名称|资料|闯关激励|
|:-----|:----|:----|:-----|
|第 1 关| Linux 前置基础 |[任务](docs/L0/linux/task.md)、[文档](docs/L0/linux)、[视频](https://www.bilibili.com/video/BV13U1VYmEUr)| 50元算力点 |
|第 2 关|Python 前置基础 | [任务](docs/L0/Python/task.md)、[文档](docs/L0/Python)、[视频](https://www.bilibili.com/video/BV1u61jYSExg)| 50元算力点|
|第 3 关|Git 前置基础|[任务](docs/L0/git/task.md)、[文档](docs/L0/git/)、[视频](https://www.bilibili.com/video/BV15MShYkEgg)| 50元算力点 |
|第 4 关|玩转「HF/魔搭/魔乐」等平台|[任务](docs/L0/maas/task.md)、[文档](docs/L0/maas)、[视频](https://www.bilibili.com/video/BV1XxStYYEH1/)| 50元算力点 |


### 1.2. 基础关卡


||关卡名称|资料|闯关激励|
|:-----|:----|:----|:-----|
|第 1 关| 书生大模型全链路开源体系 |[任务](docs/L1/ToolChain)、[视频](https://www.bilibili.com/video/BV1CkSUYGE1v/)| 100元算力点 |
|第 2 关| 玩转书生「多模态对话」和「AI搜索」产品 | [任务](docs/L1/InternIntro/tasks.md)、[文档](docs/L1/InternIntro)、[视频](https://www.bilibili.com/video/BV1ExDQYyEAA)| 100元算力点 |
|第 3 关| 浦语提示词工程实践 | [任务](docs/L1/Prompt/tasks.md)、[文档](docs/L1/Prompt)、[视频](https://www.bilibili.com/video/BV1tjS7YfEWJ/)| 100元算力点 |
|第 4 关| InternLM + LlamaIndex RAG 实践|[任务](docs/L1/LlamaIndex/task.md)、[文档](docs/L1/LlamaIndex)、[视频](https://www.bilibili.com/video/BV1YzDJY1E2i/)| 100元算力点 |
|第 5 关| XTuner 微调个人小助手认知 | [任务](docs/L1/XTuner/task.md)、[文档](docs/L1/XTuner)、[视频](https://www.bilibili.com/video/BV1G9SJYGEtD)| 100元算力点 |
|第 6 关| OpenCompass 评测书生大模型实践 | [任务](docs/L1/Evaluation/task.md)、[文档](docs/L1/Evaluation/)、[视频](https://www.bilibili.com/video/BV1dtD4YKENj)| 100元算力点 |



### 1.3. 进阶关卡

||关卡名称|资料|闯关激励|
|:-----|:----|:----|:-----|
|第 1 关| 探索书生大模型能力边界 | [任务](https://aicarrier.feishu.cn/wiki/SmUNwgM8vizLPRkhqr1cY1BKngb)| 100元算力点 |
|第 2 关| Lagent:从零搭建你的 Multi-Agent | [任务](docs/L2/Agent/task.md)、[文档](docs/L2/Agent)、[视频](https://www.bilibili.com/video/BV19RzcYaEFy)| 100元算力点 |
|第 3 关| LMDeploy 量化部署进阶实践 | [任务](docs/L2/LMDeploy/task.md)、[文档](docs/L2/LMDeploy)、[视频](https://www.bilibili.com/video/BV18aUHY3EEG/)| 100元算力点 |
|第 4 关| InternVL 多模态模型部署微调实践 | [任务](docs/L2/InternVL/task.md)、[文档](docs/L2/InternVL)、[视频](https://www.bilibili.com/video/BV1nESCYWEnN)| 100元算力点 |
|第 5 关| 茴香豆:企业级知识库问答工具 | [任务](docs/L2/huixiangdou/task.md)、[文档](docs/L2/huixiangdou/readme.md)、[视频](https://www.bilibili.com/video/BV1C5W2epEYP/)| 100元算力点 |
|第 6 关| MindSearch深度解析及实践 | [任务](docs/L2/MindSearch/task.md)、[文档](docs/L2/MindSearch/readme.md) | 100元算力点 |

### 1.4. 彩蛋岛

||关卡名称|资料|
|:-----|:----|:-----|
|第 1 关| InternLM+WasmEdge 部署智能体实践|[文档]、[视频](https://www.bilibili.com/video/BV1K2rAYeEXG/)|

## 2. 证书

完成进阶岛闯关将获得精美的结营证书~

![image](https://github.com/user-attachments/assets/86f420b1-5f82-4ae3-b7f6-4b4bdcdca1b8)


## 3. 书生共学计划


在大模型技术的浪潮中,面对混杂的众多信息,如何获取有效、可信的学习资源成为了一项挑战。
为此,我们推出“书生共学计划”,鼓励大家将实战营活动分享给你身边有需要的小伙伴,让每一位热爱技术的朋友都能在这个复杂的信息环境中找到自己的航向,帮助他们在大模型的学习之路上少走弯路。


参与方法
- 启航准备:通过填写问卷报名加入实战营,开启你的学习之旅。
- 专属海报:访问书生共学计划活动页面(https://colearn.intern-ai.org.cn/co ),系统将自动生成你的专属海报
- 招募同行者:将海报分享给你身边的小伙伴,邀请他们报名实战营,共享知识的力量。


<div align="center">

<img width="440" alt="image" src="https://github.com/user-attachments/assets/99f03ded-da19-42a1-a42d-982fe58ae0c3">

</div>




独家奖励等你拿
- 每邀请 1 位同学填写报名问卷即可获得 50 算力点。
- 成功邀请 3 人,解锁 InternStudio 平台 24GB A100 及 80GB 存储使用权限。
- 成功邀请 6 人,解锁 InternStudio 平台 40GB A100 及 120GB 存储使用权限。
- 成功邀请 16 人,解锁 InternStudio 平台 80GB A100 及 200GB 存储使用权限。

展现你的影响力,成为知识的使者
这不仅是一个促进个人学习和成长的机遇,更是一个帮助他人、为自己赢得认可和资源的舞台。通过你的分享,我们可以一起帮助更多的人接触和了解前沿技术,期待你的加入。

Empty file added configs/.gitkeep
Empty file.
219 changes: 219 additions & 0 deletions configs/internlm2_5_chat_7b_qlora_alpaca_e3_copy.py
Original file line number Diff line number Diff line change
@@ -0,0 +1,219 @@
# Copyright (c) OpenMMLab. All rights reserved.
import torch
from datasets import load_dataset
from mmengine.dataset import DefaultSampler
from mmengine.hooks import (CheckpointHook, DistSamplerSeedHook, IterTimerHook,
LoggerHook, ParamSchedulerHook)
from mmengine.optim import AmpOptimWrapper, CosineAnnealingLR, LinearLR
from peft import LoraConfig
from torch.optim import AdamW
from transformers import (AutoModelForCausalLM, AutoTokenizer,
BitsAndBytesConfig)

from xtuner.dataset import process_hf_dataset
from xtuner.dataset.collate_fns import default_collate_fn
from xtuner.dataset.map_fns import alpaca_map_fn, template_map_fn_factory
from xtuner.engine.hooks import (DatasetInfoHook, EvaluateChatHook,
VarlenAttnArgsToMessageHubHook)
from xtuner.engine.runner import TrainLoop
from xtuner.model import SupervisedFinetune
from xtuner.parallel.sequence import SequenceParallelSampler
from xtuner.utils import PROMPT_TEMPLATE, SYSTEM_TEMPLATE

#######################################################################
# PART 1 Settings #
#######################################################################
# Model
pretrained_model_name_or_path = '/root/finetune/models/internlm2_5-7b-chat'
use_varlen_attn = False

# Data
alpaca_en_path = '/root/finetune/data/assistant_Tuner_change.jsonl'
prompt_template = PROMPT_TEMPLATE.internlm2_chat
max_length = 2048
pack_to_max_length = True

# parallel
sequence_parallel_size = 1

# Scheduler & Optimizer
batch_size = 1 # per_device
accumulative_counts = 1
accumulative_counts *= sequence_parallel_size
dataloader_num_workers = 0
max_epochs = 3
optim_type = AdamW
lr = 2e-4
betas = (0.9, 0.999)
weight_decay = 0
max_norm = 1 # grad clip
warmup_ratio = 0.03

# Save
save_steps = 500
save_total_limit = 2 # Maximum checkpoints to keep (-1 means unlimited)

# Evaluate the generation performance during the training
evaluation_freq = 500
SYSTEM = SYSTEM_TEMPLATE.alpaca
evaluation_inputs = [
'请介绍一下你自己', 'Please introduce yourself'
]

#######################################################################
# PART 2 Model & Tokenizer #
#######################################################################
tokenizer = dict(
type=AutoTokenizer.from_pretrained,
pretrained_model_name_or_path=pretrained_model_name_or_path,
trust_remote_code=True,
padding_side='right')

model = dict(
type=SupervisedFinetune,
use_varlen_attn=use_varlen_attn,
llm=dict(
type=AutoModelForCausalLM.from_pretrained,
pretrained_model_name_or_path=pretrained_model_name_or_path,
trust_remote_code=True,
torch_dtype=torch.float16,
quantization_config=dict(
type=BitsAndBytesConfig,
load_in_4bit=True,
load_in_8bit=False,
llm_int8_threshold=6.0,
llm_int8_has_fp16_weight=False,
bnb_4bit_compute_dtype=torch.float16,
bnb_4bit_use_double_quant=True,
bnb_4bit_quant_type='nf4')),
lora=dict(
type=LoraConfig,
r=64,
lora_alpha=16,
lora_dropout=0.1,
bias='none',
task_type='CAUSAL_LM'))

#######################################################################
# PART 3 Dataset & Dataloader #
#######################################################################
alpaca_en = dict(
type=process_hf_dataset,
dataset=dict(type=load_dataset, path='json', data_files=dict(train=alpaca_en_path)),
tokenizer=tokenizer,
max_length=max_length,
dataset_map_fn=None,
template_map_fn=dict(
type=template_map_fn_factory, template=prompt_template),
remove_unused_columns=True,
shuffle_before_pack=True,
pack_to_max_length=pack_to_max_length,
use_varlen_attn=use_varlen_attn)

sampler = SequenceParallelSampler \
if sequence_parallel_size > 1 else DefaultSampler
train_dataloader = dict(
batch_size=batch_size,
num_workers=dataloader_num_workers,
dataset=alpaca_en,
sampler=dict(type=sampler, shuffle=True),
collate_fn=dict(type=default_collate_fn, use_varlen_attn=use_varlen_attn))

#######################################################################
# PART 4 Scheduler & Optimizer #
#######################################################################
# optimizer
optim_wrapper = dict(
type=AmpOptimWrapper,
optimizer=dict(
type=optim_type, lr=lr, betas=betas, weight_decay=weight_decay),
clip_grad=dict(max_norm=max_norm, error_if_nonfinite=False),
accumulative_counts=accumulative_counts,
loss_scale='dynamic',
dtype='float16')

# learning policy
# More information: https://github.com/open-mmlab/mmengine/blob/main/docs/en/tutorials/param_scheduler.md # noqa: E501
param_scheduler = [
dict(
type=LinearLR,
start_factor=1e-5,
by_epoch=True,
begin=0,
end=warmup_ratio * max_epochs,
convert_to_iter_based=True),
dict(
type=CosineAnnealingLR,
eta_min=0.0,
by_epoch=True,
begin=warmup_ratio * max_epochs,
end=max_epochs,
convert_to_iter_based=True)
]

# train, val, test setting
train_cfg = dict(type=TrainLoop, max_epochs=max_epochs)

#######################################################################
# PART 5 Runtime #
#######################################################################
# Log the dialogue periodically during the training process, optional
custom_hooks = [
dict(type=DatasetInfoHook, tokenizer=tokenizer),
dict(
type=EvaluateChatHook,
tokenizer=tokenizer,
every_n_iters=evaluation_freq,
evaluation_inputs=evaluation_inputs,
system=SYSTEM,
prompt_template=prompt_template)
]

if use_varlen_attn:
custom_hooks += [dict(type=VarlenAttnArgsToMessageHubHook)]

# configure default hooks
default_hooks = dict(
# record the time of every iteration.
timer=dict(type=IterTimerHook),
# print log every 10 iterations.
logger=dict(type=LoggerHook, log_metric_by_epoch=False, interval=10),
# enable the parameter scheduler.
param_scheduler=dict(type=ParamSchedulerHook),
# save checkpoint per `save_steps`.
checkpoint=dict(
type=CheckpointHook,
by_epoch=False,
interval=save_steps,
max_keep_ckpts=save_total_limit),
# set sampler seed in distributed evrionment.
sampler_seed=dict(type=DistSamplerSeedHook),
)

# configure environment
env_cfg = dict(
# whether to enable cudnn benchmark
cudnn_benchmark=False,
# set multi process parameters
mp_cfg=dict(mp_start_method='fork', opencv_num_threads=0),
# set distributed parameters
dist_cfg=dict(backend='nccl'),
)

# set visualizer
visualizer = None

# set log level
log_level = 'INFO'

# load from which checkpoint
load_from = None

# whether to resume training from the loaded checkpoint
resume = False

# Defaults to use random seed and disable `deterministic`
randomness = dict(seed=None, deterministic=False)

# set log processor
log_processor = dict(by_epoch=False)
Empty file added data/.gitkeep
Empty file.
Loading