zh/guide_cloud/integration: 4 files changed, +11 -11 lines changed
@@ -64,11 +64,11 @@ modelscope download --model=AI-ModelScope/sdxl-vae-fp16-fix --local_dir models/k

  If you need offline logging, add `--swanlab_mode "local"`.

- ```bash {3,4}
+ ```bash
  CUDA_VISIBLE_DEVICES="0" python examples/train/kolors/train_kolors_lora.py \
  ...
- --use_swanlab \
- --swanlab_mode "cloud"
+ --use_swanlab \ # [!code ++]
+ --swanlab_mode "cloud" # [!code ++]
  ```

  ## Start training
@@ -38,7 +38,7 @@ bash examples/run_qwen2_5_7b_math_swanlab.sh

  To see how this works: EasyR1 is a clean fork of the original veRL project, so it inherits [veRL's SwanLab integration](/guide_cloud/integration/integration-verl.md). Let's look at the `run_qwen2_5_7b_math_swanlab.sh` file:

- ```sh {10}
+ ```sh
  set -x

  export VLLM_ATTENTION_BACKEND=XFORMERS
@@ -48,7 +48,7 @@ MODEL_PATH=Qwen/Qwen2.5-7B-Instruct # replace it with your local file path
  python3 -m verl.trainer.main \
      config=examples/grpo_example.yaml \
      worker.actor.model.model_path=${MODEL_PATH} \
-     trainer.logger=['console','swanlab'] \
+     trainer.logger=['console','swanlab'] \ # [!code ++]
      trainer.n_gpus_per_node=4
  ```

@@ -38,8 +38,8 @@ pip install swanlab
  ```bash
  swift sft \
      ...
-     --report_to swanlab \
-     --swanlab_project swift-robot \
+     --report_to swanlab \ # [!code ++]
+     --swanlab_project swift-robot \ # [!code ++]
      ...
  ```

@@ -74,8 +74,8 @@ swift sft \
      --dataloader_num_workers 4 \
      --model_author swift \
      --model_name swift-robot \
-     --report_to swanlab \
-     --swanlab_project swift-robot
+     --report_to swanlab \ # [!code ++]
+     --swanlab_project swift-robot # [!code ++]
  ```

  After running the command, you can follow the training process in SwanLab:
@@ -55,10 +55,11 @@ pip install -U swanlab

  **The full test command is as follows:**

- ```bash {20}
+ ```bash
  PYTHONUNBUFFERED=1 python3 -m verl.trainer.main_ppo \
      data.train_files=$HOME/data/gsm8k/train.parquet \
      data.val_files=$HOME/data/gsm8k/test.parquet \
+     trainer.logger=['console','swanlab'] \ # [!code ++]
      data.train_batch_size=256 \
      data.val_batch_size=1312 \
      data.max_prompt_length=512 \
@@ -75,7 +76,6 @@ PYTHONUNBUFFERED=1 python3 -m verl.trainer.main_ppo \
      critic.model.path=Qwen/Qwen2.5-0.5B-Instruct \
      critic.ppo_micro_batch_size_per_gpu=4 \
      algorithm.kl_ctrl.kl_coef=0.001 \
-     trainer.logger=['console','swanlab'] \
      +trainer.val_before_train=False \
      trainer.default_hdfs_dir=null \
      trainer.n_gpus_per_node=1 \