Hello, author! When running the sft script, due to insufficient equipment, I was unable to conduct distributed training. Therefore, I set the parameter -- num_processes to 1 and disabled the wandb operation. But now there are some mistakes. This might be an initialization problem and I can't solve it
