LlamaFactory微调Qwen3-0.6B大模型踩坑实验整理确保Qwen3-0.6B模型在特定人物和自我认知上不犯事实性错误,调一个xx领域专属的人物专家模型GRADIO_SERVER_PORT=8103 CUDA_VISIBLE_DEVICES=1,2,5,7 llamafactory-cli train –stage sft –do_train –model_name_or_path /workspace/codes/deepseek/Qwen3-0.6B –dataset alpaca_zh_demo,identity,train –dataset_dir