deepseek-r1-7b - deepseek-r1-7b technical, learning, and experience articles

F_D_Z

1 year ago

Reproduction results with 8K samples on the DeepSeek-R1-7B model: 7B Model and 8K Examples: Emerging Reasoning with Reinforcement Learning is Both Effective and Effic (notion.site)