Tech Stack

humaneval+

hhhhhlt
6 months ago
paper reading · chatgpt · benchmarking · code LLMs · humaneval+ · evalplus
[Code LLMs] Is Your Code Generated by ChatGPT Really Correct? Paper Reading Notes
Keywords: evaluation framework, LLM-synthesized code, benchmark