GPT in Detail: Information Extraction Tasks (GPT-3 Family Large Language Models)

GPT-3 FAMILY LARGE LANGUAGE MODELS

Information Extraction

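Information extraction (IE) in natural language processing extracts structured data, such as entities, relations, and events, from unstructured text [164]. Converting unstructured text into structured data enables efficient data processing, knowledge discovery, and decision making, and it improves information retrieval and search.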
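
As a rough illustration of what "structured data" might mean here, the sketch below represents the entities, relations, and events of one sentence as plain Python dataclasses. The class names, field names, labels, and the example sentence are all hypothetical and chosen only for illustration; they are not taken from the survey or from any particular dataset.

```python
from dataclasses import dataclass, field
from typing import Dict


@dataclass
class Entity:
    text: str   # surface form of the mention
    type: str   # entity type label (hypothetical label set)
    start: int  # character offset where the mention starts
    end: int    # character offset just past the mention


@dataclass
class Relation:
    head: Entity
    tail: Entity
    label: str  # relation label (hypothetical)


@dataclass
class Event:
    trigger: str  # word or phrase that signals the event
    type: str     # event type label (hypothetical)
    arguments: Dict[str, Entity] = field(default_factory=dict)  # role -> entity


def mention(sentence: str, text: str, type_: str) -> Entity:
    """Locate the first occurrence of `text` and wrap it as an Entity."""
    start = sentence.index(text)
    return Entity(text, type_, start, start + len(text))


# Made-up example sentence and hand-written annotations.
sentence = "Alice Smith joined Acme Corp in 2020."
person = mention(sentence, "Alice Smith", "PERSON")
org = mention(sentence, "Acme Corp", "ORGANIZATION")

relation = Relation(head=person, tail=org, label="employee_of")
event = Event(
    trigger="joined",
    type="Start-Position",
    arguments={"Person": person, "Employer": org},
)
```

Keeping character offsets alongside the mention text is one possible design choice: it lets each extracted item be traced back to the exact span of the source sentence.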

Information Extraction Subtasks

Information extraction covers a range of subtasks [153]:

  1. Entity typing (ET)
  2. Entity extraction, also called named entity recognition (NER)
  3. Relation classification (RC)
  4. Relation extraction (RE)
  5. Event detection (ED)
  6. Event argument extraction (EAE)
  7. Event extraction (EE)

**Entity Typing (ET):** classifying identified named entity mentions into one of the predefined entity types [165].

**Named Entity Recognition (NER):** identifying entity mentions and then assigning them to appropriate entity types [166].

**Relation Classification (RC):** identifying the semantic relationship between two given target entities in a sentence [167].

**Relation Extraction (RE):** extracting the entities and then classifying the semantic relationship between the two target entities, i.e., entity extraction followed by relation classification [168].

**Event Detection (ED):** identifying and categorizing the words or phrases that trigger events [169].

**Event Argument Extraction (EAE):** identifying event arguments, i.e., the entities involved in the event, and then classifying their roles [170].

**Event Extraction (EE):** extracting both the events and the involved entities, i.e., event detection followed by event argument extraction [171].
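
The definitions above imply a composition: RE is entity extraction followed by RC, and EE is ED followed by EAE. The sketch below makes that structure explicit with toy stand-ins. The lookup tables, label names, and function names are hypothetical placeholders for real predictors (for example, a prompted GPT model); only the way the functions are chained reflects the definitions.

```python
from typing import Dict, List, Tuple

# Hypothetical toy "models": fixed lookup tables standing in for real predictors.
ENTITY_LEXICON: Dict[str, str] = {
    "Alice Smith": "PERSON",
    "Acme Corp": "ORGANIZATION",
}
TRIGGER_LEXICON: Dict[str, str] = {"joined": "Start-Position"}
RELATION_RULES: Dict[Tuple[str, str], str] = {
    ("PERSON", "ORGANIZATION"): "employee_of",
}
ROLE_RULES: Dict[Tuple[str, str], str] = {
    ("Start-Position", "PERSON"): "Person",
    ("Start-Position", "ORGANIZATION"): "Employer",
}

Mention = Tuple[str, str]  # (mention text, entity type)


def extract_entities(sentence: str) -> List[Mention]:
    """NER: find mentions and assign each one an entity type."""
    return [(m, t) for m, t in ENTITY_LEXICON.items() if m in sentence]


def classify_relation(head: Mention, tail: Mention) -> str:
    """RC: label the relation between two given entities."""
    return RELATION_RULES.get((head[1], tail[1]), "no_relation")


def extract_relations(sentence: str) -> List[Tuple[str, str, str]]:
    """RE: entity extraction followed by relation classification."""
    entities = extract_entities(sentence)
    triples = []
    for head in entities:
        for tail in entities:
            if head == tail:
                continue
            label = classify_relation(head, tail)
            if label != "no_relation":
                triples.append((head[0], label, tail[0]))
    return triples


def detect_events(sentence: str) -> List[Tuple[str, str]]:
    """ED: find trigger words and assign each one an event type."""
    return [(w, t) for w, t in TRIGGER_LEXICON.items() if w in sentence]


def extract_arguments(sentence: str, event_type: str) -> Dict[str, str]:
    """EAE: find the entities involved in an event and classify their roles."""
    roles = {}
    for text, entity_type in extract_entities(sentence):
        role = ROLE_RULES.get((event_type, entity_type))
        if role is not None:
            roles[role] = text
    return roles


def extract_events(sentence: str) -> List[dict]:
    """EE: event detection followed by event argument extraction."""
    return [
        {"trigger": trigger, "type": event_type,
         "arguments": extract_arguments(sentence, event_type)}
        for trigger, event_type in detect_events(sentence)
    ]


sentence = "Alice Smith joined Acme Corp in 2020."
relations = extract_relations(sentence)
events = extract_events(sentence)
```

Swapping any of the lookup-based functions for a model call would leave the pipeline shape unchanged, which is the point of the sketch.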

GPT on the Relation Classification Task

[138], [149], [153]--[156], [163]

[138] Y. Wang, Y. Zhao, and L. Petzold, "Are large language models ready for healthcare? A comparative study on clinical language understanding," arXiv preprint arXiv:2304.05368, 2023. Prompting: chain-of-thought (CoT), self-question prompting (SQP).

Link: https://proceedings.mlr.press/v219/wang23c/wang23c.pdf

[149] B. J. Gutiérrez, N. McNeal, C. Washington, Y. Chen, L. Li, H. Sun, and Y. Su, "Thinking about GPT-3 in-context learning for biomedical IE? Think again," in Findings of the Association for Computational Linguistics: EMNLP 2022, 2022, pp. 4497--4512.

Link: https://arxiv.org/pdf/2203.08410

[153] B. Li, G. Fang, Y. Yang, Q. Wang, W. Ye, W. Zhao, and S. Zhang, "Evaluating ChatGPT's information extraction capabilities: An assessment of performance, explainability, calibration, and faithfulness," arXiv preprint arXiv:2304.11633, 2023.

Link: https://arxiv.org/pdf/2304.11633

[154] C. Chan, J. Cheng, W. Wang, Y. Jiang, T. Fang, X. Liu, and Y. Song, "ChatGPT evaluation on sentence level relations: A focus on temporal, causal, and discourse relations," arXiv preprint arXiv:2304.14827, 2023.

Link: https://arxiv.org/pdf/2304.14827

[155] X. Xu, Y. Zhu, X. Wang, and N. Zhang, "How to unleash the power of large language models for few-shot relation extraction?" arXiv preprint arXiv:2305.01555, 2023.

Link: https://arxiv.org/pdf/2305.01555

[156] Z. Wan, F. Cheng, Z. Mao, Q. Liu, H. Song, J. Li, and S. Kurohashi, "GPT-RE: In-context learning for relation extraction using large language models," arXiv preprint arXiv:2305.02105, 2023. Prompting: chain-of-thought (CoT).

Link: https://arxiv.org/pdf/2305.02105

[163] K. Zhang, B. J. Gutiérrez, and Y. Su, "Aligning instruction tasks unlocks large language models as zero-shot relation extractors," arXiv preprint arXiv:2305.11159, 2023.

Link: https://arxiv.org/pdf/2305.11159
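
The studies listed above evaluate GPT models on relation classification through prompting. As a minimal sketch of what such a prompt might look like, the function below formats a sentence, two target entities, and a candidate label set into a zero-shot prompt, with an optional chain-of-thought style instruction. The prompt wording, the label set, and the function name are hypothetical and not taken from any of the cited papers; sending the prompt to a model is deliberately left out.

```python
from typing import Sequence


def build_rc_prompt(
    sentence: str,
    head: str,
    tail: str,
    labels: Sequence[str],
    use_cot: bool = False,
) -> str:
    """Build a zero-shot relation classification prompt (hypothetical wording)."""
    lines = [
        "Task: relation classification.",
        f"Sentence: {sentence}",
        f"Entity 1: {head}",
        f"Entity 2: {tail}",
        f"Candidate relations: {', '.join(labels)}",
    ]
    if use_cot:
        # Chain-of-thought variant: ask for reasoning before the final label.
        lines.append(
            "Explain your reasoning step by step, "
            "then give the final relation on the last line."
        )
    else:
        lines.append("Answer with exactly one relation from the list.")
    return "\n".join(lines)


prompt = build_rc_prompt(
    sentence="Alice Smith joined Acme Corp in 2020.",
    head="Alice Smith",
    tail="Acme Corp",
    labels=["employee_of", "founder_of", "no_relation"],
)
cot_prompt = build_rc_prompt(
    sentence="Alice Smith joined Acme Corp in 2020.",
    head="Alice Smith",
    tail="Acme Corp",
    labels=["employee_of", "founder_of", "no_relation"],
    use_cot=True,
)
```

Because the two target entities are given in the prompt, this matches the RC setting; an RE prompt would instead have to ask the model to find the entities as well.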

GPT on the Relation Extraction Task

[148], [151]--[153], [158], [161], [162]

[148] X. Wei, X. Cui, N. Cheng, X. Wang, X. Zhang, S. Huang, P. Xie, J. Xu, Y. Chen, M. Zhang et al., "Zero-shot information extraction via chatting with ChatGPT," arXiv preprint arXiv:2302.10205, 2023.

Link: https://eva.fing.edu.uy/pluginfile.php/524749/mod_folder/content/0/ChatIE_Zero-Shot%20Information%20Extraction%20via%20Chatting%20with%20ChatGPT.pdf

[151] H. Rehana, N. B. Çam, M. Basmaci, Y. He, A. Özgür, and J. Hur, "Evaluation of GPT and BERT-based models on identifying protein-protein interactions in biomedical text," arXiv preprint arXiv:2303.17728, 2023.

Link: https://pmc.ncbi.nlm.nih.gov/articles/PMC11101131/pdf/nihpp-2303.17728v2.pdf

[152] C. Yuan, Q. Xie, and S. Ananiadou, "Zero-shot temporal relation extraction with ChatGPT," arXiv preprint arXiv:2304.05454, 2023. Prompting: chain-of-thought (CoT), event ranking (ER).

Link: https://arxiv.org/pdf/2304.05454

[153] B. Li, G. Fang, Y. Yang, Q. Wang, W. Ye, W. Zhao, and S. Zhang, "Evaluating ChatGPT's information extraction capabilities: An assessment of performance, explainability, calibration, and faithfulness," arXiv preprint arXiv:2304.11633, 2023.

Link: https://arxiv.org/pdf/2304.11633

[158] Y. Ma, Y. Cao, Y. Hong, and A. Sun, "Large language model is not a good few-shot information extractor, but a good reranker for hard samples!" arXiv preprint arXiv:2303.08559, 2023.

Link: https://arxiv.org/pdf/2303.08559

[161] S. Wadhwa, S. Amir, and B. C. Wallace, "Revisiting relation extraction in the era of large language models," arXiv preprint arXiv:2305.05003, 2023. Prompting: chain-of-thought (CoT).

Link: https://pmc.ncbi.nlm.nih.gov/articles/PMC10482322/pdf/nihms-1912166.pdf

[162] P. Li, T. Sun, Q. Tang, H. Yan, Y. Wu, X. Huang, and X. Qiu, "CodeIE: Large code generation models are better few-shot information extractors," arXiv preprint arXiv:2305.05711, 2023.

Link: https://arxiv.org/pdf/2305.05711

Summary

References

[164] Y. Lu, Q. Liu, D. Dai, X. Xiao, H. Lin, X. Han, L. Sun, and H. Wu, "Unified structure generation for universal information extraction," in Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022, pp. 5755--5772.

[165] Y. Chen, J. Cheng, H. Jiang, L. Liu, H. Zhang, S. Shi, and R. Xu, "Learning from sibling mentions with scalable graph inference in fine-grained entity typing," in Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022, pp. 2076--2087.

[166] S. S. S. Das, A. Katiyar, R. J. Passonneau, and R. Zhang, "CONTaiNER: Few-shot named entity recognition via contrastive learning," in Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022, pp. 6338--6353.

[167] S. Wu and Y. He, "Enriching pre-trained language model with entity information for relation classification," in Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019, pp. 2361--2364.

[168] D. Ye, Y. Lin, P. Li, and M. Sun, "Packed levitated marker for entity and relation extraction," in Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022, pp. 4904--4917.

[169] K. Zhao, X. Jin, L. Bai, J. Guo, and X. Cheng, "Knowledge-enhanced self-supervised prototypical network for few-shot event detection," in Findings of the Association for Computational Linguistics: EMNLP 2022, 2022, pp. 6266--6275.

[170] Y. Ma, Z. Wang, Y. Cao, M. Li, M. Chen, K. Wang, and J. Shao, "Prompt for extraction? PAIE: Prompting argument interaction for event argument extraction," in Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022, pp. 6759--6774.

[1] A Survey of GPT-3 Family Large Language Models Including ChatGPT and GPT-4. 2023.
