Review of Text-Oriented Entity Relation Extraction Research (面向文本实体关系抽取研究综述)
Abstract: Information extraction is the foundation of knowledge graph construction. Relation extraction, a key process and core step of information extraction, aims to locate entities in text data and identify the semantic relations between them. Improving the efficiency of relation extraction can therefore effectively improve the quality of information extraction, which in turn affects knowledge graph construction and subsequent downstream tasks. According to the length of the text being processed, relation extraction can be divided into sentence-level and document-level relation extraction, and each level has its own advantages and disadvantages in different application scenarios: sentence-level relation extraction suits scenarios with smaller datasets, while document-level relation extraction suits scenarios such as news event analysis and relation mining in long reports or articles. Unlike existing surveys of relation extraction, this paper first introduces the basic concepts of relation extraction and the field's development in recent years, lists the datasets used at the two levels, and gives an overview of their characteristics. It then elaborates on sentence-level and document-level relation extraction separately, summarizes the advantages and disadvantages of each level, and analyzes the performance and limitations of representative models for each class of methods. Finally, it summarizes the open problems in the current research field and discusses the prospects for the future development of relation extraction.
Authors: REN Anqi (任安琪); LIU Lin (柳林); WANG Hailong (王海龙); LIU Jing (刘静). Affiliations: School of Computer Science and Technology, Inner Mongolia Normal University, Hohhot 010022, China; Computer Science Joint Innovation Laboratory, Inner Mongolia Normal University, Hohhot 010022, China; Library, Inner Mongolia University, Hohhot 010021, China.
Source: Journal of Frontiers of Computer Science and Technology (《计算机科学与探索》), indexed in CSCD and the Peking University Core Journals list, 2024, No. 11, pp. 2848-2871 (24 pages).
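Funding: National Key Research and Development Program of China (2020YFC1523305); Natural Science Foundation of Inner Mongolia Autonomous Region (2023LHMS06006); Fundamental Research Funds of Inner Mongolia Normal University (2022JBYJ032); Archival Science and Technology Project of the Inner Mongolia Autonomous Region Archives (2023-13); Open Project of the Key Laboratory of Infinite-dimensional Hamiltonian System and Its Algorithm Application (Inner Mongolia Normal University), Ministry of Education (2023KFYB03, 2023KFZD03).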
Keywords: information extraction; entity relation extraction; sentence-level relation extraction; document-level relation extraction; knowledge graph construction