期刊文献+

基于RoBERTa-wwm-BiLSTM-CRF的扶持政策文本实体识别研究 被引量:7

Entity recognition of support policy text based on RoBERTa-wwm-BiLSTM-CRF
下载PDF
导出
摘要 扶持政策能够帮助企业获得政府在资金补助、税务减免等方面的支持,帮助企业更好地发展。针对扶持政策文本存在实体边界难以划分且传统词向量无法解决一词多义的问题,提出基于RoBERTa-wwm-BiLSTM-CRF的扶持政策文本实体识别模型。该模型使用预训练语言模型RoBERTa-wwm训练得到动态词向量,能够表征词的多义性;利用BiLSTM网络进一步抽取扶持政策文本的上下文信息和语义特征;最后通过条件随机场得到最佳的预测序列。提出的模型在自建的5512条语料组成的扶持政策数据集上的F1值达到91.7%,结果表明,该模型能够有效识别扶持政策文本的命名实体,从而提高企业筛选政策的效率。 Support policies can help enterprises obtain government support in funding subsidies,tax reductions,and other aspects,and help enterprises develop better.In order to address the problem that the entity boundaries in support policy texts are difficult to define and traditional word vectors cannot solve the problem of polysemy,a support policy text s named entity recognition model based on RoBERTa-wwm-BiLSTM-CRF is proposed.Firstly,the model uses the pre-trained language model RoBERTa-wwm to obtain dynamic word vectors,which can represent the polysemy of words.Secondly,the BiLSTM network is used to further extract the context information and semantic features of support policy texts.Finally,the best prediction sequence is obtained through the conditional random field.The proposed model achieves an F1 value of 91.7%on a self-built support policy dataset composed of 5512 sentences.The results show that the model can effectively recognize the named entities in support policy texts,thereby improving the efficiency of enterprise policy screening.
作者 喻金平 朱伟锋 廖列法 YU Jin-ping;ZHU Wei-feng;LIAO Lie-fa(School of Information Engineering,Jiangxi University of Science and Technology,Ganzhou 314000;School of Software Engineering,Jiangxi University of Science and Technology,Nanchang 330000,China)
出处 《计算机工程与科学》 CSCD 北大核心 2023年第8期1498-1507,共10页 Computer Engineering & Science
基金 国家自然科学基金(71462018,71761018)。
关键词 扶持政策文本 预训练语言模型 命名实体识别 动态词向量 企业扶持 support policy text pre-trained language model named entity recognition dynamic word vector enterprise support
  • 相关文献

参考文献23

二级参考文献173

共引文献587

同被引文献108

引证文献7

二级引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部