基于字符词组特征与原型网络融合训练的事件分类

Event Classification Based on Fusion Training of Character Phrase Features and Prototype Network

下载PDF

导出

摘要事件检测与分类任务,包含两个步骤的子任务:识别事件触发词和将其分类为正确的事件类型。在这项任务中首要关键的就是触发词的识别,利用基于神经网络的模型来识别句子中的触发词是这些年的主流方法。然而,当涉及到由语义结构不清和语义相近的字符和词组组成的句子时,识别事件的触发词变得有些困难。本文提出一个融合字与词信息,再通过原型网络来精确事件分类的模型:输入融合字与词的信息的嵌入信息,将各个组成的嵌入信息投影到一个高维的特征空间中,对于每个维度类型的样本信息提取他们的均值作为聚类中心即原型,使用欧几里得距离作为距离度量,训练使得测试样本到自己类别原型的距离越近越好,到其他类别原型的距离越远越好,更精确地识别出句子所包含的触发词,分辨出事件类型。 The event detection and classification task consists of two-step subtasks: identifying the event trigger word and classifying it into the correct event type. The most important thing in this task is the recognition of trigger words. Using neural network-based models to identify trigger words in sentences is the mainstream method in these years. However, when it comes to sentences composed of characters and phrases with unclear semantic structure and similar semantics, it becomes difficult to identify the trigger words of the event. This paper proposes to train an n-dimensional prototype network that integrates the embedded information of the word information: input the embedded information of the fused word and word information, and project the embedded information of each composition into a high-dimensional feature space. For each dimension type, the sample information extracts their mean value as the cluster center or prototype, and uses the Euclidean distance as the distance metric. Training makes the test sample the closer to the prototype of its own category, the better, and the farther the distance to prototypes of other categories, the better. Accurately identify the trigger words contained in the sentence and distinguish the type of event.

作者赵芝茵程良伦陈光明

机构地区广东工业大学计算机学院

出处《计算机科学与应用》 2021年第4期920-927,共10页 Computer Science and Application

关键词触发词检测事件分类原型网络神经网络

分类号 TP3 [自动化与计算机技术—计算机科学与技术]

引文网络
相关文献

1崔立新,侯强.基于机会成本的高速铁路客票动态定价模型[J].铁道学报,2021,43(3):9-17. 被引量：3
2《工业工程》第13卷(2010年)总目次[J].工业工程,2010,13(6).
3郭月玲.谈加强企业固定资产管理工作的思路[J].市场周刊·理论版,2020(40):3-4.
4王宁.大数据时代下的计算机网络安全与防范策略研究[J].IT经理世界,2020,23(12):139-139.
5孙中苗,徐琪.随机需求下考虑不同竞争情形的网约车平台动态定价[J].中国管理科学,2021,29(1):138-148. 被引量：26
6李卫兵,曾泽熠,曾强.面向混合数据源的企业数据库私有云设计[J].电力大数据,2021,24(2):27-33. 被引量：2
7黄尹旭.区块链应用技术的金融市场基础设施之治理--以数字货币为例[J].东方法学,2020(5):56-65. 被引量：43
8陈莉莉.新时期高校党建与思政教育管理研究——评《高校网络舆情管理与思政教育创新——基于网络身份隐匿视角的研究》[J].科技管理研究,2021,41(7). 被引量：2
9刘君红.学术虚拟社区冲突话语生成机制及其身份建构研究[J].外国语文研究（辑刊）,2019(1):82-93. 被引量：1
10杨志坚(译),许芹(译),徐华新(译).国际会计准则理事会2020年的挑战和发展以及未来一年的计划[J].金融会计,2021(3):12-16.

计算机科学与应用

2021年第4期

浏览历史

内容加载中请稍等...

基于字符词组特征与原型网络融合训练的事件分类

相关作者

相关机构

相关主题

浏览历史