基于多特征融合的意图识别算法研究

Research on Intent Recognition Algorithm Based on Multi-feature Fusion

下载PDF

导出

摘要针对中文口语短文本缺少上下文信息、语法不规范和噪声较大等特征造成语义模糊,进而导致用户意图识别准确率不高的问题,提出了一种基于多特征融合的意图识别算法。算法对传统Bi-LSTM(Bi-directional Long Shot-Term Memory)文本分类算法进行改进,将原始文本的字向量、词向量、词性向量和实体知识库向量进行融合,结合字级别的意图识别模型,在人工标注的实际场景下的用户意图数据集上进行训练和测试。实验结果表明,改进后的用户意图识别算法在实际场景中准确率等评价指标有明显提高。 To solve the problem of semantic ambiguity caused by the lack of contextual information,irregular grammar in the spoken Chinese text,an intention recognition algorithm based on multi-feature fusion is proposed,which improves the traditional Bi-LSTM(Bi-directional Long Shot-Term Memory)text classification algorithm.The improved algorithm fuses the character vector,word vector,part-of-speech vector and entity-knowledge-based vector of the original text,combines the character level intention recognition model,and conducts training and testing on the user intention data set under the real scene of manual annotation.The experimental results show that the improved intention recognition algorithm has a significant improvement in the accuracy and oth⁃er evaluation indexes in the real scene.

作者周权陈永生郭玉臣 ZHOU Quan;CHEN Yong-sheng;GUO Yu-chen(Department of Computer Science and Technology,Tongji University,Shanghai 201804,China)

机构地区同济大学电子与信息工程学院

出处《电脑知识与技术》 2020年第21期28-31,共4页 Computer Knowledge and Technology

关键词意图识别短文本分类多特征融合词嵌入深度学习 Bi-LSTM intent recognition short text classification multi-feature fusion word embedding deep learning Bi-LSTM

分类号 TP391 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献1

1靳小波.文本分类综述[J].自动化博览,2006,23(z1):24-29. 被引量：16

二级参考文献11

1[2]D.D.Lewis,Challenges in Machine Learning for Text Classification.The 9th Annual Conference on Computational Learning Theory.Italy,1996. 被引量：1
2[3]D.D.Lewis,Representation and Learning in Infor mation Retrieval,Doctoral Thesis,1992. 被引量：1
3[4]F.Debole,ESebastiani,An Analysis of the Relative Hardness of Reuters-21578 Subsets.Journal of the American Society for Information Science and Technology,Vol.56,No.6,2005. 被引量：1
4[5]F.Sebastiani,Machine Learning in Automated Text Categorization,ACM Computing Surveys,Vol.34,No.1,March 2002,pp.1-47. 被引量：1
5[6]K.Aas,L.Eikvil,Text Categorisation:A Survey.http://www.nr.no/files/samba/bamg/tm_survey.ps. 被引量：1
6[7]R.E.Schapire,Y.Singer.Improved Boosting Algorithms Using Confidence-Rated Predictions.Machine Learning,Vol.37,No.3,pp.297-336,1999. 被引量：1
7[8](美)Tom M.Mitchell著,曾华军,张银奎,等译.机器学习[M].机械工业出版社,2002. 被引量：1
8[9]Y.Yang,J.Pedersen,A comparative study on feature election in text categorization.International conference on Machine Learning (ICML),1997.http://citeseer.ist.psu.edu/yang97comparative.html. 被引量：1
9[10]Y.Yang,An Evaluation of Statistical Approaches to Text Categorization.Information Retrieval,Vol.1,No.1-2,1999. 被引量：1
10[11]C.J.van Rijsbergen,Information Retrieval.University of Glasgow,1979. 被引量：1

共引文献15

1陈立伟,井志强,葛秘蕾.基于特征项扩展的中文文本分类方法[J].应用科技,2010,37(3):1-4. 被引量：1
2段莹.支持向量机在文本分类中的应用[J].计算机与数字工程,2012,40(7):87-88. 被引量：9
3石文娟,龙舜,云飞.基于背景学习的迭代式文本分类框架[J].计算机工程与应用,2015,51(9):129-134. 被引量：2
4徐凯,陈平华,刘双印.基于AdaBoost-Bayes算法的中文文本分类系统[J].微电子学与计算机,2016,33(6):63-67. 被引量：7
5周源,刘怀兰,杜朋朋,廖岭.基于改进TF-IDF特征提取的文本分类模型研究[J].情报科学,2017,35(5):111-118. 被引量：51
6段国仑,谢钧,郭蕾蕾,王晓莹.一种融合多种信息的Web文档分类方法[J].信息技术与网络安全,2018,37(6):76-79. 被引量：1
7刘测,韩家新.面向新闻文本的分类方法的比较研究[J].智能计算机与应用,2018,8(5):38-41. 被引量：10
8陈北辰.朴素贝叶斯在文本分类中的应用[J].文理导航（教育研究与实践）,2018,0(7):171-171.
9薛峰,胡越,夏帅,许剑东.基于论文标题和摘要的短文本分类研究[J].合肥工业大学学报（自然科学版）,2018,41(10):1343-1349. 被引量：6
10段国仑,谢钧,郭蕾蕾,王晓莹.Web文档分类中TFIDF特征选择算法的改进[J].计算机技术与发展,2019,29(5):49-53. 被引量：4

1张文君.浅谈中职中文口语交际训练[J].商品与质量,2019,0(28):236-236.
2筑后川(翻译).线上挑战[J].摄影之友（影像视觉）,2020(7):76-81.
3姚明悦.新媒体环境下粉丝的跨媒体同人叙事——以《全职高手》同人作品为例[J].新闻研究导刊,2020,11(10):232-234. 被引量：2
4迟海洋,严馨,周枫,徐广义,张磊.基于BERT-BiGRU-Attention的在线健康社区用户意图识别方法[J].河北科技大学学报,2020,41(3):225-232. 被引量：7
5程一芳.基于开放网络知识的信息检索与数据挖掘[J].信息记录材料,2020,21(6):3-5. 被引量：1
6汤洪.《论语》篇章主旨综览——兼谈《论语》的仁学精神[J].文史杂志,2020(4):32-35.
7孙敏,李旸,庄正飞,钱涛.基于BGRU和自注意力机制的情感分析[J].江汉大学学报（自然科学版）,2020,48(4):80-89. 被引量：5
8常莉莉.《局外人》默尔索的个性悲剧[J].文学教育,2020(19):62-63.
9聂玮,曹悦,朱冬雪,朱艺璇,黄林毅.复杂监控背景下基于边缘感知学习网络的行为识别算法[J].计算机应用与软件,2020,37(8):227-232. 被引量：1
10陈玉婵,刘威.基于情感分析的学生评教文本观点抽取与聚类[J].计算机应用,2020,40(S01):113-117. 被引量：15

电脑知识与技术

2020年第21期

浏览历史

内容加载中请稍等...

基于多特征融合的意图识别算法研究

参考文献1

二级参考文献11

共引文献15

相关作者

相关机构

相关主题

浏览历史