期刊文献+

word2vec-ACV:OOV语境含义的词向量生成模型 被引量:7

word2vec-ACV: word vector generation model of OOV context meaning
下载PDF
导出
摘要 针对word2vec模型生成的词向量缺乏语境的多义性以及无法创建集外词(OOV)词向量的问题,引入相似信息与word2vec模型相结合,提出word2vec-ACV模型。该模型首先基于连续词袋(CBOW)和Hierarchical softmax的word2vec模型训练出词向量矩阵即权重矩阵;然后将共现矩阵进行归一化处理得到平均上下文词向量,再将词向量组成平均上下文词向量矩阵;最后将平均上下文词向量矩阵与权重矩阵相乘得到词向量矩阵。为了能同时解决集外词及多义性问题,将平均上下文词向量分为全局平均上下文词向量(global ACV)和局部平均上下文词向量(local ACV)两种,并对两者取权值组成新的平均上下文词向量矩阵,并将word2vec-ACV模型和word2vec模型分别进行类比任务实验和命名实体识别任务实验。实验结果表明,word2vec-ACV模型同时解决了语境多义性以及创建集外词词向量的问题,降低了时间消耗,提升了词向量表达的准确性和对海量词汇的处理能力。 The word2vec model is a neural network model (NNLM) that converts words in text into a word vector. It is widely used in natural language processing tasks such as emotional analysis, question-answering robot and so on. Word vectors generated for the word2vec model lacked the ambiguity of context and the inability to create OOV word vectors. Based on the similarity information of document context and word2vec model, this paper proposed a word vector generation model called the word2vec-ACV model which conformed to the meaning of OOV context. The model was similar to the process of the word vector generated by the word2vec model. First of all, base on the continuous word bag (CBOW) and the Hierarchical softmax, the word2vec model trained the word vector matrix, namely the weight matrix. Secondly, normalized the co-occurrence matrix to get the average context word vector. Then, the word vector consisted of an average context word vector matrix. Finally, it multiplied the average context word vector matrix and the weight matrix to get the word vector matrix. In order to simultaneously solve the ambiguity problem of out of vocabulary words and out of vocabulary words to create, this paper divided the average context word vectors into the global average context word vector (global ACV) and the local average context word vector (local ACV). In addition, the two taken the weight value to form a new average context word vector matrix. The word2vec model could effectively express the word in vector form. Experiments on analogical tasks and named entity recognition (NER) tasks respectively, the results show that the word2vec-ACV model is superior to the word2vec model in the accurate expression of the word vector. It is a word vector representation method to create a contextual context for OOV words.
作者 王永贵 郑泽 李玥 Wang Yonggui;Zheng Ze;Li Yue(College of Software,Liaoning Technical University,Huludao Liaoning 125105,China)
出处 《计算机应用研究》 CSCD 北大核心 2019年第6期1623-1628,共6页 Application Research of Computers
基金 国家自然科学基金青年基金资助项目(61404069)
关键词 word2vec模型 词向量 共现矩阵 平均上下文词向量 word2vec model word vector co-occurrence matrix ACV
  • 相关文献

参考文献2

二级参考文献31

  • 1Baeza-Yates R,Ribeiro-Neto B.Modern Information Retrieval[M].New York:ACM press,1999. 被引量:1
  • 2Manning C D,Schütze H.Foundations of Statistical NaturalLanguage Processing [M].Cambridge:MIT press,1999. 被引量:1
  • 3Hwang M,Choi C,Youn B,et al.Word Sense Disambiguation Based on Relation Structure[C]∥International Conference on Advanced Language Processing and Web Information Technology.2008:15-20. 被引量:1
  • 4Wang X,Mccallum A,Wei X.Topical N-Grams:Phrase andTopic Discovery,with an Application to Information Retrieval [C]∥IEEE International Conference on Data Mining.IEEE Computer Society,2007:697-702. 被引量:1
  • 5Haruechaiyasak C,Jitkrittum W,Sangkeettrakarn C,et al.Im-plementing News Article Category Browsing Based on Text Categorization Technique [C]∥2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology.IEEE Computer Society,2008:143-146. 被引量:1
  • 6Mikolov T,Sutskever I,Chen K,et al.Distributed Representations of Words and Phrases and their Compositionality [J].Advances in Neural Information Processing Systems,2013,26:3111-3119. 被引量:1
  • 7Mikolov T,Chen K,Corrado G,et al.Efficient Estimation of Word Representations in Vector Space [C]∥ICLR 2013.2013. 被引量:1
  • 8Joachims T.A Probabilistic Analysis of the Rocchio Algorithm with TFIDF for Text Categorization [M].Springer US,1997:143-151. 被引量:1
  • 9Hinton G E.Learning distributed representations of concepts[C]∥Proceedings of CogSci.1986:1-12. 被引量:1
  • 10Socher R,Bauer J,Manning C D,et al.Parsing with Compositional Vector Grammars [C]∥Meeting of the Association for Computational Linguistics.2013:455-465. 被引量:1

共引文献269

同被引文献43

引证文献7

二级引证文献8

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部