期刊文献+

基于词向量包的自动文摘方法 被引量:5

Automatic Summarization Based on Bag of Word Vector
下载PDF
导出
摘要 [目的]利用向量空间描述语义信息,研究基于词向量包的自动文摘方法;[方法]文摘是文献内容缩短的精确表达;而词向量包可以在同一个向量空间下表示词、短语、句子、段落和篇章,其空间距离用于反映语义相似度。提出一种基于词向量包的自动文摘方法,用词向量包的表示距离衡量句子与整篇文献的语义相似度,将与文献语义相似的句子抽取出来最终形成文摘;[结果]在DUC01数据集上,实验结果表明,该方法能够生成高质量的文摘,结果明显优于其它方法;[结论]实验证明该方法明显提升了自动文摘的性能。 [Purposes] This work focused on automatic summarization by utilizing vector space to describe the semantics. [Methods] proposed a new representation based on word vector,which is called bag of word vector( BOWV),and employed it for automatic summarization. Words,phrases,sentences,paragraphs and documents could be represented in a same vector space by using BOWV. And the distance between representations was used to reflect the semantic similarity. For automatic summarization,the paper used the distance between BOWVs to measure the semantic similarity between sentences and document. The sentences similar with the document are extracted to form the summary. [Findings]Experimental results on DUC01 dataset showed that the proposed method could generate high- quality summary and outperforms comparison methods. [Conclusions] The experiment showed that this research improved the performance of automatic summarization significantly.
出处 《现代情报》 CSSCI 北大核心 2017年第2期8-13,共6页 Journal of Modern Information
基金 国家自然基金项目"基于领域本体的蒙古文数字资源整合机制研究"(项目编号:71163029)
关键词 词向量 词包向量 自动文摘 vector bag of word vector automatic summarization
  • 相关文献

参考文献1

二级参考文献73

  • 1Luhn H P. The automatic creation of literature abstracts[J]. IBM Journal of Research and Development, 1958, 2(2): 159-165. 被引量:1
  • 2Mani I, Maybury M T. Advances in automatic text summarization[M]. Cambridge: MIT Press, 1999. 被引量:1
  • 3Mani I, Bloedorn E. Machine learning of generic and user-focused summarization[C]//Proceedings of the Fifteenth National Conference on Artificial Intelligence.Reston VA:AAAI Press, 1998: 821-826. 被引量:1
  • 4Mitchell T M. Machine learning[M]. Burr Ridge: McGraw Hill, 1997:45. 被引量:1
  • 5Jones K S. Automatic summarizing:Factors and directions[C]//Advances in Automatic Text Summarization. Cambridge: MIT Press,1999:1-12. 被引量:1
  • 6Hovy E, Marcu D. Automated text summarization[C]//The Oxford Handbook of Computational Linguistics. USA: Oxford University Press,2005:583-598. 被引量:1
  • 7Baxendale P B. Machine-made index for technical literature:An experiment[J]. IBM Journal of Research and Development, 1958, 2(4): 354-361. 被引量:1
  • 8Edmundson H P. New methods in automatic extracting[J]. Journal of the ACM (JACM), 1969, 16(2): 264-285. 被引量:1
  • 9Ramezania M, Feizi-Derakhshi M. Automated text summarization:An overview[J]. Applied Artificial Intelligence:An International Journal,2014, 28(2):178-215. 被引量:1
  • 10Gong Yihong, Liu Xin. Generic text summarization using relevance measure and latent semantic analysis[C]//Proceedings of the 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. New York:ACM, 2001: 19-25. 被引量:1

共引文献15

同被引文献63

引证文献5

二级引证文献50

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部