期刊文献+

基于时间戳的多文档自动文摘 被引量:3

Automatic Multidocument Summarization Based on Time Stamp
下载PDF
导出
摘要 网站的新闻专题往往包含大量的网页,多文档自动文摘可以帮助人们从中快速获取主要信息。该文提出了利用时间戳改善文摘句子抽取质量和排序的方法。介绍了句子抽取方法、句子重要度计算、句子冗余减小方法。实验表明,形成的文摘性能良好,可以应用于实际系统中。 News special topic in Web site has plentiful pages. People can get main information rapidly by automatic multidocument summarization. A method which uses time stamp to improve sentence extraction quality is presented. The method of news sentence extraction, sentence importance calculation, and redundancy reducing is introduced. Experimental results show that summarization is good enough for practical application.
出处 《计算机工程》 CAS CSCD 北大核心 2007年第16期164-165,共2页 Computer Engineering
关键词 多文档自动文摘 时间戳 信息抽取 句子相似度 automatic multidocument summarization time stamp information extraction sentence similarity
  • 相关文献

参考文献4

  • 1秦兵,刘挺,李生.多文档自动文摘综述[J].中文信息学报,2005,19(6):13-20. 被引量:51
  • 2Barzilay R,Elhadad N,McKeown K R.Sentence Ordering in Multidocument Summarization[C]//Proceedings of the 1st Human Language Technology Conference,San Diego,California.2001:32. 被引量:1
  • 3Hatzivassiloglou V,Klavans J L,Holcombe M L,et al.Simfinder:A Flexible Clustering Tool for Summarization[C]//Proc.of the NAACL Workshop on Automatic Summarization.2001:4-14. 被引量:1
  • 4Witbrock J,Mittal V O.Ultra-summarization:A Statistical Approach to Generating Highly Condensed Non-extractive Summaries[C] //Proceedings of the 22th International Conference on Research and Developmentin Information Retrieval,Berkeley,California.1999:13-20. 被引量:1

二级参考文献25

  • 1穗志方 俞士汶.基于骨架依存树的语句相似度计算模型[A]..中文信息处理国际会议论文集(ICCIP''98)[C].北京:清华大学出版社,1998.458-465. 被引量:6
  • 2Over, P and J. Yen. 2003. An Introduction to DUC 2003 - Intrinstic Evaluation of Generic News Text Summatization Systems. http :/www. nlpir, nist. gov/projeets/due/pubs/2003 slides/due2003 intro, pdf. 被引量:1
  • 3Saggion H., D. Radev, S. Teufel, and W. Lmn. 2002. Meta-Evaluation of Summarization in a cross-Lingual Environment Using-Based Metrics. In: Proceedings of COLING - 2002, Taipei. 被引量:1
  • 4Michael White, Tanya Korelsky, Claire Cardie, Vincent Ng, David Pierce and Kiri Wagstaff. Multidocument Summarizatien via Information Extraction[A]. In: Proceedings of the First International Conference on Human Language Technology Research[ C ]. 1998 : 36 - 44. 被引量:1
  • 5Minghui Wang and Hediheko Tanaka. Summarization of Multiple Chinese Technical Articles[A]. In: The First International Conference on Information[C]. Fukuoka, Japan. 2002:16- 19. 被引量:1
  • 6.[EB/OL].http://www-nlpir, nist. gov/projects/duc/index. html.,. 被引量:1
  • 7Chin-Yew Lin, Eduard Hovy. From Single to Multi-document Summarization: A Prototype System and its Evaluation[A]. In Proceeding of the 4Oth Anniversary Meeting of the Association for Computational Linguistics (ACL- 02)[ C ], Philadelphia, USA, 2002:25 - 34. 被引量:1
  • 8Katldeen B. McKeown, Regina Barzilay, David Kirk Evans, etal. Tracking and summarizing news on a daily basis with columbia's newsblaster[ A]. In Proceedings of the Human Language Technology Conference.2002[ C]. 被引量:1
  • 9Dragomir R. Radev, Kathleen R. McKeovwn. Generating Natural Languages Summaries from Multiple On-Line Sources[J]. Computational Linguistics. 1998, 24(3) :21 - 29. 被引量:1
  • 10Sanda Harabagiu and Steven Maiorano. Multi-Document Summarization with GISTexter[A] .In: Proceedings of the Third LREC Conference 2002 (LREC 2002)[ C ]. June 2002, Canary Islands, Spain. 被引量:1

共引文献50

同被引文献27

  • 1秦兵,刘挺,李生.多文档自动文摘综述[J].中文信息学报,2005,19(6):13-20. 被引量:51
  • 2张其文,李明.多文档文摘提取方法的研究[J].兰州理工大学学报,2007,33(1):96-99. 被引量:4
  • 3Luhn H E The Automatic Creation of Literature Abstracts[J]. IBM Journal of Research and Development, 1958, 2(2): 159-165. 被引量:1
  • 4Conory J M. Text Summarization via Hidden Markov Models and Pivoted QR Matrix Decomposition[Z]. 2001. 被引量:1
  • 5Lin Chin-Yew, Hovy E H. Automatic Evaluation of Summaries Using N-gram Co-occurrence Statistics[C]//Proc. of 2003 Conference on the North American Chapter of the Association for Computational Linguistics on Human Language Technology. [S. l.]: IEEE Press, 2003: 150-156. 被引量:1
  • 6Dean J, Ghemawat S.MapReduce: simplified data processing on large clusters[C]//Proc of the 6th Symposium on Operating Systems Design and Implementation.San Francisco:Google Inc,2004. 被引量:1
  • 7Ghemawat S,Gobioff H,Leung S T.The Google file system[C]// Proceedings of the 19th ACM Symposium on Operating Systems Principles, Bolton Landing, ACM, 2003 : 29-43. 被引量:1
  • 8Chang F, Dean J, Ghemawat S, et al.Bigtable: a distributed structured data storage system[C]//7th OSDI,2006:305-314. 被引量:1
  • 9哈罗德.博科,查尔斯.L.贝尼埃.文摘的概念与方法[M].北京:书目文献出版社,1991. 被引量:1
  • 10Radev D R, Jing Hongyan, Stys M, et al.Centroid-based sm-camrizafien of multiple doeuments[J].Informafion Processing and Management, 2004,40: 919-938. 被引量:1

引证文献3

二级引证文献3

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部