期刊文献+

基于挖掘Web双语词汇关联度的无指导译文消歧

Unsupervised translation disambiguation based on mining Web relatedness of bilingual words
下载PDF
导出
摘要 为缓解译文消歧任务中消歧知识获取困难及数据稀疏问题,提出了一种基于Web的挖掘双语词汇相关关系的无指导译文消歧方法。该方法将双语词汇在语料库中的间接相关拓展到Web,提出了基于Web的双语词汇间接相关模型,在此基础上又提出了一种基于Web的双语词汇相关度的消歧方法,通过构造不同queries并利用搜索引擎抽取返回页面的page counts,最后利用点式互信息来计算词汇间的相关度并用于消歧决策。该方法最好性能(P_(mar)=0.464)超过了国际语义评测Semeval-2007的Task #5上可比较的最好无指导系统TorMd。 This paper presents an unsupervised method by mining Web relatedness of bilingual words. It intends to solve the problem of knowledge acquisition and data sparse in translation disambiguation. By introducing an indirect association model of bilingual words first, this paper expands it to bilingual web page. It goes a step further to a b!lingual Web relatedness which centers around Web pages. It computes point-wise mutual information between words as relatedness and makes disambiguation by constructing different queries and extracting Web page counts through search engine. This method achieves the best performance. It outperforms the best unsupervised system TorMd on Semeval-2007 Task # 5 and gets the state-of-the-art results (Pmar = 0.464).
出处 《高技术通讯》 EI CAS CSCD 北大核心 2010年第4期349-354,共6页 Chinese High Technology Letters
基金 973计划(2004CB318102) 国家自然科学基金(60903063) 中国博士后科学基金(20090450007)资助项目
关键词 无指导译文消歧 双语词汇相关 页面计数 间接相关 基于WEB unsupervised translation disambiguation, bilingual word relatedness, page count, indirect association, web based
  • 相关文献

参考文献2

二级参考文献15

  • 1Edmonds P and Cotton S. Senseval-2: Overview. In Proceedings of the Second International Workshop on evaluating Word Sense Disambiguation Systems, Toulouse, France, 2001: 1-5. 被引量:1
  • 2Mihalcea R, Chklovski T, and Killgariff A. The Senseval-3 English lexical sample task. In Proceedings of the Third InternationalWorkshop on the Evaluation of Systems for the Semantic Analysis of Text (Senseval-3). Barcelona, Spain, 2004: 25-28. 被引量:1
  • 3Li Hang and Li Cong. Word translation disambiguation using bilingual bootstrapping. Computational Linguistics, 2004, 20(4): 563-596. 被引量:1
  • 4Yarowsy D. Unsupervised word sense disambiguation rivaling supervised methods.In Proceedings of the 33^rd Annual Meeting of Association for Computational Linguistics (ACL 1995)(Cambridge, MA, June 1995): 189-196. 被引量:1
  • 5Gale W A, Church K W, and Yarowsky D. Using bilingual materials to develop word sense disambiguation methods. In Proceedings of the International Conference on Theoretical and Methodological Issues in Machine Translation. Montreal1992: 101-112. 被引量:1
  • 6Diab M and Resnik P. An unsupervised method for word sense tagging using parallel corpora. In Proceedings of the 40th Anniversary Meeting of the Association for Computational Linguistics (ACL-02). Philadelphia, USA. 2002: 255-262. 被引量:1
  • 7Ng Hwee Tou, WangBin, and Chan Yee Seng. Exploiting parallel texts for word sense disambiguation: an empirical study. In Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics. Sapporo, Japan. 2003: 465-462. 被引量:1
  • 8Wang X and Carroll J. Word sense disambiguation using sense examples automatically acquired from a second language. In Proceedings of HLT/EMNLP, Vancouver,Canada. 2005: 547-554. 被引量:1
  • 9Leacock C, Chodorow M, and Miller G A. Using corpus statistics and WordNet relations for sense identification. Computational Linguistics, 1998, 24(2): 147-165. 被引量:1
  • 10Agirre E and Martfnez D. Unsupervised WSD based on automatically retrieved examples: The importance of bias. Proceedings of the Conference on Empirical Methods in NLP. Barcelona, Spain. 2004: 25-32. 被引量:1

共引文献7

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部