期刊文献+

基于CFN的相似度计算在实例机器翻译中的应用 被引量:1

Application of Similarity Computation Based on CFN to Example-based Machine Translation
下载PDF
导出
摘要 在信息检索,文本挖掘以及基于实例的机器翻译中,相似度计算都是一个关键问题。在实例机器翻译中,相似度计算一般是基于字符、词的匹配以及向量空间模型,但基于句子语义结构的相似度研究还不多见。借助了汉语框架语义网(Chinese FrameNet,简称CFN)的场景语义描述优势,提出了一种新的面向EBMT进行实例相似度计算的方法。该方法主要基于CFN从句子整体结构相似和各语义块内部相似两个角度来度量句子相似度,将这两部分的相似度结果进行凸组合作为待翻译句子与候选实例之间的相似度值。实验结果表明,与传统方法相比,所提出的这种方法是有效的。 In the information retrieval,text mining,as well as Example based Machine Translation,Similarity calculation is a key issue.In the Example based Machine Translation,General similarity calculation is based on the characters,word matching,and vector space model.However,the study of the similarity based on the semantic structure of sentences is still rare.In this paper, with the semantic description advantage of Chinese FrameNet,We proposed a new method of similarity calculation oriented EBMT.This method is mainly based on the CFN from the overall structure of the sentence and the internal of the semantic block to measure the similarity between two sentences,then the convex combination of the results of these two Similarities is considered to be the similarity between Sentence to be translated and the candidate example.The experimental results show that compared with traditional methods the method proposed in this paper is effective.
作者 杨立波
出处 《电脑开发与应用》 2011年第6期58-60,共3页 Computer Development & Applications
关键词 相似度 实例机器翻译 汉语框架网 框架语义 similarity example based machine translation chinese frameNet frame semantic
  • 相关文献

参考文献5

  • 1Yuan S T, 0002 J S. Ontology-based Structured Cosine Similarity in Document Summarization : With Applications to Mobile Audio-based Knowledge Management [J]. IEEE Transactions on Systems, Man, and Cybernetics, Part B, 2005, 35 (5), 1028- 1040. 被引量:1
  • 2Maedche A, Staab S. Measuring Similarity Between Ontologies[Z]. EKAW, 2002 : 251-263. 被引量:1
  • 3Vitanyi M L C L. The Similarity Metric[J].IEEE Transactions on Information Theory, 2004, 50 (12) : 3250-3264. 被引量:1
  • 4荀恩东,颜伟.基于语义网计算英语词语相似度[J].情报学报,2006,25(1):43-48. 被引量:41
  • 5车万翔,刘挺,秦兵,等.面向双语句对检索的汉语句子相似度计算[C]//全国第七届计算语言学联合学术会议论文集.北京:清华大学出版社,2003:81-88. 被引量:6

二级参考文献10

  • 1George A.Miller,Richard Beckwith,Christiane Fellbaum,Derek Gross,and Katherine Miller.Introduction to WordNet:An On-line Lexical Database[EB].Cognitive Science Laboratory,Princeton University,1993.51 ~ 57 被引量:1
  • 2关毅,王晓龙.基于统计的汉语词汇间语义相似度计算.语言计算与基于内容的文本处理,清华大学出版社,2003.221~227 被引量:2
  • 3Rada R.etc.Development and application of a metric on semantic nets.IEEE Transactions on System,Man and Cybernetics,1989 被引量:1
  • 4Lee J.H.etc.Information retrieval based on conceptual distance in ISA hierarchies.Journal of Documentation,1993(49) 被引量:1
  • 5Agirre E.and Rigau G..A proposal for word sense disambiguation using conceptual distance.In:International Conference "Recent Advances in Natural Language Processing"RANLP'95,Tzigov Chark,Bulgaria,1995.91 ~ 98 被引量:1
  • 6P.Brown etc.Word sense disambiguation using tactical methods.In:Proceedings of 29th Meeting of the Association for Computational Linguistics (ACL-91),1991.201 ~ 207 被引量:1
  • 7Lillian Lee.Similarity-Based Approaches to Natural Language Processing:[Ph.D.Thesis].Harvard University Technical Report TR-11-97 被引量:1
  • 8刘群 李素建.基于《知网》的词汇语义相似度计算[A]..Computational Linguistics and Language Processing[C].,2002.7.2:59-76. 被引量:11
  • 9于江生,俞士汶.中文概念词典的结构[J].中文信息学报,2002,16(4):12-20. 被引量:67
  • 10胡俊峰,俞士汶.唐宋诗中词汇语义相似度的统计分析及应用[J].中文信息学报,2002,16(4):39-44. 被引量:43

共引文献45

同被引文献2

引证文献1

二级引证文献5

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部