期刊文献+

三元组可比语料库自动剖析在情报智能处理中的研究与应用

Research on the Application of Automatic Language Profiling in Intelligent Information Processing Based on 3-Tuple Comparable Corpora
原文传递
导出
摘要 文章提出的基于三元组可比语料库的自动语言剖析技术扩大了该研究领域的内涵,使其包括面向自然语言处理的应用研究。从工程可实现性考虑,创新性地提出建造三元组可比语料库,利用n-元词串、关键词簇和语义多词表达等自动抽取技术,通过对比中式英语表达,发掘英语本族语言模型,实现改进和发展机器翻译、跨语言信息检索等自然语言处理应用的目标。 The proposed automatic language profiling technologies based on the 3-tuple comparable corpora expand the connotation of this research field to include the natural language processing-oriented application and study.Considering the feasibility of the project,this paper innovatively puts forward the building of the 3-tuple comparable corpora and uses the automatic extraction technologies such as n-grams,keyword clusters and semantic multi-word expression to develop the English native language model by comparing with the Chinese type English expression so as to improve and develop the application of natural language processing such as machine translation and cross-language information retrieval.
出处 《情报理论与实践》 CSSCI 北大核心 2012年第4期94-98,共5页 Information Studies:Theory & Application
基金 解放军总后勤部司令部2011年度后勤科研条件建设项目"军事后勤专业术语库及双语资源库信息处理平台"的阶段性研究成果 项目编号:2011-ZHTJ-5031
关键词 机器翻译 三元组可比语料库 自动语言剖析 情报智能处理 machine translation 3-tuple comparable corpora automatic language profiling intelligent information processing
  • 相关文献

参考文献11

  • 1HAZEM A.Bilingual lexicon extraction from comparable corpo-ra as metasearch[C]∥Proceedings of the 4th Workshop onBuilding and Using Comparable Corpora,Oregon,USA,2011:112-130. 被引量:1
  • 2EISELE A,XU Jia.Improving machine translation performanceusing comparable corpora[C]∥Proceedings of the 3rd Work-shop on Building and Using Comparable Corpora,Valletta,Malta,2010:57. 被引量:1
  • 3ZWEIGENBAUM P.Introduction[C]∥Proceedings of the4th Workshop on Building and Using Comparable Corpora,Ore-gon,USA,2011:14-17. 被引量:1
  • 4RAYSON P.New trends in corpus linguistics for translationstudies[M]∥Proceedings of Workshop on Corpus Linguistics&Machine Translation Applications,2008. 被引量:1
  • 5胡开宝著..语料库翻译学概论[M].上海:上海交通大学出版社,2011:242.
  • 6夏云,李德凤.可比语料量化比较分析与应用文体翻译[C]∥2008年上海第18届世界翻译大会论文集,上海,2008:173-176. 被引量:1
  • 7CHEN W.A corpus-based approach to modelling explicitation inEnglish-Chinese translation[C]∥Proceedings of XVIII FITWorld Conference,Shanghai,China,2008. 被引量:1
  • 8MCENERY A M,XIAO R Z.Parallel and comparable corpo-ra:what are they up to[M]∥Incorporating Corpora:Trans-lation and the Linguist.Translating Europe.Clevedon,UK:Multilingual Matters,2007. 被引量:1
  • 9RAYSON P,et al.Quantitative analysis of translation revision:contrastive corpus research on native English and Chinese trans-lationese[C]∥Proceedings of XVIII FIT World Conference,Shanghai,China,2008. 被引量:1
  • 10肖健,徐建,徐晓兰,袁琦.英中可比语料库中多词表达自动提取与对齐[J].计算机工程与应用,2010,46(31):130-134. 被引量:12

二级参考文献26

  • 1de Medeiros Caseli H,Villavicencio A,Machado A, et al.Statistically-driven alignment-based multiword expression identification for technical domains[C]//Proeeedings of the Workshop on Multiword Expressions: Identification, Interpretation, Disambiguation, Applications, 2009:1-8. 被引量:1
  • 2Ren Zhixiang,Lu Yajuan, Cao Jie, et al.Improving statistical machine translation using domain bilingual multiword expressions[C]// Proceedings of the Workshop on Multiword Expressions: Identification, Interpretation, Disambiguation, Applications, 2009: 47-54. 被引量:1
  • 3Rayson P, Xiao Jian, Wong A, et al.Quantitative analysis of translation revision: contrastive corpus on native english and chinese translationese[C]//XVIII FIT World Congress, 2008, Shanghai, China, 2008. 被引量:1
  • 4Ramisch C, Schreincr P,Idiart M,et aLAn evaluation of methods for the extraction of multiword expressions[C]//Proccedings of the LREC Workshop Towards a Shared Task for Multiword Expressions, 2008: 50-53. 被引量:1
  • 5Van de Cruys T,Moir'on B V.Semantics-based multiword expression extraction[C]//Proeeedings of the Workshop on A Broader Perspective on Multiword Expressions,2007:25-32. 被引量:1
  • 6Rayson P.Falling foul of multiword expressions[C]//Proceedings of Lancaster University and CCID Joint Workshop on Chinese Multi-Word Expression(MWE) and Machine Translation, 2006: 8-40. 被引量:1
  • 7Piao S S L.MWE and translation[C]//Proceedings of Lancaster University and CCID Joint Workshop on Chinese Multi-Word Expression(MWE) and Machine Translation,2006:53-54. 被引量:1
  • 8Piao S S L, Sun Guangfan, Rayson P, et al.Automatic extraction of Chinese multiword expressions with a statistical tool[C]// Proceedings of the Workshop on Multi-word Expressions in a Multilingual Context,2006:17-24. 被引量:1
  • 9Katz G,Giesbrecht E.Automatic identification of non-compositional multi-word expressions using Latent Semantic Analysis[C]// Proceedings of the Workshop on Multiword Expressions: Identifying and Exploiting Underlying Properties (COLING/ACL' 06) ,2006:12-19. 被引量:1
  • 10Rayson P.Right from the word goidentifying MWE for semantic tagging[EB/OL]. (2005).http : //www.comp.lanes.ac.uk/-paul/ publications/rayson BaalCorpusSig_2005.pdf. 被引量:1

共引文献11

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部