三元组可比语料库自动剖析在情报智能处理中的研究与应用

Research on the Application of Automatic Language Profiling in Intelligent Information Processing Based on 3-Tuple Comparable Corpora

原文传递

导出

摘要文章提出的基于三元组可比语料库的自动语言剖析技术扩大了该研究领域的内涵,使其包括面向自然语言处理的应用研究。从工程可实现性考虑,创新性地提出建造三元组可比语料库,利用n-元词串、关键词簇和语义多词表达等自动抽取技术,通过对比中式英语表达,发掘英语本族语言模型,实现改进和发展机器翻译、跨语言信息检索等自然语言处理应用的目标。 The proposed automatic language profiling technologies based on the 3-tuple comparable corpora expand the connotation of this research field to include the natural language processing-oriented application and study.Considering the feasibility of the project,this paper innovatively puts forward the building of the 3-tuple comparable corpora and uses the automatic extraction technologies such as n-grams,keyword clusters and semantic multi-word expression to develop the English native language model by comparing with the Chinese type English expression so as to improve and develop the application of natural language processing such as machine translation and cross-language information retrieval.

作者王毅肖健袁琦宋金平李强

机构地区总后勤部后勤科学研究所中国电子信息产业发展研究院中文信息处理实验室

出处《情报理论与实践》 CSSCI 北大核心 2012年第4期94-98,共5页 Information Studies:Theory & Application

基金解放军总后勤部司令部2011年度后勤科研条件建设项目"军事后勤专业术语库及双语资源库信息处理平台"的阶段性研究成果项目编号:2011-ZHTJ-5031

关键词机器翻译三元组可比语料库自动语言剖析情报智能处理 machine translation 3-tuple comparable corpora automatic language profiling intelligent information processing

分类号 TP391.1 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献11

1HAZEM A.Bilingual lexicon extraction from comparable corpo-ra as metasearch[C]∥Proceedings of the 4th Workshop onBuilding and Using Comparable Corpora,Oregon,USA,2011:112-130. 被引量：1
2EISELE A,XU Jia.Improving machine translation performanceusing comparable corpora[C]∥Proceedings of the 3rd Work-shop on Building and Using Comparable Corpora,Valletta,Malta,2010:57. 被引量：1
3ZWEIGENBAUM P.Introduction[C]∥Proceedings of the4th Workshop on Building and Using Comparable Corpora,Ore-gon,USA,2011:14-17. 被引量：1
4RAYSON P.New trends in corpus linguistics for translationstudies[M]∥Proceedings of Workshop on Corpus Linguistics&Machine Translation Applications,2008. 被引量：1
5胡开宝著..语料库翻译学概论[M].上海:上海交通大学出版社,2011:242.
6夏云,李德凤.可比语料量化比较分析与应用文体翻译[C]∥2008年上海第18届世界翻译大会论文集,上海,2008:173-176. 被引量：1
7CHEN W.A corpus-based approach to modelling explicitation inEnglish-Chinese translation[C]∥Proceedings of XVIII FITWorld Conference,Shanghai,China,2008. 被引量：1
8MCENERY A M,XIAO R Z.Parallel and comparable corpo-ra:what are they up to[M]∥Incorporating Corpora:Trans-lation and the Linguist.Translating Europe.Clevedon,UK:Multilingual Matters,2007. 被引量：1
9RAYSON P,et al.Quantitative analysis of translation revision:contrastive corpus research on native English and Chinese trans-lationese[C]∥Proceedings of XVIII FIT World Conference,Shanghai,China,2008. 被引量：1
10肖健,徐建,徐晓兰,袁琦.英中可比语料库中多词表达自动提取与对齐[J].计算机工程与应用,2010,46(31):130-134. 被引量：12

二级参考文献26

1de Medeiros Caseli H,Villavicencio A,Machado A, et al.Statistically-driven alignment-based multiword expression identification for technical domains[C]//Proeeedings of the Workshop on Multiword Expressions: Identification, Interpretation, Disambiguation, Applications, 2009:1-8. 被引量：1
2Ren Zhixiang,Lu Yajuan, Cao Jie, et al.Improving statistical machine translation using domain bilingual multiword expressions[C]// Proceedings of the Workshop on Multiword Expressions: Identification, Interpretation, Disambiguation, Applications, 2009: 47-54. 被引量：1
3Rayson P, Xiao Jian, Wong A, et al.Quantitative analysis of translation revision: contrastive corpus on native english and chinese translationese[C]//XVIII FIT World Congress, 2008, Shanghai, China, 2008. 被引量：1
4Ramisch C, Schreincr P,Idiart M,et aLAn evaluation of methods for the extraction of multiword expressions[C]//Proccedings of the LREC Workshop Towards a Shared Task for Multiword Expressions, 2008: 50-53. 被引量：1
5Van de Cruys T,Moir'on B V.Semantics-based multiword expression extraction[C]//Proeeedings of the Workshop on A Broader Perspective on Multiword Expressions,2007:25-32. 被引量：1
6Rayson P.Falling foul of multiword expressions[C]//Proceedings of Lancaster University and CCID Joint Workshop on Chinese Multi-Word Expression(MWE) and Machine Translation, 2006: 8-40. 被引量：1
7Piao S S L.MWE and translation[C]//Proceedings of Lancaster University and CCID Joint Workshop on Chinese Multi-Word Expression(MWE) and Machine Translation,2006:53-54. 被引量：1
8Piao S S L, Sun Guangfan, Rayson P, et al.Automatic extraction of Chinese multiword expressions with a statistical tool[C]// Proceedings of the Workshop on Multi-word Expressions in a Multilingual Context,2006:17-24. 被引量：1
9Katz G,Giesbrecht E.Automatic identification of non-compositional multi-word expressions using Latent Semantic Analysis[C]// Proceedings of the Workshop on Multiword Expressions: Identifying and Exploiting Underlying Properties (COLING/ACL' 06) ,2006:12-19. 被引量：1
10Rayson P.Right from the word goidentifying MWE for semantic tagging[EB/OL]. (2005).http : //www.comp.lanes.ac.uk/-paul/ publications/rayson BaalCorpusSig_2005.pdf. 被引量：1

共引文献11

1袁琦,肖健,宋金平,朱姝,万缨,许亮.三元组可比语料库自动剖析技术研究与应用[J].计算机工程与应用,2012,48(16):129-132.
2麦热哈巴.艾力,阿孜古丽.夏力甫,吐尔根.依布拉音.维吾尔语多词表达抽取方法研究[J].计算机工程与应用,2014,50(8):26-30. 被引量：3
3原伟,易绵竹.基于维基百科的俄汉可比语料库构建及可比度计算[J].山东大学学报（理学版）,2017,52(9):1-6. 被引量：3
4原伟.俄汉新闻可比语料库的构建、评估及应用展望[J].解放军外国语学院学报,2017,40(6):113-120. 被引量：8
5唐亮,席耀一,彭波,刘香伟,易绵竹.基于词向量的越汉跨语言事件检索研究[J].中文信息学报,2018,32(3):64-70. 被引量：3
6张嘉伟,刘越莲.基于可比语料库的“悲伤”情绪隐/转喻对比研究——以歌德和李白诗歌为例[J].外语教学,2018,39(4):46-51. 被引量：10
7安亚巍,操晓春,罗顺.面向语料的领域主题词表构建算法[J].计算机科学,2018,45(B06):396-397. 被引量：4
8龚双双,陈钰枫,徐金安,张玉洁.基于网络文本的汉语多词表达抽取方法[J].山东大学学报（理学版）,2018,53(9):40-48. 被引量：5
9丘心颖,陈汉武,陈源,谭立聪,张皓,肖莉娴.融合Self-Attention机制和n-gram卷积核的印尼语复合名词自动识别方法研究[J].湖南工业大学学报,2020,34(3):1-9. 被引量：2
10臧国全,王家振,毕崇武,耿瑞利.政府数据中敏感数据识别与隐私计量研究[J].图书情报工作,2022,66(15):66-75. 被引量：7

1袁琦,肖健,宋金平,朱姝,万缨,许亮.三元组可比语料库自动剖析技术研究与应用[J].计算机工程与应用,2012,48(16):129-132.
2胡小鹏,袁琦,耿鑫辉,朱姝.构建和剖析中英三元组可比语料库[J].计算机工程与应用,2014,50(13):153-157. 被引量：5
3崔建粤.英语与电脑[J].中国电化教育,1998(9):22-24.
4肖健,徐建,徐晓兰,袁琦.英中可比语料库中多词表达自动提取与对齐[J].计算机工程与应用,2010,46(31):130-134. 被引量：12
5徐振芳.美国人生活中常用的精彩句子[J].中学英语之友（新教材高二版）,2009(1):13-13.
6毛荣贵.美文佳译：汉译英：由简入繁的过程（1）[J].英语沙龙（原版阅读）,2012(3):46-47.
7WANG Fang-fang.The Study on the Metaphorical Expressions About the Word Face[J].US-China Foreign Language,2013,11(1):33-39.
8彭媛媛,许建潮.基于xml的Deep Web信息自动抽取技术的研究[J].科技信息,2009(33):85-85.
9周亚.2001—2008年国内元数据自动抽取研究综述[J].科技情报开发与经济,2009,19(23):140-142. 被引量：3
10陈辉.“取”之有道[J].英语自学,2009(2):38-41.

情报理论与实践

2012年第4期

浏览历史

内容加载中请稍等...

三元组可比语料库自动剖析在情报智能处理中的研究与应用

参考文献11

二级参考文献26

共引文献11

相关作者

相关机构

相关主题

浏览历史