期刊文献+

基于多层过滤的统计机器翻译 被引量:3

Multi-Layer Filtering Based Statistical Machine Translation
下载PDF
导出
摘要 本文提出了一种基于多层过滤的算法。该算法主要实现从对齐的中英文句子中自动的抽取与对齐双语语块。根据不同语块具备的不同特性,采用不同的层次对其处理。该算法不同于传统的算法,它不需要对句子进行标注,句法分析,词法分析甚至不需要对汉语句子进行分词等操作。初步的实验结果表明该算法性能较好,测试的结果是:抽取语块的准确率能达到F =0 70 ,对齐语块的准确率能达到F =0 80 ;而且将此算法获得的对齐双语语块用于统计机器翻译系统,跟基于词的系统做对比,结果表明基于语块的翻译系统明显提高了翻译水平,差不多能提高10 %。 In this paper we propose a new algorithm called multi-layer filtering to extract the bilingual alignment chunks automatically from Chinese-English parallel texts. Various layers are used to extract bilingual chunks according to different features possessed by different chunks in the bilingual corpus. Our chunking and alignment algorithm does not rely on the information from tagging, parsing or syntax analyzing as most conventional algorithms do. The preliminary experimental results express that our algorithm achieves a good performance in chunking and alignment. The F-measure of chunking is 0.7 and the F-measure of alignment is 0.8. Moreover, the translations generated by this algorithm are much better than the results generated by the baseline word alignment algorithm; it almost improves of 10%.
出处 《中文信息学报》 CSCD 北大核心 2005年第3期54-60,共7页 Journal of Chinese Information Processing
基金 国家自然科学基金资助项目 (6 0 2 72 0 4 1 6 0 12 130 2 )
关键词 人工智能 机器翻译 多层过滤 双语语块识别与对齐 artificial intelligence machine translation chunking and alignment multi-layer-filtering
  • 相关文献

参考文献9

  • 1Brown P. F., et al., The Mathematics of Statistical Machine Translation: Parameter Estimation [ J], Computational Linguistics, 1993, 19(2): 263-311. 被引量:1
  • 2Vogel, S., H. Ney, and C. Tillman. 1996. HMM-Based Word Alignment in Statistical Translation [A]. In:Proceedings of the Seventeenth International Conference on Computational Linguistics: COLING-96 [ C ], 836 - 841,Copenhagen, Denmark. 被引量:1
  • 3Wang Y. Y., Grammar Inference and Statistical Machine Translation [D], PhD thesis, School of Computer Science,Carnegie Mellon University, Pittsburgh, PA, 1998. 被引量:1
  • 4Och. F. J., Tillmann C. and Ney H., Improved alignment models for statistical machine translation [ A], In:Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP) [ C ], 20 - 28, College Park, Maryland, USA, 1999. 被引量:1
  • 5Vogel S., et al.., The CMU statistical machine translation system [ A], In: Proceedings of MT Summit [ C ],110 - 117, New Orleans, Louisiana, September, 2003. 被引量:1
  • 6Yamada K. and Knignt K., A Syntax-based Statistical Translation Model [ A ], Annual Meeting of the Ass. for Computational Linguistics [C], Toulouse, France, 523-530, July 2001. 被引量:1
  • 7Andrew Roberts. Automatic Acquisition of Word Classification Using Distribution Analysis of Content Words with Respect to Function Words [ J]. Computer Science, 2000. 被引量:1
  • 8K. Papineni, et al. BLEU: a method for automatic evaluation of MT [R]. Research Report, Computer Science RC22176 (W0109022), September 17, 2001, IBM, 2001. 被引量:1
  • 9Doddington. Automatic Evaluation of Machine Translation Quality Using N-gram Co-Occurrence Statistics [ R].NIST Rearch Report, 2002. 被引量:1

同被引文献15

引证文献3

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部