期刊文献+

基于强制对齐的层次短语模型过滤和优化 被引量:1

Filtration and Optimization for Hierarchical Phrase-based Model with Forced Alignment
下载PDF
导出
摘要 该文提出一种层次短语模型过滤和优化方法.该方法在采用传统方法训练得到层次短语规则的基础上,通过强制对齐同时构建源语言和目标语言的解析树,从中过滤并抽取对齐的层次短语规则,最后利用这些规则重新估计翻译模型的翻译概率.该方法不需要引入任何语言学知识,适合大规模语料训练模型.在大规模中英翻译评测任务中,采用该方法训练的模型与传统层次短语模型相比,不仅能够过滤50%左右规则,同时获得0.8~1.2BLEU值的提高. This paper proposes an effective method for filtering and optimizing hierarchical phrase-based (HPB) model. After obtaining the original HPB rules with traditional training method, we generate the bilingual derivation trees that represent source and target sentences with forced alignment, and then extract the HPB rules from derivation trees. At last, we re-estimated the probabilities of HPB rules with the extracted rules. This method does not need any linguistic knowledge, and it is suitable for large-scale training corpus. In the large scale Chinese-English translation tasks, our proposed method filters about 50 % of the original HPB rules and improves the translation per- formance ranging from 0.8- 1.2 BLEU on the test sets, comparing to the traditional training method.
出处 《中文信息学报》 CSCD 北大核心 2013年第6期134-138,150,共6页 Journal of Chinese Information Processing
基金 国家高技术研究发展计划(863)资助项目(2011AA01A207)
关键词 统计机器翻译 层次短语 强制对齐 模型训练 statistical machine translation hierarchical phrase-based model forced alignment model training
  • 相关文献

参考文献15

  • 1David Chiang. A hierarchical phrase-based model for statistical machine translation[C]//Proceedings of the 43rd Annual Meeting of the ACL. 2005: 263-270. 被引量:1
  • 2David Chiang. Hierarchical phrase-based translation [J]. Computational Linguistics, 2007, 33(2): 201- 228. 被引量:1
  • 3Philipp Koehn, Franz Joseph Och, Daniel Mareu. Sta- tistical Phrase-Based Translation[C]//Proceedings of the 2003 Conference of the NAACL: HLT. 2003: 48- 54. 被引量:1
  • 4Zhongjun He, Yao Meng, Yajuan L, et al. Reducing smt rule table with monolingual key phrase[C]//Pro- ceedings of the ACL-IJCNLP 2009 Con[erence Short Papers. 2009: 121-124. 被引量:1
  • 5Gonzalo Iglesias, Adri de Gispert, Eduardo R Banga, et al. Rule filtering by pattern for efficient hierarchical translation[C]//Proceedings of the 12th Conference of the EACL. 2009: 380-388. 被引量:1
  • 6Libin Shen, Jinxi Xu, Ralph Weischedel. A new string-to-dependency machine translation algorithm with a target dependency language model[C]//Pro- ceedings of ACL-08: HLT, 2008: 577-585. 被引量:1
  • 7Zhiyang Wang, Yajuan L, Qun Liu, et al. Better fil- tration and augmentation for hierarchical phrase-based translation rules[C]//Proceedings of the ACL 2010 Conference Short Papers. 2010: 142-146. 被引量:1
  • 8Joern Wuebker, Arne Mauser, Hermann Ney. Train- ing phrase translation models with leaving-one-out [C]// Proceedings of the 48th Annual Meeting of the ACL. 2010: 475-484. 被引量:1
  • 9Carmen Heger, Joern Wuebker, David Vilar, et al. A combination of hierarchical systems with forced align- ments from phrase-based systems[C]//Proceeding of the IWSLT. 2010: 291-297. 被引量:1
  • 10Phil Blunsom, Trevor Cohn, Miles Osborne. A dis eriminative latent variable model for statistical ma chine translation[C]//Proeeedings of ACL-08: HLT 2008 : 200-208. 被引量:1

同被引文献4

引证文献1

二级引证文献6

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部