期刊文献+

An Introduction to the Chinese Speech Recognition Front-End of the NICT/ATR Multi-Lingual Speech Translation System 被引量:3

An Introduction to the Chinese Speech Recognition Front-End of the NICT/ATR Multi-Lingual Speech Translation System
原文传递
导出
摘要 This paper introduces several important features of the Chinese large vocabulary continuous speech recognition system in the NICT/ATR multi-lingual speech-to-speech translation system. The features include: (1) a flexible way to derive an information rich phoneme set based on mutual information between a text corpus and its phoneme set; (2) a hidden Markov network acoustic model and a successive state splitting algorithm to generate its model topology based on a minimum description length criterion; and (3) advanced language modeling using multi-class composite N-grams. These features allow a recognition performance of 90% character accuracy in tourism related dialogue with a real time response speed. This paper introduces several important features of the Chinese large vocabulary continuous speech recognition system in the NICT/ATR multi-lingual speech-to-speech translation system. The features include: (1) a flexible way to derive an information rich phoneme set based on mutual information between a text corpus and its phoneme set; (2) a hidden Markov network acoustic model and a successive state splitting algorithm to generate its model topology based on a minimum description length criterion; and (3) advanced language modeling using multi-class composite N-grams. These features allow a recognition performance of 90% character accuracy in tourism related dialogue with a real time response speed.
出处 《Tsinghua Science and Technology》 SCIE EI CAS 2008年第4期545-552,共8页 清华大学学报(自然科学版(英文版)
关键词 Chinese speech recognition mutual information phoneme set design hidden Markov network minimum description length successive state splitting multi-class composite N-grams Chinese speech recognition mutual information phoneme set design hidden Markov network minimum description length successive state splitting multi-class composite N-grams
  • 相关文献

参考文献10

  • 1Zhang Jinsong,Hu Xinhui,Nakamura Satoshi.Automatic derivation of a phoneme set with tone information for Chi- nese speech recognition based on mutual information crite- rion[].Proceedings of International Conference on Acoustics Speech and Signal Processing.2006 被引量:1
  • 2Jitsuhiro T,Matsui T,Nakamura S.Automatic generationof non-uniform context-dependent HMM topologies based on the MDL criterion[].Proceedings of Eighth European Conference on Speech Communication and Technology.2003 被引量:1
  • 3Chen C,Gopinath R,Monkowski M, et al.New methods in continuous Mandarin recognition[].Proceedings of Fifth European Conference on Speech Communication and Technology.1997 被引量:1
  • 4Huang C,Shi Y,Zhou J, et al.Segmental tonal modeling for phone set design in Mandarin LVCSR[].Proceedings of International Conference on Acoustics Speech and Sig- nal Processing.2004 被引量:1
  • 5Seide F,Wang N.Phonetic modeling in the Philips Chinese continuous-speech recognition system[].Proceedings of International Symposium on Chinese Spoken Language Processing.1998 被引量:1
  • 6Chen C,Li H,Shen L, et al.Recognize tone languages us- ing pitch information on the main vowel of each syllable[].Proceedings of International Conference on Acoustics Speech and Signal Processing.2001 被引量:1
  • 7Zhang J,Zheng F,Li J, et al.Improved context-dependent acoustic modeling for continuous Chinese speech recogni- tion[].Proceedings of Seventh European Conference on Speech Communication and Technology.2001 被引量:1
  • 8Zhang Jinsong,Hirose Keikichi,Nakamura Satoshi.Is tone recognition necessary for Chinese speech recognition?[].Proceedings of Fall Meeting of the Acoustical Society of Japan.2002 被引量:1
  • 9Hu Xinhui,Yamamoto Hirofumi,Kikui Gen.Evaluation of Chinese morphological corpus and language models[].IPSJ (Information Processing Society of Japan) Technical report SLP.2005 被引量:1
  • 10Young S,Evermann G,Gales M, et al.HTK Speech Rec- ognition Toolkit ver. 3.4. Cambridge University. http://htk. eng.cam.ac.uk/ . 被引量:1

同被引文献13

引证文献3

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部