期刊文献+

基于子字单元的维吾尔语语音识别研究 被引量:5

Research on Uyghur Speech Recognition Based on Subword Unit
下载PDF
导出
摘要 为提高维吾尔语语音识别的识别率,在分析维吾尔语特点的基础上,设计一种基于子字单元的维吾尔语语音识别总体结构,指出维吾尔语单词的发音模型,给出构建子字发音字典的方法,及其以子字单元为基础构建语言模型与声学模型的方法。在一个语音库上进行实验,采用一种非监督的词切分方法对维吾尔语单词进行词切分,生成子字。实验结果表明,基于子字单元的维吾尔语语音识别可以获得更好的识别结果。 To improve on accuracy of Uyghur speech recognition,based on analysis of Uyghur characteristics,the framework of Uyghur speech recognition based on subword is developed for the first time.Pronunciation model of Uyghur word is given.How to build subword pronouncing dictionary,subword language model and acoustic model is described.Experiments are completed on a speech corpus and an unsupervised Uyghur word segmentation method is utilized to produce subwords.Experimental results show that Uyghur speech recognition based on subword can gain better recognition results.
出处 《计算机工程》 CAS CSCD 北大核心 2011年第20期208-210,共3页 Computer Engineering
基金 中国科学院西部行动计划高新技术基金资助项目(KGCX2-YW-507)
关键词 维吾尔语 词切分 子字单元 隐马尔科夫模型 连续语音识别 Uyghur word segmentation subword unit Hidden Markov Model(HMM) continuous speech recognition
  • 相关文献

参考文献6

二级参考文献13

  • 1徐波,史晓东,刘群,宗成庆,庞薇,陈振标,杨振东,魏玮,杜金华,陈毅东,刘洋,熊德意,侯宏旭,何中军.2005统计机器翻译研讨班研究报告[J].中文信息学报,2006,20(5):1-9. 被引量:10
  • 2石现峰,张学智,张峰.基于HTK的语音识别系统设计[J].计算机技术与发展,2006,16(10):37-38. 被引量:23
  • 3BROWN P, COCKE J, PIETRA S, et al. A statistical approach to machine translation[J]. Computational Linguistics, 1990, 16(2):79 -85. 被引量:1
  • 4KOEHN P, OCH F J, MARCU D. Statistical phrase-based translation[ C] // Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language. Morristown, N J: Association for Computational Linguistics, 2003:48 -54. 被引量:1
  • 5OCH F J, NEY H. Discriminative training and maximum entropy models for statistical machine translation[ C]// Proceedings of the 40th Annual Meeting on Association for Computational Linguistics. Morristown, NJ: Association for Computational Linguistics, 2001: 295 - 302. 被引量:1
  • 6STOLKE A. Srilm - An extensible language modeling toolkit [ EB / OL]. [ 2008 - 09 - 20]. http://web, iti. upv. es/-evidal/ students/doct/sht/transp/srlim2p, pdf. 被引量:1
  • 7OCH F J, NEY H, A systematic comparison of various statistical alignment models[ J]. Computational Linguistics, 2003, 29(!) : 19 - 51. 被引量:1
  • 8KOEHN P. Pharaoh: a beam search decoder for phrase-based statistical machine translation models[ EB/OL]. [ 2008 - 08 - 20]. http://www, iccs. inf. ed. ac. uk/- pkoehn/publications/pharaoh - amta2004, ps. 被引量:1
  • 9Gulila·Adongbieke. The Research of Proofreading for the Uighur Character [A],The 2001 IEEE International Conference on System, Man and Cybernetics (SMC2001)[C], 2001.10.7 - . 10.10, Tucson, Arizona ,U.S.A,P874- 876. 被引量:1
  • 10Steve Young, Julian Odell, et all. The HTK Book (for HTK Version g. 2)[R]. Cambridge University Engineering Department. 被引量:1

共引文献62

同被引文献47

  • 1古丽拉.阿东别克,米吉提.阿布力米提.维吾尔语词切分方法初探[J].中文信息学报,2004,18(6):61-65. 被引量:39
  • 2贺苏宁,虞厥邦.几种小训练样本集的数字语音识别模型的比较性研究[J].计算机科学,2005,32(9):170-175. 被引量:1
  • 3李锦,何培宇.一种改进的基于小波去噪HMM非特定人语音识别算法[J].四川大学学报(自然科学版),2007,44(1):69-72. 被引量:12
  • 4蔡琴,吾守尔.斯拉木.基于HTK的维吾尔语连续数字语音识别[J].现代计算机,2007,13(4):14-16. 被引量:7
  • 5Arisoy E, Dutagaci H, Arslan L M. A unified language model for large vocabulary continuous speech recognition of Turkish[J]. Signal Processing, 2006, 86( 10): 2844-2862. 被引量:1
  • 6Tanel A. Phonological and morphological modeling in large vocabulary continuous Estonian speech recognition system [C]//Proceedings of Second Baltic Conference on Human Language Technologies. Tallinn, Estonia, 2005: 89- 94. 被引量:1
  • 7Creutz M, Lagus K. Unsupervised models for morpheme segmentation and morphology learning [J]. ACM Transactions on Speech and Language Processing, 2007, 4(1) : 3 - 36. 被引量:1
  • 8Creutz M, Hirsimfiki T, Kurimo M, et al. Analysis of morph-based speech recognition and the modeling of out of vocabulary words across languages [C]// Proceedings of NAACL HLT. Rochester, NY, USA, 2007: 380-387. 被引量:1
  • 9Hirsimaki T, Pylkkanen J, Kurimo M, et al. Importance of high order N-gram models in morph-based speech recognition [J]. IEEE Tra72sactions on Audio, Speech and Language Processing, 2009, 17(4):724-732. 被引量:1
  • 10Ablimit M, Neubig G, Mimura M, et alo Uyghur morpheme-based language models and ASR [C]// Proc 10th IEEE Conf ICSP. Beiiing, China: IEEE Press, 2010:581 - 584. 被引量:1

引证文献5

二级引证文献24

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部