期刊文献+

基于汉语语音音位的HMM建模方法 被引量:1

HMM modeling based on mandarin phonemes in embedded systems
原文传递
导出
摘要 为了减少声学模型复杂度、降低对嵌入式系统的硬件资源需求,提出了为汉语全音节的声母、韵首、韵腹、韵尾4部分音位分别建立隐含Markov模型的新方法。基于汉语语音学的音位知识,并结合4部分音位方案比较实验,最终确定声母、韵首、韵腹、韵尾4部分音位模型总数分别为76、12、76、14,对应的4部分的模型状态数分别为4、1、4、2。同采用声母、韵母2部分建立的半音节隐含M arkov模型相比,新系统中模型数、状态数减少了30.2%、36.5%,同时关键词识别率提高1.32%。 A method of acoustic model design was developed for Hidden Markov Models to reduce the complexity of the acoustic models and lower the hardware requirements in embedded systems. The method separately models each initial, glide, nucleus, and coda phoneme. The model numbers of these four parts were 76, 12, 76, and 14, and thestate numbers of each model of these four parts were 4, 1, 4, and 2 in the final system based on the knowledge of mandarin phonemes and the results of scheme comparison tests. The total number of models was reduced by 30.2% with the number of states was reduced by 36.5%. The keyword detection accuracy was improved by 1.32% compared with the method of modeling each initial and final semi-syllable.
作者 何珏 刘加
出处 《清华大学学报(自然科学版)》 EI CAS CSCD 北大核心 2007年第4期518-521,共4页 Journal of Tsinghua University(Science and Technology)
基金 国家自然科学基金资助项目(60272016)
关键词 声学模型 隐含Markov模型 语音识别 acoustic model hidden Markov model speech recognition
  • 相关文献

参考文献8

  • 1刘叔新主编..现代汉语理论教程[M].北京:高等教育出版社,2002:501.
  • 2Rose R,Paul D.A hidden Markov model based keyword recognition system[C]∥ ICASSP.Albuquerque,1990:129-132. 被引量:1
  • 3Lee C-H,Rabiner L,Pieraccini R,et al.Acoustic modeling for large vocabulary speech recognition[J].Computer Speech and Language,1990,4(2):127-165. 被引量:1
  • 4李净,郑方,张继勇,吴文虎.汉语连续语音识别中上下文相关的声韵母建模[J].清华大学学报(自然科学版),2004,44(1):61-64. 被引量:18
  • 5邵敬敏..现代汉语通论[M]..上海:上海教育出版社,,1991,4..55.. 被引量:1
  • 6Yong S,Kershaw D,Odell J,et al.The HTK Book[EB/OL].2002.http://htk.eng.cam.ac.uk. 被引量:1
  • 7中国国家对外汉语教学领导小组办公室-汉语语音与语音教学韵母的分类[EB/OL].2005.厦门大学海外教育学院.http://oec.xmu.edu.cn/yuyin/03/03-01.htm 被引量:1
  • 8LIU Chimin,CHIU Chinchih,CHANG Hungyuan.Design of vocabulary-independent mandarin keyword spotters[J].IEEE Transactions on Speech and Audio Processing,2000,8(4):483-487. 被引量:1

二级参考文献8

  • 1Lee C-H, Rabiner L, Pieraccini R, et al. Acoustic modeling for large vocabulary speech recognition [J]. Computer Speech and Language, 1990, 4(2): 127-165. 被引量:1
  • 2Young S J, Woodland P C. Tree-based state tying for high accuracy acoustic modeling [A]. Proc ARPA Human Language Tech Workshop [C]. Plainsboro, NJ: Morgan Kaufmann Publisher, 1994, 307-312. 被引量:1
  • 3Reichl W, Chou W. Decision trees state tying based on segmental clustering for acoustic modeling [A]. Proc Int Conf Acoustics, Speech, Signal Processing'98 [C]. Seattle, Washington: IEEE Press, 1998. 801-804. 被引量:1
  • 4Reichl W, Chou W. Robust decision tree state tying for continuous speech recognition [J]. IEEE Trans Speech and Audio Proc, 2000, 8(5): 555-566. 被引量:1
  • 5曹剑芬.现代语音基础知识 [M].北京: 人民教育出版社,1990.. 被引量:1
  • 6ZHENG Fang, SONG Zhanjiang, XU Mingxing. EASYTALK: A large-vocabulary speaker-independent Chinese dictation machine [A]. EuroSpeech '99 [C]. Budapest, Hungary: ISCA, 1999, 819-822. 被引量:1
  • 7Yong S, Kershaw D, Odell J, et al. The HTK Book [EB/OL]. http://htk.eng.cam.ac.uk, 2002. 被引量:1
  • 8郑方,牟晓隆,徐明星,武健,宋战江.汉语语音听写机技术的研究与实现[J].软件学报,1999,10(4):436-444. 被引量:6

共引文献17

同被引文献9

引证文献1

二级引证文献6

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部