期刊文献+

English Speech Recognition System on Chip

English Speech Recognition System on Chip
原文传递
导出
摘要 An English speech recognition system was implemented on a chip, called speech system-on-chip (SoC). The SoC included an application specific integrated circuit with a vector accelerator to improve performance. The sub-word model based on a continuous density hidden Markov model recognition algorithm ran on a very cheap speech chip. The algorithm was a two-stage fixed-width beam-search baseline system with a variable beam-width pruning strategy and a frame-synchronous word-level pruning strategy to significantly reduce the recognition time. Tests show that this method reduces the recognition time nearly 6 fold and the memory size nearly 2 fold compared to the original system, with less than 1% accuracy degradation for a 600 word recognition task and recognition accuracy rate of about 98%. An English speech recognition system was implemented on a chip, called speech system-on-chip (SoC). The SoC included an application specific integrated circuit with a vector accelerator to improve performance. The sub-word model based on a continuous density hidden Markov model recognition algorithm ran on a very cheap speech chip. The algorithm was a two-stage fixed-width beam-search baseline system with a variable beam-width pruning strategy and a frame-synchronous word-level pruning strategy to significantly reduce the recognition time. Tests show that this method reduces the recognition time nearly 6 fold and the memory size nearly 2 fold compared to the original system, with less than 1% accuracy degradation for a 600 word recognition task and recognition accuracy rate of about 98%.
出处 《Tsinghua Science and Technology》 SCIE EI CAS 2011年第1期95-99,共5页 清华大学学报(自然科学版(英文版)
基金 Supported by the National Natural Science Foundation of China and Microsoft Research Asia(No. 60776800) the National Natural Science Foundation of China and Research Grants Council (No.60931160443) the National High-Tech Research and Development (863) Program of China(Nos. 2006AA010101,2007AA04Z223,2008AA02Z414,and 2008AA040201)
关键词 non-specific human voice-consciousness SYSTEM-ON-CHIP mel-frequency cepstral coefficients (MFCC) non-specific human voice-consciousness system-on-chip mel-frequency cepstral coefficients (MFCC)
  • 相关文献

参考文献14

  • 1朱璇,陈一宁,刘加,刘润生.语音识别片上系统中的多级搜索算法[J].电子学报,2004,32(1):150-153. 被引量:11
  • 2Yang Haijie,Yao Jing,Liu Jia.A novel speech recognition system-on-chip. International Conference on Audio, Language and Image Processing 2008 (ICALIP 2008) . 2008 被引量:1
  • 3Novuk M,Humpl R,Krbec P,et al.Two-pass search strategy for large list recognition on embedded speech recognition platforms. ICCASP’04 . 2004 被引量:1
  • 4Levy C,Linares G,Nocera P,et al.Reducing computation and memory cost for cellular phone embedded speech rec-ognition system. Proceedings of the ICASSP . 2004 被引量:1
  • 5Xu Haiyang,Fu Yan.Embedded Technology and Applica-tions. . 2002 被引量:1
  • 6Hoon C,Jeon P,Yun L,et al.Fast speech recognition to access a very large list of items on embedded devices. IEEE Transactions on Consumer Electronics . 2008 被引量:1
  • 7Westall F.Review of speech technologies for telecommu-nications. Electronics and Communication Engineering Journal . 1997 被引量:1
  • 8Guo Bing,Shen Yan.SoC Technology and Its Application. . 2006 被引量:1
  • 9Novak M,Hampl R,Krbec P,et al.Two-pass search strategy for large list recognition on embedded speech recognition platforms. Proceedings of ICASSP . 2003 被引量:1
  • 10Q. Huo,C. Chan,C. H. Lee.Bayesian adaptive learning of the parameters of hidden Markov model for speech recognition. IEEE Transactions on Speech and Audio Processing . 1995 被引量:1

共引文献10

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部