English Speech Recognition System on Chip

English Speech Recognition System on Chip

导出

摘要 An English speech recognition system was implemented on a chip, called speech system-on-chip （SoC）. The SoC included an application specific integrated circuit with a vector accelerator to improve performance. The sub-word model based on a continuous density hidden Markov model recognition algorithm ran on a very cheap speech chip. The algorithm was a two-stage fixed-width beam-search baseline system with a variable beam-width pruning strategy and a frame-synchronous word-level pruning strategy to significantly reduce the recognition time. Tests show that this method reduces the recognition time nearly 6 fold and the memory size nearly 2 fold compared to the original system, with less than 1% accuracy degradation for a 600 word recognition task and recognition accuracy rate of about 98%. An English speech recognition system was implemented on a chip, called speech system-on-chip （SoC）. The SoC included an application specific integrated circuit with a vector accelerator to improve performance. The sub-word model based on a continuous density hidden Markov model recognition algorithm ran on a very cheap speech chip. The algorithm was a two-stage fixed-width beam-search baseline system with a variable beam-width pruning strategy and a frame-synchronous word-level pruning strategy to significantly reduce the recognition time. Tests show that this method reduces the recognition time nearly 6 fold and the memory size nearly 2 fold compared to the original system, with less than 1% accuracy degradation for a 600 word recognition task and recognition accuracy rate of about 98%.

作者刘鸿钱彦旻刘加

机构地区 Tsinghua National Laboratory for Information Science and Technology

出处《Tsinghua Science and Technology》 SCIE EI CAS 2011年第1期95-99,共5页 清华大学学报（自然科学版（英文版）

基金 Supported by the National Natural Science Foundation of China and Microsoft Research Asia(No. 60776800) the National Natural Science Foundation of China and Research Grants Council (No.60931160443) the National High-Tech Research and Development (863) Program of China(Nos. 2006AA010101,2007AA04Z223,2008AA02Z414,and 2008AA040201)

关键词 non-specific human voice-consciousness SYSTEM-ON-CHIP mel-frequency cepstral coefficients （MFCC） non-specific human voice-consciousness system-on-chip mel-frequency cepstral coefficients （MFCC）

分类号 TN912.34 [电子电信—通信与信息系统] TN402 [电子电信—信息与通信工程]

引文网络
相关文献

参考文献14

1朱璇,陈一宁,刘加,刘润生.语音识别片上系统中的多级搜索算法[J].电子学报,2004,32(1):150-153. 被引量：11
2Yang Haijie,Yao Jing,Liu Jia.A novel speech recognition system-on-chip. International Conference on Audio, Language and Image Processing 2008 (ICALIP 2008) . 2008 被引量：1
3Novuk M,Humpl R,Krbec P,et al.Two-pass search strategy for large list recognition on embedded speech recognition platforms. ICCASP’04 . 2004 被引量：1
4Levy C,Linares G,Nocera P,et al.Reducing computation and memory cost for cellular phone embedded speech rec-ognition system. Proceedings of the ICASSP . 2004 被引量：1
5Xu Haiyang,Fu Yan.Embedded Technology and Applica-tions. . 2002 被引量：1
6Hoon C,Jeon P,Yun L,et al.Fast speech recognition to access a very large list of items on embedded devices. IEEE Transactions on Consumer Electronics . 2008 被引量：1
7Westall F.Review of speech technologies for telecommu-nications. Electronics and Communication Engineering Journal . 1997 被引量：1
8Guo Bing,Shen Yan.SoC Technology and Its Application. . 2006 被引量：1
9Novak M,Hampl R,Krbec P,et al.Two-pass search strategy for large list recognition on embedded speech recognition platforms. Proceedings of ICASSP . 2003 被引量：1
10Q. Huo,C. Chan,C. H. Lee.Bayesian adaptive learning of the parameters of hidden Markov model for speech recognition. IEEE Transactions on Speech and Audio Processing . 1995 被引量：1

共引文献10

1宋辉,姚竞,路向峰,刘加.基于TMS320VC5507的语音识别系统实现[J].微计算机信息,2008,24(2):168-170. 被引量：2
2侯昭武,侯波.低信噪比下的语音识别技术[J].广西物理,2009,30(1):24-26.
3丁玉国,刘加,刘润生.嵌入式系统上的实时语音识别算法[J].数据采集与处理,2005,20(3):302-305. 被引量：6
4马晓梅,李雪耀,张汝波,徐东.关键词检测系统中废料模型技术的研究[J].应用科技,2006,33(4):54-56.
5杨之佐,董明,刘加,刘润生,孙旭东.语音识别SoC UniLite的系统设计[J].计算机工程,2006,32(21):197-199. 被引量：2
6曹玉东.语音识别中的搜索策略研究[J].攀枝花学院学报,2007,24(3):46-49.
7马晓梅,沈洁.垃圾模型技术在关键词检测系统中的应用[J].信息技术,2009,33(6):142-144.
8于晓明,柏松.基于前向-后向HMM的连续语音识别系统的研究[J].计算机工程与设计,2009,30(18):4339-4341. 被引量：5
9侯昭武.隐马尔可夫模型的拓朴应用[J].河南师范大学学报（自然科学版）,2009,37(6):71-74. 被引量：1
10高建.基于HMM的连续小词量语音识别系统的研究[J].现代电子技术,2011,34(11):205-207. 被引量：1

1Tohru Shimizu,Yutaka Ashikari,Eiichiro Sumita,张劲松,Satoshi Nakamura.NICT/ATR Chinese-Japanese-English Speech-to-Speech Translation System[J].Tsinghua Science and Technology,2008,13(4):540-544. 被引量：3
2JBL On Time环绕式iPod基座音箱[J].高保真音响,2006(8):4-4.
3JBL ON TIME 200iD iPod底座[J].音响世界,2008(7):14-14.
4明亚运,祝世雄,曹云飞.Feistel结构差分活动S盒的搜索算法[J].通信技术,2014,47(10):1207-1210.
5计算机科学数学基础[J].电子科技文摘,2006(3):109-112.
6高亮,谢健,曹天泽.基于Kd树改进的高效K-means聚类算法[J].计算技术与自动化,2015,34(4):69-74. 被引量：7
7Xiao-Fei Zhang Liang Chang Pei-Ming Ren Rong Liu.Automatic Recognition Algorithm of AM Signals Based on Spectrum and Modulation Characters[J].Journal of Electronic Science and Technology,2012,10(2):163-166.
8孙海平,张溯,高明伦.UIO序列的启发式算法[J].合肥工业大学学报（自然科学版）,2001,24(4):486-492. 被引量：1
9张黄鹏.ASIC版图设计技巧[J].硅谷,2011,4(16):99-100.
10李子,蔡跃明.基于±1二次规划的低复杂度球形解码算法[J].通信学报,2007,28(11):15-20.

Tsinghua Science and Technology

2011年第1期

浏览历史

内容加载中请稍等...

English Speech Recognition System on Chip

参考文献14

共引文献10

相关作者

相关机构

相关主题

浏览历史