基于非均匀MCE准则的DNN关键词检测系统中声学模型的训练被引量：1

Non-uniform MCE Based Acoustic Model for Keyword Spotting based on Deep Neural Network

下载PDF

导出

摘要关键词检测是从连续语音流中检测预先定义的给定词的技术,是语音识别领域的一个重要应用。目前的关键词检测研究中,主流的方法是基于连续语音识别器的先识别后检测的两阶段方法,语音识别器的准确率对关键词检测有很大影响。本文首先在识别阶段引入深度学习技术来改善关键词检测算法的性能。进而针对识别阶段和检测阶段缺乏紧密联系,耦合度不够的问题,研究了侧重关键词的深度神经网络声学建模技术,利用非均匀的最小分类错误准则来调整深度神经网络声学建模中的参数,并利用Ada Boost算法来动态调整声学建模中的关键词权重。结果表明,利用非均匀最小分类错误准则来调整深度神经网络参数进行优化的声学模型,可以提高关键词检测的性能。 Spoken term detection（ STD） is a task to automatically detect a set of keywords in continuous speech,which is an important field of speech recognition. Current study is based on two- stage approach i. e. recognition and detection. The accuracy of speech recognition has a significant impact on keyword detection. Firstly,this paper uses deep leaning techniques to improve performance during the first stage. As the two stages lack of close contact,the paper studies using non-uniform misclassification error（ MCE） criteria to adjust the parameters in deep neural network based acoustic modeling.Further the paper uses the adaptive boosting（ Ada Boost） strategy to adjust keywords＇ weight dynamically. It shows that non- uniform MCE can improve the performance of STD.

作者王朝松韩纪庆郑铁然

机构地区哈尔滨工业大学计算机科学与技术学院

出处《智能计算机与应用》 2015年第5期15-17,21,共4页 Intelligent Computer and Applications

基金国家自然科学基金(91120303)

关键词深度学习关键词检测 ADA BOOST 最小分类错误 Deep Learning Spoken Term Detection Ada Boost Minimum Classification Error

分类号 TP391.4 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献16

1HINTON G, DENG L, YU D, et al. Deep Neural Networks for acoustic modeling in Speech Recognition: The shared views of four research groups[J]. Signal Processing Magazine IEEE, 2012, 29(6): 82 - 97. 被引量：1
2MILLER D, KLEBER M, KAO C, et al. Rapid and accurate spoken tenn detection[J]. Proc. Interspeech, 2007 , 3: 1965 - 1968. 被引量：1
3National Institute of Standards and Technology (NIST). The spoken term detection (STD) 2006 evaluation plan[J]. http://www. nist. gov / speechl testsl std ,2006. 被引量：1
4JUANG B, HOU W, LEE C. Minimum classification error rate methods for speech recognition[J]. IEEE Trans on Speech & Audio Proc, 1997,5(3) :257 - 265. 被引量：1
5BAHL L, BROWN P F, De SOUZA P V, et al. Maximum mutual information estimation of hidden Markov model parameters for speech recognition[J]. Acoustics Speech & Signal Processing IEEE International Conference on Icassp, 1986, 11 :49 - 52. 被引量：1
6DANIEL P. Discriminative training for large vocabulary speech recognition[D]. Cambridge: University of Cambridge, 2003. 被引量：1
7FU Q, MANSJUR D S,JUANG B H. Non - Uniform error criteria for automatic pattern and speech recognition[C] / / Acoustics, Speech and Signal Processing, 2008. ICASSP 2008, IEEE International Conference on, Las Vegas: IEEE, 2008:1853 - 1856. 被引量：1
8FU Q, MANSJUR D S,JUANG B. Empirical system learning for statistical pattern recognition With non - uniform error criteria[J]. Signal Processing IEEE Transactions on, 2010, 58(9) :4621 - 4633. 被引量：1
9WENG C,JUANG B. Adaptive boosted non - uniform MCE for keyword spotting on spontaneous speech[C] / /IEEE International Conference on Acoustics, Speech & Signal Processing, Vancouver: IEEE, 2013: 6960 - 6964. 被引量：1
10GHOSHAL A, POVEY D. Sequence discriminative training of deep neural networks[J]. Proclnterspeech, 2013 ( 8) : 2345 - 2349. 被引量：1

同被引文献1

1李鹏,屈丹.采用词图相交融合的语音关键词检测方法[J].信号处理,2015,31(6):702-709. 被引量：4

引证文献1

1徐博文,金小峰.基于语音识别的朝鲜语语音检索方法[J].延边大学学报（自然科学版）,2021,47(3):273-278.

1韩纪庆.基于最小分类错误准则的判别学习方法[J].电子工程师,2001,27(2):1-3. 被引量：2
2韩纪庆.一种语音识别中的环境自适应方法[J].计算机工程与应用,2002,38(1):69-70.
3王成儒,王金甲,李静.一种用于说话人辨认的概率神经网络的MCE训练算法[J].仪器仪表学报,2002,23(z3):154-156. 被引量：4
4郑永军,张连海.融合查询扩展和动态匹配的集外词检测[J].数据采集与处理,2014,29(2):280-285.
5陈斌,张连海,牛铜,屈丹,李弼程.基于MCE准则的语音识别特征线性判别分析[J].自动化学报,2014,40(6):1208-1215. 被引量：4
6杨鹏,谢磊,张艳宁.低资源语言的无监督语音关键词检测技术综述[J].中国图象图形学报,2015,20(2):211-218. 被引量：3
7孟猛,王晓瑞,梁家恩,徐波.一种基于互补声学模型的多系统融合语音关键词检测方法[J].自动化学报,2009,35(1):39-45. 被引量：3
8张文超,吕岳,文颖,黄志敏.几何信息与SIFT特征相结合的特定人手写关键词检测[J].智能系统学报,2014,9(5):544-550. 被引量：1
9王松,孙传庆,朱正平.基于GMM与改进MCE训练的说话人识别研究[J].自动化与仪器仪表,2010(6):21-23.
10王勇,张连海.基于点过程模型连续语音关键词检测[J].太赫兹科学与电子信息学报,2013,11(6):958-963. 被引量：2

智能计算机与应用

2015年第5期

浏览历史

内容加载中请稍等...

基于非均匀MCE准则的DNN关键词检测系统中声学模型的训练被引量：1

参考文献16

同被引文献1

引证文献1

相关作者

相关机构

相关主题

浏览历史

基于非均匀MCE准则的DNN关键词检测系统中声学模型的训练 被引量：1

参考文献16

同被引文献1

引证文献1

相关作者

相关机构

相关主题

浏览历史

基于非均匀MCE准则的DNN关键词检测系统中声学模型的训练被引量：1