基于矢量泰勒级数的鲁棒语音识别被引量：4

Robust Speech Recognition Based on Vector Taylor Series

下载PDF

导出

摘要矢量泰勒级数是一种有效的抗噪声鲁棒语音识别算法.然而在对数谱域,美尔滤波器组的不同通道之间有较强的相关性,因而难以从含噪语音中准确估计噪声的方差.提出了一种基于矢量泰勒级数的倒谱域特征补偿算法.该算法在倒谱域,用一个高斯混合模型描述语音倒谱特征的分布,通过矢量泰勒级数从含噪语音中估计噪声的均值和方差.实验结果表明,此算法能明显提高语音识别系统的性能,优于基于矢量泰勒级数的对数谱域特征补偿算法. The vector Taylor series（VTS）expansion is an effective approach to noise robust speech recognition.How-ever,in the log-spectral domain,there exist the strong correlations among the different channels of Mel filter bank and thus it is difficult to estimate the noise variance from noisy speech proposes.A feature compensation algorithm in the cepstral domain based on vector Taylor series was proposed.In this algorithm,the distribution of speech cepstral features was represented by a Gaussian mixture model（GMM）,and the mean and variance of noise were estimated from noisy speech by the VTS approximation.The experimental results show that the proposed algorithm can signifi-cantly improve the performance of speech recognition system,and outperforms the VTS-based feature compensation method in the log-spectral domain.

作者吕勇吴镇扬

机构地区东南大学信息科学与工程学院

出处《天津大学学报》 EI CAS CSCD 北大核心 2011年第3期261-265,共5页 Journal of Tianjin University(Science and Technology)

基金国家自然科学基金资助项目(60971098)

关键词特征补偿矢量泰勒级数噪声估计鲁棒语音识别 feature compensation vector Taylor series noise estimation robust speech recognition

分类号 TN912.34 [电子电信—通信与信息系统]

引文网络
相关文献

参考文献15

1Atal B. Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification [J]. Journal of the Acoustical Society of America, 1974,55 (6) : 1304-1312. 被引量：1
2Erell A,Weintraub M. Filterbank-energy estimation using mixture and Markov models for recognition of noisy speech [J]. IEEE Trans on Speech and Audio Processing, 1993,1 (1) :68-76. 被引量：1
3Acero A. Acoustical and Environmental Robustness in Automatic Speech Recognition [M]. Norwell:Kluwer Academic Publisher, 1993. 被引量：1
4Sasou A,Asano F,Nakamura S,et al. HMM-based noiserobust feature compensation [J]. Speech Communication, 2006,48(9) :1100-1111. 被引量：1
5Kim W,Hansen J H L. Feature compensation in the cepstral domain employing model combination[J]. Speech Communication, 2009,51 (2) : 83-96. 被引量：1
6吕勇,吴镇扬.基于隐马尔可夫模型与并行模型组合的特征补偿算法[J].东南大学学报（自然科学版）,2009,39(5):889-893. 被引量：4
7Lu Yong, Wu Zhenyang. Maximum likelihood model adaptation using piecewise linear transformation for ro- bust speech recognition [C]//IEEE International Sympo sium on Consumer Electronics. Kyoto,Japan,2009 : 608- 610. 被引量：1
8Gales M J F. Model-Based Techniques for Noise Robust Speech Recognition[D]. Cambridge: Cambridge University, 1995. 被引量：1
9Moreno P J,Raj B,Stern R M. A vector Taylor series approach for environment-independent speech recognition [C]//IEEE Int Conf on Acoustics ,Speech ,and Signal Processing. Atlanta,USA, 1996 : 733-736. 被引量：1
10Moreno P J. Speech Recognition in Noisy Environments[D]. Pittsburgh: Carnegie Mellon University, 1996. 被引量：1

二级参考文献11

1孙暐,吴镇扬.基于独立感知理论的鲁棒语音识别算法[J].东南大学学报（自然科学版）,2005,35(4):506-509. 被引量：2
2赵蕤,王作英.语音识别中信道和噪音的联合补偿[J].声学学报,2006,31(5):466-470. 被引量：11
3Nasersharif B, Akbari A. SNR-dependent compression of enhanced Mel sub-band energies for compensation of noise effects on MFCC features [J ]. Pattern Recognition Letters, 2007,28( 11 ) : 1320 - 1326. 被引量：1
4Cui X, Alwan A. Noise robust speech recognition using feature compensation based on polynomial regression of utterance SNR [ J ]. IEEE Transactions on Speech and Audio Processing, 2005, 13(6) : 1161 -1172. 被引量：1
5Barreaud V, Illina I, Fohr D. On-line stochastic matching compensation for non-stationary noise [ J ]. Computer Speech and Language, 2008, 22 ( 3 ) : 207 - 229. 被引量：1
6Moreno P J. Speech recognition in noisy environments [ D]. Pittsburgh, Pennsylvania, USA: Carnegie Mellon University, 1996: 79 - 126. 被引量：1
7Kim W, Kwon O, Ko H. PCMM-based feature compensation schemes using model interpolation and mixture sharing [ C ]//IEEE International Conference on Acoustics, Speech, and Signal Processing. Montreal, Canada, 2004:989-992. 被引量：1
8Kim W, Hansen J H L. Feature compensation in the cepstral domain employing model combination [ J ]. Speech Communication, 2009, 51 (2) : 83 - 96. 被引量：1
9Sasou A, Asano F, Nakamura S, et al. HMM-based noise-robust feature compensation [ J]. Speech Communication, 2006, 48 (9) : 1100 - 1111. 被引量：1
10Gales M J F, Young S J. Robust speech recognition in additive and convolutional noise using parallel model combination [ J ]. Computer Speech and Language, 1995, 9(4): 289-307. 被引量：1

共引文献3

1牛铜,李弼程,张连杰.基于缺失数据补偿的鲁棒语音识别[J].信息工程大学学报,2012,13(4):411-415.
2张宁,朱礼军.中文问答系统问句分析研究综述[J].情报工程,2016,2(1):32-42. 被引量：12
3李聪,葛洪伟.自适应并行模型组合的鲁棒语音身份识别算法[J].信号处理,2018,34(7):867-875. 被引量：6

同被引文献24

1白文雅,黄健群,陈智伶.基于维纳滤波语音增强算法的改进实现[J].电声技术,2007,31(1):44-46. 被引量：14
2朱维彬,吕士楠.基于语义的语音合成——语音合成技术的现状及展望[J].北京理工大学学报,2007,27(5):408-412. 被引量：8
3GUPTA V K,BHOWMICK A,CHANDRA M,et al. Speech enhancement using MMSE estimation and spectral subtraction methods[ C]//Proc. In- ternational Conference on Devices and Communications. [ S. 1. ] : IEEE Press,2011:1-5. 被引量：1
4DE LA T A,PEINADO A M,SEGURA J C,et al. Histogram equalization of speech representation for robust speech recognition [ J ]. IEEE Trans. Speech and Audio Processing,2005,13 ( 3 ) :355-366. 被引量：1
5LI Sheng, NIU Ming, WANG Jianqi, et al. Enhancement of non-air con- duct speech based on multi- band spectral subtraction method [ C ]// Prec. Congress on Image and Signal Processing, 2008. [ S. 1. ] : IEEE Press ,2008:338-341. 被引量：1
6AFAYANI M, AMET/ H, ABAALI B, et al. An ef~cient multi-band spectral subtraction method for robust speech recognition[ C]//Proc. 9th International Symposium on Signal Processing and Its Applications, 2007. [S. 1. ] :IEEE Press,2007:l-4. 被引量：1
7ABUSHARIAH A A M, GUNAWAN T S, KHALIFA O O, et al. English digits speech recognition system based on hidden Markov models [ C ]// Proc. International Cord'erence on Computer and Communication Engi- neering. Kuala Lumpur Malaysia:Electr. & Comput. Eng. Dept., Int. Islamic Univ. ,2010:1-5. 被引量：1
8程正,赵鹤鸣.基于多频带谱减法的语音增强算法的研究[J].计算机工程与应用,2007,43(36):40-42. 被引量：6
9何强,罗晓松,何巧,万众.语音增强中谱减法的改进应用[J].电声技术,2008,32(7):57-60. 被引量：2
10李伟,李晓强,陈芳,王淞昕.数字音频指纹技术综述[J].小型微型计算机系统,2008,29(11):2124-2130. 被引量：14

引证文献4

1沈崇德,童思木.医院智能语音客户服务系统的创新研究与应用示范[J].中国医学装备,2013,10(1):71-73. 被引量：7
2ZHANG Yi,HE Chun-jiang,LUO Yuan,CHEN Kai,XING Wu-chao.Improved perceptually non-uniform spectral compression for robust speech recognition[J].The Journal of China Universities of Posts and Telecommunications,2013,20(4):122-126. 被引量：1
3张毅,何春江,罗元,徐晓东,童开国.基于改进感知非均匀谱压缩的鲁棒语音识别算法[J].信息与控制,2013,42(5):565-569. 被引量：1
4万义龙,张天骐,王志朝,金静.基于多频带谱减法的抗噪声语音识别研究[J].电视技术,2013,37(23):183-187. 被引量：5

二级引证文献14

1程美,王力华.医疗智能语音技术与应用综述[J].中国数字医学,2021,16(8):1-7. 被引量：7
2张丽,商洪涛,王彪,刘晓日.医院微信服务平台的设计与实现[J].中国医学装备,2015,12(10):46-48. 被引量：23
3胡丹,曾庆宁,龙超,黄桂敏.连续语音识别前端鲁棒性研究[J].电视技术,2015,39(24):43-46. 被引量：2
4吴进,张青.一种改进的孤立词语音识别系统设计[J].西安邮电大学学报,2016,21(1):76-80. 被引量：4
5罗元,吴承军,张毅,黎小松,席兵.Mel频率下基于LPC的语音信号深度特征提取算法[J].重庆邮电大学学报（自然科学版）,2016,28(2):174-179. 被引量：12
6崔旭.基于多窗谱估计的改进的维纳滤波语音增强算法[J].电子世界,2017,0(7):148-148.
7胡金艳,李盛,陈扶明,张宇婷,谢欣欣,王健琪.基于多带谱减法的生物雷达语音增强方法研究[J].科学技术与工程,2017,17(16):76-81. 被引量：1
8刘文华,邵尉.数字化医院语音录入系统的设计与应用[J].中国数字医学,2017,12(10):78-80. 被引量：3
9刘晶,罗进城,左秀然.基于语音识别的移动电子病历应用探索[J].中国数字医学,2018,13(4):23-25. 被引量：13
10张琳,周韬,杜庆治,邵玉斌,龙华.基于物理特征的音频相似度比对算法研究[J].电视技术,2017,41(11):110-114. 被引量：9

1何勇军,付茂国,孙广路.语音特征增强方法综述[J].哈尔滨理工大学学报,2014,19(2):19-25. 被引量：3
2吴海洋,杨飞然,周琳,吴镇扬.矢量泰勒级数特征补偿的说话人识别[J].声学学报,2013,38(1):105-112. 被引量：6
3雷建军,杨震,刘刚,郭军.噪声鲁棒语音识别研究综述[J].计算机应用研究,2009,26(4):1210-1216. 被引量：14
4吕勇,吴镇扬.基于矢量泰勒级数的模型自适应算法[J].电子与信息学报,2010,32(1):107-111. 被引量：2
5吕钊,吴小培,张超.鲁棒语音识别技术综述[J].安徽大学学报（自然科学版）,2013,37(5):17-24. 被引量：4
6牛铜,李弼程,张连杰.基于缺失数据补偿的鲁棒语音识别[J].信息工程大学学报,2012,13(4):411-415.
7陈洁.背景音乐自动分离系统设计与实现[J].现代电子技术,2017,40(5):134-138. 被引量：2
8邱作春.麦克风阵列语音增强用于抗噪说话人识别[J].大众科技,2008,10(12):35-37.
9王让定,柴佩琪.语音倒谱特征的研究[J].计算机工程,2003,29(13):31-33. 被引量：50
10孙暐,吴镇扬.多带抗噪声语音识别算法研究[J].信号处理,2006,22(4):559-563.

天津大学学报

2011年第3期

浏览历史

内容加载中请稍等...

基于矢量泰勒级数的鲁棒语音识别被引量：4

参考文献15

二级参考文献11

共引文献3

同被引文献24

引证文献4

二级引证文献14

相关作者

相关机构

相关主题

浏览历史

基于矢量泰勒级数的鲁棒语音识别 被引量：4

参考文献15

二级参考文献11

共引文献3

同被引文献24

引证文献4

二级引证文献14

相关作者

相关机构

相关主题

浏览历史

基于矢量泰勒级数的鲁棒语音识别被引量：4