期刊文献+

低信噪比环境下的语音识别方法研究 被引量:11

Research on speech recognition in low SNR environment
下载PDF
导出
摘要 单通道语音信号在信噪比较大的环境下经过增强后再识别,能表现出较高的识别率。但是在低信噪比环境下,增强后语音信号的识别率急剧下降。针对此种情况,提出了一种用在识别系统前端的语音增强算法,该增强算法将采集到的带噪语音信号先使用对数最小均方误差(Logarithmic Minimum Mean Square Error,Log MMSE)提高其信噪比,然后再利用改进的维纳滤波去除噪声残留并提升语音可懂度,最后用梅尔频率倒谱系数(Mel-Frequency Cepstral Coefficients,MFCC)和隐马尔科夫模型(Hidden Markov Model,HMM)对增强后的语音信号做特征提取并识别。实验分析结果表明,该方法能有效地抑制背景噪声并减少噪声残留,显著提升低信噪比环境下语音识别的准确性。 The accuracy rate of single channel enhanced speech recognition in high SNR environment is acceptable, but not so in low SNR environment. In this case, speech enhancement based on logarithmic minimum mean square error(Log MMSE) algorithm and modified Wiener filter algorithm is presented. Firstly the gathered speech signals' SNR is improved by the Log MMSE algorithm. Then using the improved Wiener filter algorithm removes residual noise and improves the signal quality. Finally the enhanced speech is used for recognition by MFCC and HMM algorithms. Experimental results show that the proposed method can effectively remove the background noise and reduce the residual noise, significantly increase the accuracy of the automatic speech recognition in noisy environment.
出处 《声学技术》 CSCD 北大核心 2017年第1期50-56,共7页 Technical Acoustics
基金 国家自然科学基金(61461011) 教育部重点实验室2016年主任基金(CRKL160107)资助项目
关键词 语音增强 低信噪比 改进维纳滤波 对数最小均方误差算法 语音识别 speech enhancement low SNR modified Wiener filter Log MMSE algorithm speech recognition
  • 相关文献

参考文献6

二级参考文献46

  • 1王莉,胡剑凌,徐盛.基于听觉掩蔽效应的语音增强算法的研究[J].电声技术,2006,30(7):39-42. 被引量:3
  • 2Lim J S, Oppenheim A V.Enhancement and bandwidth compression of noisy speech[C]//Proceedings of the IEEE, 1979, 67: 1586-1604. 被引量:1
  • 3Ephraim Y,Malah D.Speech enhancement using a minimum meansquare error log-spectral amplitude estimator[J].IEEE Trans on Acoustics,Speech,Signal Processing, 1985,ASSP-32:443-445. 被引量:1
  • 4Ephraim Y, Malah D.Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator[J]. IEEE Trans on Acoustics, Speech, Signal Processing, 1984, AS- SP-32 : 1109-1121. 被引量:1
  • 5Capp60.Elimination of the musical noise phenomenon with the Ephraim and Malah noise suppressor[J].IEEE Trans on Speech and Audio Processing, 1994,2 (2) : 345-349. 被引量:1
  • 6Scalart P,Vieira-Filho J.Speech enhancement based on a priori signal to noise estimation[C]//Proc 21st IEEE Int Conf Acoust Speech Signal Processing, Atlanta, GA, 1996,2 (2) : 629-632. 被引量:1
  • 7Cohen I.Speech enhancement using a noncausal a priori SNR estimator[J].IEEE Signal Processing Letters,2004(9):725-728. 被引量:1
  • 8Arslan L M.Modified Wiener filtering[J].Signal Processing,2006, 86(2) :267-272. 被引量:1
  • 9Xu Yao-hua, Guo Ying, Li Wei, et al.Elimination of musical noise phenomenon with Burg-based a priori SNR estimator[C]// Image and Signal Processing, 2008, CISP' 08,2008,5 : 328-332. 被引量:1
  • 10阔永红,陈健,杨昌方.基于听觉掩蔽效应的MMSE语音增强算法[J].计算机工程与应用,2007,43(27):140-141. 被引量:5

共引文献40

同被引文献83

引证文献11

二级引证文献46

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部