
Voice activity detection based on speech enhancement method (cited by: 1)
Abstract: The quality of voice activity detection (VAD) results is decisive for subsequent speech processing. To address the low detection rate of speech endpoints at low signal-to-noise ratio (SNR), this paper proposes combining a speech enhancement method based on deep belief network (DBN) denoising with a traditional endpoint detection method. A DBN model is first trained on a large amount of speech data so that it learns the nonlinear mapping between noisy and noise-free speech and can serve as a noise reduction filter. The paper then compares endpoint detection accuracy on noisy versus denoised speech, and the detection accuracy at different SNRs. The experimental results show that the method improves VAD accuracy at low SNR under both stationary and non-stationary noise.
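Authors: 包武杰 (Bao Wujie), 黄浩 (Huang Hao)
Source: 《现代电子技术》 (Modern Electronics Technique), Peking University Core Journal, 2017, No. 22, pp. 1-4, 9 (5 pages in total)
Funding: National Natural Science Foundation of China (61365005, 60965002)
Keywords: voice activity detection; deep belief network; signal-to-noise ratio (SNR); speech processing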
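
The abstract describes a two-stage pipeline: denoise the signal first, then run a traditional endpoint detector on the result. The sketch below illustrates only that structure, under stated assumptions. The record does not say which traditional detector the authors used, so a simple short-time energy threshold stands in for it, and the `denoise` argument is a hypothetical hook where a trained deep belief network would be plugged in (it defaults to the identity function). All function names, frame sizes, and the threshold ratio are illustrative choices, not values from the paper.

```python
import numpy as np


def short_time_energy(signal, frame_len=400, hop=160):
    """Return the energy (sum of squares) of each frame of a 1-D signal."""
    n_frames = 1 + max(0, (len(signal) - frame_len) // hop)
    return np.array([
        np.sum(signal[i * hop:i * hop + frame_len] ** 2)
        for i in range(n_frames)
    ])


def energy_vad(signal, frame_len=400, hop=160, ratio=0.1):
    """Mark a frame as speech when its energy exceeds ratio * max energy.

    This is a stand-in for the 'traditional endpoint detection method';
    the actual detector used in the paper is not specified in the abstract.
    """
    energy = short_time_energy(signal, frame_len, hop)
    return energy > ratio * energy.max()


def endpoints(speech_mask):
    """Return (first, last) speech frame indices, or None if none found."""
    idx = np.flatnonzero(speech_mask)
    return (int(idx[0]), int(idx[-1])) if idx.size else None


def detect_endpoints(noisy, denoise=lambda x: x, **vad_kwargs):
    """Denoise first (hypothetical hook for a trained DBN), then run VAD."""
    return endpoints(energy_vad(denoise(noisy), **vad_kwargs))


# Synthetic test signal: 1 s at 16 kHz, a 220 Hz tone between
# samples 4000 and 12000, with additive white Gaussian noise.
rng = np.random.default_rng(0)
sr = 16000
t = np.arange(sr) / sr
clean = np.zeros(sr)
clean[4000:12000] = np.sin(2 * np.pi * 220 * t[4000:12000])
noisy = clean + 0.05 * rng.standard_normal(sr)

result = detect_endpoints(noisy)
print("first and last speech frames:", result)
```

Keeping the denoiser as a callable makes the comparison described in the abstract straightforward to set up: the same detector can be run once with the identity function and once with a trained model, and the resulting endpoints compared at each SNR.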
