期刊文献+

改进的功率谱二次处理基音检测法 被引量:2

Improved Power Spectrum Reprocessing Pitch Detection Method
下载PDF
导出
摘要 作为语音信号处理中的一项关键技术,基音检测一直是研究热点。本文分析了功率谱二次处理基音检测方法的不足:对于过渡语音,易产生半频或倍频误判;噪声干扰下,检测结果易失真;清、浊音的判断方法复杂。针对这些不足,本文提出一系列改进方法:时域非线性处理,频域加窗滤波,简化清、浊音判断。MATLAB仿真实验结果表明,无论是高信噪比还是低信噪比语音,改进的二次谱法较AMDF法和二次谱法更能清晰、准确地检测出基音轨迹。 As a key technique, the pitch detection has been a hot spot in the field of speech processing. The following three disadvantages of power spectrum reprocessing (PSR) method for pitch detection are found: half pitch error and double pitch error with transition sound; easy distortion in noise speech; the complex method of judging voiceless and voiced speech. A series of improvement methods are proposed: nonlinear processing at time-domain; windowed filtering at fre- quency-domain; simplification method of judging voiceless and voiced speech. The experimental results based on MATLAT show that the improved method detects pitch trajectory more clearly and accurately than the AMDF method and the PSR method.
出处 《计算机工程与科学》 CSCD 北大核心 2010年第5期140-142,146,共4页 Computer Engineering & Science
基金 湖北省教育厅重大项目(Z20081301) 湖北省自然科学基金资助项目(2008CDB346) 宜昌市科学技术研究与开发项目(A09302-31 A09302-32)
关键词 基音检测 倒谱法 功率谱二次处理 非线性处理 pitch detection cepstrum method power spectrum reprocessing nonlinear processing
  • 相关文献

参考文献7

二级参考文献26

  • 1胡瑞敏,薛东辉,姚天任,黄铁侠.神经网络方法及其在语音识别中的应用[J].高技术通讯,1995,5(6):11-15. 被引量:5
  • 2Kadambe S., Boudreaux Barrels G,F., Application of the wavelet transform for pitch detection of speech signals. IEEE Transactions on Information Theory, 1992, 38(2): 917-924. 被引量:1
  • 3Strube H, W., I)etermination of the instant of glottal closure from the speech wave. Journal of the Acoustical Society of America, 1974, 56(5):1625-1629. 被引量:1
  • 4Ananthapadmanabha T. V, , Yegnanarayana B.. Epoch extraction of voiced speech. IEEE Transactions on Acoust, Speech,Signal Processing, 1975, 23(6):562-570. 被引量:1
  • 5Ananthapadmanabha T. V. , Yegnanarayana B,. Epoch extraction from linear prediction residual for identification of closed glottis interval. IEEE Transacions on Acoust, Speech, Signal Processing, 1979, ASSP 27(4): 309-319. 被引量:1
  • 6Cheng Y, M. , O'Shaughnessy D.. Automatic and reliable estimation of glottal closure instant and period. IEEE Transactions on Acoust, Speech, Signal Processing, 1989, 37(12):1805-1815. 被引量:1
  • 7Huang N, E. , Shen Z. , I.ong S, R. et al.. The empirical mode decomposition and the Hilbert spectrum for nonlinear and non-stationary time series analysis. In: Proceedings of the Royal Society of London, 1998, A(454):903-995. 被引量:1
  • 8Titchmarsh E, C,, Introduction to the Theory of Fourier Integrals, Oxford University Press, 1948. 被引量:1
  • 9Kiritani S., Imagawa H., Hirose H.. Vocal cord vibration and voice source characteristics observation by high speed digital image recording. In: Proceedings of ICSLP'90, Kobe, Japan, 1990, 61-64. 被引量:1
  • 10Hess W. Pitch Determination of Speech Signals[M]. New York:Springer-Verlag, 1983. 被引量:1

共引文献22

同被引文献16

引证文献2

二级引证文献13

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部