期刊文献+

基于Hilbert-Huang变换的语音信号共振峰频率估计 被引量:12

Speech formant frequency estimation based on Hilbert-Huang transform
下载PDF
导出
摘要 由快速傅里叶变换(FFT)初步估计出的语音信号的各阶共振峰频率确定相应带通滤波器的参数,并用该参数对语音信号作滤波处理,对滤波后的信号进行经验模态分解(EMD)得到一族固有模态函数(IMF),按能量最大原则确定出含有共振峰频率的IMF,计算出该IMF的瞬时频率和Hilbert谱即得到语音信号的共振峰频率参数.实验结果表明,与传统方法相比,该方法无须对语音信号进行分帧截断,提高了语音信号共振峰频率估计的时频分辨率和准确性,能够更精确地反映共振峰频率随时间的快速变化. After being filtered with the band-pass filters with the centre-frequencies obtained by using the fast Fourier transform (FFT) analysis, speech data were decomposea into a set of intrinsic mode function (IMF) using empirical mode decomposition (EMD). The IMFs containing formant frequencies were then identified according to the energy maximum criteria, and their instantaneous frequencies and Hilbert spectra were calculated, and finally, the formant frequencies of speech data were efficiently determined. The results show that, compared with the conventional formant estimation methods, the method based on HHT not only can provide more clear descriptions of the non-linear and non-stationary characteristics of speech signals, but also gives the speech formant frequencies and their variations with high time-frequency resolution and veracity.
作者 黄海 陈祥献
出处 《浙江大学学报(工学版)》 EI CAS CSCD 北大核心 2006年第11期1926-1930,共5页 Journal of Zhejiang University:Engineering Science
基金 国家自然科学基金资助项目(60275004)
关键词 语音信号 Hilbert—Huang变换 共振峰 非线性 speech signals Hilbert-Huang transform formant nonlinearity
  • 相关文献

参考文献8

  • 1MARKEL J D.Spectral analysis of speech by linear prediction[J].IEEE Transactions on Audio and Electroacoustic,1973,21(3):140-148. 被引量:1
  • 2CHRISTENSEN R L,SREONG W J,PALMER E P.A comparison of three methods of extracting resonance information from predictor-coefficient coded speech[J].IEEE Transactions on ASSP,1976,24(1):8-14. 被引量:1
  • 3张家騄.元音的内在基频与讲话方式对共振峰的影响[J].声学学报,1989,14(6):401-406. 被引量:6
  • 4张家騄.论语音技术的发展[J].声学学报,2004,29(3):193-199. 被引量:15
  • 5WATANABE A.Formant estimation method using inverse-filter control[J].IEEE Transactions on Speech and Audio Processing,2001,9(4):317-326. 被引量:1
  • 6RAO P,BARMAN A D.Speech formant frequency estimation:evaluating a nonstationary analysis method[J].Signal Processing,2000,80(8):1655-1667. 被引量:1
  • 7HUANG N E,SHEN Z,LONG S R,et al.The empirical mode decomposition and the Hilbert spectrum for nonlinear and non-stationary time series analysis[J].Proceedings of the Royal Society,1998,454A:903-995. 被引量:1
  • 8FLANDRIN P,RILLING G,GONCALVES P.Empirical mode decomposition as a filter bank[J].IEEE Signal Processing Letters,2004,11(2):112-114. 被引量:1

二级参考文献2

共引文献18

同被引文献107

引证文献12

二级引证文献37

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部