Research on a Speech Bandwidth Extension Algorithm Based on the Gaussian Mixture Model (cited by: 7)

Speech wideband extension based on Gaussian mixture model
Abstract: To reduce high-band spectral distortion, this paper studies how the high-band spectral distortion of a bandwidth extension algorithm depends on the mutual information between the feature parameters and the high-band spectral envelope, and on that basis proposes an extended Gaussian mixture model (GMM) bandwidth extension algorithm. First, the algorithm selects the parameters that have high mutual information with the high-band spectral envelope to form the feature vector, and uses the GMM to compute the joint probability density of the feature vector and the high-band spectral envelope. Second, the Expectation-Maximization (EM) algorithm is used to estimate the parameters of the Gaussian components and to compute the posterior probabilities. Finally, the high-band spectral envelope is estimated from the posterior probabilities. Experimental results show that, compared with the traditional GMM-based bandwidth extension algorithm, the proposed algorithm lowers the average high-band spectral distortion by 0.3 dB and reduces the number of speech frames with spectral distortion above 10 dB by more than 50%.
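The abstract names the steps (joint GMM, EM, posterior-based estimation) but not the formulas. The sketch below is one common way to realize them, assuming the usual joint-density GMM mapping in which the estimate is the posterior-weighted sum of per-component conditional means, ŷ = Σ_k p(k|x) [μ_y,k + Σ_yx,k Σ_xx,k⁻¹ (x − μ_x,k)]. The function names (`fit_gmm_em`, `estimate_highband`), the random initialization, and the regularization term `reg` are illustrative assumptions, not the authors' implementation; the mutual-information-based feature selection is not shown.

```python
import numpy as np

def log_gauss(X, mean, cov):
    """Log density of N(mean, cov) evaluated at each row of X."""
    d = X.shape[1]
    chol = np.linalg.cholesky(cov)
    sol = np.linalg.solve(chol, (X - mean).T)       # shape (d, n)
    maha = np.sum(sol ** 2, axis=0)                 # squared Mahalanobis distance
    log_det = 2.0 * np.sum(np.log(np.diag(chol)))
    return -0.5 * (d * np.log(2.0 * np.pi) + log_det + maha)

def posteriors(X, weights, means, covs):
    """Posterior probability of every component for each row of X."""
    log_p = np.stack([np.log(w) + log_gauss(X, m, c)
                      for w, m, c in zip(weights, means, covs)], axis=1)
    log_p -= log_p.max(axis=1, keepdims=True)       # stabilize the exponentials
    p = np.exp(log_p)
    return p / p.sum(axis=1, keepdims=True)

def fit_gmm_em(Z, n_components, n_iter=50, reg=1e-6, seed=0):
    """Fit a full-covariance GMM to the joint vectors Z = [x, y] with plain EM."""
    rng = np.random.default_rng(seed)
    n, d = Z.shape
    # Assumed initialization: random data points as means, global covariance.
    means = Z[rng.choice(n, n_components, replace=False)].copy()
    covs = np.stack([np.atleast_2d(np.cov(Z.T)) + reg * np.eye(d)] * n_components)
    weights = np.full(n_components, 1.0 / n_components)
    for _ in range(n_iter):
        resp = posteriors(Z, weights, means, covs)  # E-step
        nk = resp.sum(axis=0) + 1e-12               # M-step
        weights = nk / nk.sum()
        means = (resp.T @ Z) / nk[:, None]
        for k in range(n_components):
            diff = Z - means[k]
            covs[k] = (resp[:, k, None] * diff).T @ diff / nk[k] + reg * np.eye(d)
    return weights, means, covs

def estimate_highband(X, weights, means, covs):
    """Posterior-weighted conditional-mean estimate of y given features X."""
    dx = X.shape[1]
    mu_x, mu_y = means[:, :dx], means[:, dx:]
    cov_xx, cov_yx = covs[:, :dx, :dx], covs[:, dx:, :dx]
    post = posteriors(X, weights, mu_x, cov_xx)     # p(k | x) from the x-marginal
    Y_hat = np.zeros((X.shape[0], mu_y.shape[1]))
    for k in range(len(weights)):
        gain = cov_yx[k] @ np.linalg.inv(cov_xx[k])  # shape (dy, dx)
        Y_hat += post[:, k, None] * (mu_y[k] + (X - mu_x[k]) @ gain.T)
    return Y_hat
```

In use, one would stack narrowband feature vectors and high-band envelope parameters column-wise into `Z`, call `fit_gmm_em(Z, n_components)` on training frames, and then call `estimate_highband` on the features of new frames. Which features and envelope parameterization to use is left open here, since the abstract does not specify them.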
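Authors: 张勇 (Zhang Yong), 胡瑞敏 (Hu Ruimin)
Source: Acta Acustica (《声学学报》), indexed in EI, CSCD, and the Peking University Core Journals list; 2009, No. 5, pp. 471-480 (10 pages)
Funding: Key Program of the National Natural Science Foundation of China (60832002)
Keywords: Gaussian mixture model; extension algorithm; speech frame; bandwidth; joint probability density; feature vector; probability estimation; spectral distortion; Blind source separation; Communication channels (information theory); Image retrieval; Magnetostrictive devices; Probability; Probability density function; Vector quantization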

