期刊文献+

多带激励声码器的改进双路径基音跟踪算法

Improved two-path pitch tracking algorithm for multiband excitation vocoders
原文传递
导出
摘要 为了增强多带激励(MBE)声码器基音估计性能的鲁棒性,提出了一种可适用于低信噪比语音信号的改进双路径基音跟踪算法.采用全新构造的差值不等式作为约束方程,其差值门限的取值在基音跟踪过程中能够根据基音周期长短的统计特征自动更新.实验结果显示:在SNR为-5dB的高斯白噪声干扰的情况下,基音估计的严重错误概率的性能改善平均达到70%.与传统算法相比,该算法对不同讲话者和不同程度高斯白噪声干扰均具有较强的适应能力,尤其在噪声严重的情况下该算法对基音估计的准确性得到明显改善,从而使合成语音具有较好的可懂度和自然度. An improved two-path pitch tracki signal to noise ratio (SNR) levels was propose ng algorithm applicable to d to enhance robustness of noisy speech signals at low pitch estimation performance for multiband excitation (MBE) vocoder. The algorithm employed a new constructed difference inequation as the constraint equation, and its difference threshold value could be updated automatically during the pitch tracking process according to the statistical feature of the pitch period length. The experimental results show that the average gross error rate improvement achieves 70% at -5 dB SNR level in white Gaussian noise condition. The algorithm outperforms the conventional algorithm, displaying better adaptability to different speakers and different white Gaussian noise conditions, improving the pitch estimation accuracy obviously especially in serious noise eonditions, thus facilitating more intelligible and more natural synthesized speech.
出处 《华中科技大学学报(自然科学版)》 EI CAS CSCD 北大核心 2012年第6期54-58,共5页 Journal of Huazhong University of Science and Technology(Natural Science Edition)
基金 湖北省自然科学基金资助项目(2009CDB320)
关键词 基音跟踪 多带激励声码器 鲁棒性 约束方程 最佳门限 pitch tracking multiband excitation (MBE) vocoder robustness constraint equation optimal threshold
  • 相关文献

参考文献15

二级参考文献33

  • 1刘建,郑方,吴文虎.基于幅度差平方和函数的基音周期提取算法[J].清华大学学报(自然科学版),2006,46(1):74-77. 被引量:22
  • 2杨行峻 迟惠生.语音信号数字处理[M].北京:电子工业出版社,1990.. 被引量:3
  • 3Rabiner L, Cheng M. A comparative performance study of several pitch detection algorithms[J].IEEE Tram On Acoustics, Speech, and Signal Processing, 1976, 24(5): 399 - 418. 被引量:1
  • 4Secrest B, Doddington G. Postprocessing techniques for voice pitch trackers[C]//International Conf On Acoustics, Speech, and Signal Processing. Paris: IEEE, 1982: 172- 175. 被引量:1
  • 5Ney H. A dynamic programming technique for nonlinear smoothing[C]// International Conf On Acoustics, Speech, and Signal Processing. Atlanta: IEEE, 1981: 62-65. 被引量:1
  • 6Plante F, Meyer G F. A pitch extraction reference database [C]// European Conf on Speech Communication and Technology. Madrid, 1995:837 - 840. 被引量:1
  • 7Kondoz A M. Digital Speech[M]. England: John Wiley& Sons Ltd, 2004. 被引量:1
  • 8[1] Griffin D, Lim J S. Multi-band excitation vocoder [J]. IEEE Trans on ASSP, 1988, 36(8):1223~1235. 被引量:1
  • 9[2] McAulay R J, Quantieri T F. Speech analysis/synthesis based on a sinusoidal representation[J]. IEEE Trans on ASSP, 1986,34(4):744~754. 被引量:1
  • 10[3] Yeldener S, De Martin J C, Viswanathen V. A mixed sinusoidally excited linear prediction coder at 4 kb/s and below[A]. Proc IEEE ICASSP [C]. Seattle, USA, 1998.589~592. 被引量:1

共引文献40

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部