期刊文献+

低码率语音编码中过渡帧对合成语音的影响 被引量:2

Effects of transition frame on synthesized speech in low bit rate speech coding
下载PDF
导出
摘要 过渡段对语音清晰度、可懂度和人耳听觉感知都起到不可忽视的作用。参数语音编码中,包含有过渡段的语音帧能否得到恰当处理,是决定其合成语音是否清晰可懂的关键。本文以混合激励线性预测编码为参考,将其中的语音帧划分为静音、清音、浊音、过渡四大类后分别处理,在以往低码率语音编码(<1 kbps)工作基础上,比较了八种过渡帧划分方法对合成语音PESQ MOS的影响。经分析后发现:不同的过渡帧对PESQ MOS的贡献也不同。由清、静音向浊音变化的过渡帧的贡献最大;介于浊辅音与元音之间的过渡帧的贡献也不应被忽略。 Transition segments play an essential role in clarity, intelligibility, and auditory perception of speech. In parametric speech codec algorithm, whether the synthesized speech is clear and intelligible is critically determined by whether transition frames, which contain the transition segments, can be processed felicitously. Referring to the MELP (Mixed excitation linear prediction), frames are classified into four types: silent, unvoiced, voiced, and transition. Each type is processed respectively. Based on the previous work of low bit rate (〈 1 kbps) speech coding, the effect of 8 transition frame classification methods on the PESQ MOS (Perceptual Evaluation of Speech Quality Mean Opinion Score) are studied. It is found that different transitions contribute differently to the PESQ MOS. The transition from unvoiced or silent frame to voiced frame is the most important. And the transition between voiced consonant and vowel can not be neglected either.
出处 《应用声学》 CSCD 北大核心 2016年第1期77-83,共7页 Journal of Applied Acoustics
基金 国家自然科学基金项目(61302109)
关键词 低码率语音编码 混合激励线性预测编码 过渡段 Low bit rate speech codec, Mixed excitation linear prediction, Transition segment
  • 相关文献

参考文献12

二级参考文献32

  • 1刘刚,刘晶,王泉.一种基于覆盖域密度的LBG算法[J].计算机应用,2008,28(S2):319-321. 被引量:2
  • 2吴江滨,丛键,苏旸.一种600bps声码器及其与典型中低速语音编码器的音质对比[J].中国电子科学研究院学报,2007,2(1):93-96. 被引量:5
  • 3杨行峻,迟惠生.语音信号数字处理[M].北京:电子工业出版社,2003. 被引量:3
  • 4马大猷,沈蠓.声学手册(修订版).北京:科学出版社,2003:437. 被引量:1
  • 5Goalic A, Trubuil J. Underwater acoustic communication using reed solomon block turbo codes channel coding to transmit images and speech. OCEANS 2010:1-6. 被引量:1
  • 6Chamberlain M W. A 600 bps MELP vocoder for use on HF channels. Military Communications Conference, 2001(1): 447 453. 被引量:1
  • 7Peutz V M A. Articulation loss of consonants as a crite- rion for speech transmission in a room. J. Audio Er-9. Soc., 1971; 19(11): 915-919. 被引量:1
  • 8吴宗济,林茂灿.实验语音学概要.北京;高等教育出版社,1987.86-96,107,119,209. 被引量:1
  • 9Voiers W D. Diagnostic evaluation of speech intelligibility. Speech Intelligibility and Speaker Recognition, Benchmark paper in Acoustics, 1977; 11:374-387. 被引量:1
  • 10Federal information processing standards publication (MELP): specifications for the analog to digital conver- sion of voice by 2,400 bit/second mixed excitation linear prediction (draft). 1997. 被引量:1

共引文献18

同被引文献9

引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部