Abstract
A new method for synthesizing speech targeted at formant trajectories is proposed, based on a speech production model and the RTLA articulatory synthesis model. The synthesizer is implemented as a scattering process derived from the reflection-type line analog (RTLA) of the vocal tract, following the acoustic principles of speech production. The vocal-tract area function that controls the synthesizer is obtained from the first three formant trajectories through the inverse solution of speech production. The proposed method not only yields synthetic speech with good naturalness and smooth dynamic transitions, but also allows speech timbre to be controlled or modified easily and flexibly. Furthermore, the synthesizer needs few input control parameters and a low parameter update rate.
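As a rough illustration of the scattering process mentioned in the abstract, the sketch below computes junction reflection coefficients from a vocal-tract area function and applies one lossless scattering junction to pressure waves. It uses the textbook Kelly-Lochbaum tube formulation as an assumption; the abstract does not give the paper's actual RTLA equations, and the function names `reflection_coefficients` and `scatter` are hypothetical.

```python
def reflection_coefficients(areas):
    """Pressure-wave reflection coefficients at each junction between
    adjacent tube sections (assumed Kelly-Lochbaum form, not taken
    from the paper): r_k = (A_k - A_{k+1}) / (A_k + A_{k+1})."""
    if len(areas) < 2:
        raise ValueError("need at least two tube sections")
    if any(a <= 0 for a in areas):
        raise ValueError("section areas must be positive")
    return [(a0 - a1) / (a0 + a1) for a0, a1 in zip(areas, areas[1:])]


def scatter(forward_in, backward_in, r):
    """One lossless scattering junction for pressure waves.

    forward_in  : wave arriving at the junction from the glottis side
    backward_in : wave arriving at the junction from the lips side
    Returns (forward_out, backward_out), the waves leaving the junction
    toward the lips and toward the glottis respectively.
    """
    forward_out = (1.0 + r) * forward_in - r * backward_in
    backward_out = r * forward_in + (1.0 - r) * backward_in
    return forward_out, backward_out


if __name__ == "__main__":
    # Hypothetical three-section area function (arbitrary units).
    areas = [3.0, 1.0, 1.0]
    for r in reflection_coefficients(areas):
        print(r, scatter(1.0, 0.0, r))
```

In this form a uniform tube gives zero reflection at every junction, so waves pass through unchanged; updating only the section areas over time is what keeps the number of control parameters small.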
Source
《声学学报》
EI
CSCD
Peking University Core Journals (北大核心)
2000, No. 5, pp. 455-462 (8 pages)
Acta Acustica
Funding
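This research was supported by the National Natural Science Foundation of China (Grant No. 69972046) and the Natural Science Foundation of Zhejiang Province (Grant No. 698076).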
Keywords
Speech production
Articulatory model
Speech synthesis
Acoustic generators
Inverse problems
Models
Speech processing
Trajectories