基于语音生成和发音模型的语音合成新方法的探讨被引量：2

Study of a new synthesis method based on speech production model and RTLA model

下载PDF

导出

摘要提出基于语音生成模型和发音模型ＲＴＬＡ合成模式实现以共振峰轨迹为目标的语音合成的新方法。该方法采用了基于发音声学原理的反射型传输线模型来实现语音合成器。用于控制合成器的声道面积函数参数由以三个共振峰轨迹为目标的语音生成逆向解获得。该方法不仅可以得到动态过渡和自然度好的合成语音，能够方便灵活地控制或改变语音音色，合成器所需的输入控制参数少，参数更新率低。 A new method to synthesize formant targeted sounds based on speech production model and RTLA articula- tory synthesis model is studied. The synthesis model is implemented with scattering process derived from reflection-type line analog of vocal tract system according to the acoustic model of speech production. The vocal-tract area function which controls the synthesis model is derived from the first three formant trajectories using the inverse solution of speech production. The proposed method hot only gives good naturalness and dynamic smoothness, but also is capable to control or modify speech timbres easily and flexibly. Furthermore, it needs less number of control parameters and very low system sampling rate.

作者俞振利程伯中

机构地区浙江大学信息与电子工程系香港中文大学电子工程系

出处《声学学报》 EI CSCD 北大核心 2000年第5期455-462,共8页 Acta Acustica

基金本文研究获国家自然科学基金!(项目编号:69972046) 浙江省自然科学基金!(项目编号:698076)资助。

关键词语音生成发音模型语音合成 Acoustic generators Inverse problems Models Speech processing Trajectories

分类号 H017 [语言文字—语言学]

引文网络
相关文献

参考文献7

1俞振利,张礼和,曾尚璀.从语音信号的有限个共振峰频率估计声道面积参数的一个方法[J].电子学报,1997,25(7):32-37. 被引量：3
2刘庆峰,王仁华.基于LMA声道模型的语声合成新方法[J].声学学报,1998,23(3):271-278. 被引量：10
3Wang R H，Proc. ISCSLP’98，1998年，16页被引量：1
4Yu Z，Proc. EUROSPEECH’97，1997年，2551页被引量：1
5Yu Z，Proc. IEEE-ICASSP’96，1996年，369页被引量：1
6初敏，声学学报，1996年，21卷，增刊4期，639页被引量：1
7Yu Z，STL-QPSR，1993年，4卷，77页被引量：1

二级参考文献6

1蔡莲红,魏华武.汉语文-语转换系统的研究与实现[J].应用声学,1994,13(6):1-5. 被引量：5
2Lin Q，STL-QPSR，1992年，4期，29页被引量：1
3Lin Q，博士学位论文被引量：1
4王仁华，Proc of ICSLP96，1996年，1441页被引量：1
5罗恒，第六届语音图象信号处理学术会，1995年，162页被引量：1
6吴宗济，实验语音学概要被引量：1

共引文献11

1彭必雨.美国皮革化学家协会第102届年会论文综述[J].中国皮革,2006,35(17):9-12. 被引量：2
2冯哲,孙吉贵,张长胜,王岩.汉语语音合成的研究进展[J].吉林大学学报（信息科学版）,2007,25(2):198-206. 被引量：7
3刘浩杰,杜利民.语音合成技术的发展与展望[J].微计算机应用,2007,28(7):726-730. 被引量：14
4柳春,于洪志.语音合成技术研究[J].卫生职业教育,2008,26(11):64-66. 被引量：3
5刘庆峰,膝永盛,王仁华.多路实时、高音质数字串合成系统[J].声学学报,1999,24(5):510-515.
6YU Zhenli (Dept. of Information and Electronic Engineering, Zhejiang University Hangzhou 310028) Ching Pak-chung (Dept. of Electronic Engineering, The Chinese University of Hong Kang Shatin, N.T. Hong Kong).A synthesis method based on speech production and articulatory model[J].Chinese Journal of Acoustics,2000,19(2):128-141.
7郑新春,柴佩琪.语音拼接合成中基于“调素”论的时长标尺修改[J].计算机工程,2001,27(3):60-61. 被引量：2
8吴宗济.中国音韵学和语音学在汉语言语合成中的应用[J].语言教学与研究,2002(1):1-14. 被引量：18
9廖吉芳,刘群欣,林军,刘悦.列车智能人机交互装置的研究与应用[J].机车电传动,2018(3):52-56. 被引量：8
10段凯宇,俞一彪,石汝杰.基于基音同步帧叠接的吴语语音合成[J].通信技术,2002,35(3X):1-3. 被引量：3

同被引文献17

1孟子厚.普通话单元音女声共振峰统计特性测量[J].声学学报,2006,31(3):199-202. 被引量：8
2Deng L, Lee L, Attias H, Acero A. Adaptive Kalman filtering and smoothing for tracking vocal tract resonances using a continuous valued hidden dynamic model. IEEE Trans. Speech Audio Process, 2007; 15(1): 13-23 被引量：1
3Deng L, Acero A, Bazzi I. Tracking vocal tract resonances using a quantized nonlinear function embedded in a temporal constraint. IEEE Trans. Speech Audio Process, 2006; 14(2): 425-434 被引量：1
4Deng L, Bazzi I, Acero A. Tracking vocal tract resonances using an analytical nonlinear predictor and a target-guided temporal constraint. Proc. Eurospeech, Geneva, 2003; 1: 73-76 被引量：1
5Zhao Q, Shimamura T, Suzuki J. A robust algorithm for formant frequency extraction of noisy speech. ISCAS'98,Proceedings of the 1998 IEEE International Symposium on Circuits and Systems, Monterey, 1998; 5:534-537 被引量：1
6Zolfaghari P, Robinson T. Formant analysis using mixtures of Gaussians. Proc. ICSLP, Philadelphia, 1996; 2: 1229-1232 被引量：1
7Acero A. Formant analysis and synthesis using hidden Markov models. Proc. of the Eurospeech Conference, Budapest, 1999:1047-1050 被引量：1
8Candy J V. Bootstrap particle filtering. IEEE Signal Processing Magazine, 2007:73-85 被引量：1
9Arulampalam M S, Maskell S, Gordon N. A tutorial on particle filters for online nonlinear/non-Gaussian Bayesian tracking. IEEE Trans. on Signal Processing, 2002; 50(2): 174-188 被引量：1
10Hue C, Cadre J, Perez P. Sequential Monte Carlo methods for multiple target tracking and data fusion. IEEE Trans. on Signal Processing, 2002; 50(2): 309-325 被引量：1

引证文献2

1王叶斌,赵鹤鸣.辅助变量粒子滤波的声道共振特性轨迹跟踪[J].声学学报,2009,34(3):275-280.
2WANG Yebin ZHAO Heming.Vocal tract resonances tracking by auxiliary vector particle filters[J].Chinese Journal of Acoustics,2011,30(1):105-114.

1陈敏锋.移动IPv6协议的安全问题分析与对策[J].无锡商业职业技术学院学报,2006,6(3):23-25.
2陈金通.浅谈频率合成技术在电视发射机中的应用[J].科技信息,2008(16):104-104.
3程启明,俞振利,张礼和.基于语音生成逆向解的嘶音合成方法[J].科技通报,2001,17(5):6-9.
4陈洪财.语音生成及传输的电学模拟[J].佳木斯大学学报（自然科学版）,2000,18(2):153-155. 被引量：1
5薛化建,董兴华,周喜,吐尔洪.吾司曼,李晓.基于子字单元的维吾尔语语音识别研究[J].计算机工程,2011,37(20):208-210. 被引量：5
6冷京.小波变换在语音变速上的应用[J].上海师范大学学报（自然科学版）,1999,28(1):44-50. 被引量：1
7栗学丽.基于Pd的“语音信号处理”教学探索[J].电气电子教学学报,2013,35(5):85-87.
8俞振利,程伯中.用矢量码书和动态内插限制方法解决语音生成逆向解的非唯一性问题[J].电子学报,2000,28(4):52-56. 被引量：1
9于天虎.中小学网管人员的德、能、勤[J].中小学信息技术教育,2010(3):1-1.
10地平线[J].计算机光盘软件与应用（COMPUTER ARTS数码艺术）,2005(3):112-113.

声学学报

2000年第5期

浏览历史

内容加载中请稍等...

基于语音生成和发音模型的语音合成新方法的探讨被引量：2

参考文献7

二级参考文献6

共引文献11

同被引文献17

引证文献2

相关作者

相关机构

相关主题

浏览历史

基于语音生成和发音模型的语音合成新方法的探讨 被引量：2

参考文献7

二级参考文献6

共引文献11

同被引文献17

引证文献2

相关作者

相关机构

相关主题

浏览历史

基于语音生成和发音模型的语音合成新方法的探讨被引量：2