Research on the Application of Speech Conversion Technology in Telephone Speech Recognition (in English)

Speech Conversion for Telephone Speech Recognition

Abstract: This paper proposes a speech conversion method for improving telephone speech recognition. Clean speech is converted into telephone speech by simulating the factors that degrade speech quality over real telephone connections. Recognition experiments evaluate HMM recognizers trained on clean speech and on the generated telephone speech, both before and after MLLR adaptation. Without adaptation, the models trained on generated data give a recognition rate 18.9% higher than the models trained on clean speech. With MLLR adaptation, the models trained on generated data improve further, raising the recognition rate by an additional 5.8%. The experiments indicate that the proposed conversion method reduces the acoustic mismatch between training and test speech caused by the shortage of real telephone speech, and thereby effectively improves telephone speech recognition performance.
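The abstract does not list which channel factors the authors simulate, so the following is only a minimal sketch of the general idea of turning clean speech into telephone-like speech. It assumes three commonly cited effects: band limiting to roughly 300-3400 Hz, additive line noise, and 8-bit mu-law companding in the style of G.711. The function names, the first-order filters, the 16 kHz input rate, and the noise level are all illustrative assumptions, not the paper's method; resampling to 8 kHz is omitted.

```python
import math
import random

MU = 255.0  # mu-law constant (assumed, G.711-style companding)


def mu_law_compress(x):
    """Map a sample in [-1, 1] into the companded domain [-1, 1]."""
    return math.copysign(math.log1p(MU * abs(x)) / math.log1p(MU), x)


def mu_law_expand(y):
    """Inverse of mu_law_compress."""
    return math.copysign(math.expm1(abs(y) * math.log1p(MU)) / MU, y)


def quantize(y, bits=8):
    """Uniformly quantize a value in [-1, 1] onto 2**bits levels."""
    steps = 2 ** bits - 1
    return round((y + 1.0) / 2.0 * steps) / steps * 2.0 - 1.0


def one_pole_lowpass(samples, cutoff_hz, rate_hz):
    """First-order IIR low-pass; a crude stand-in for a real channel filter."""
    alpha = 1.0 - math.exp(-2.0 * math.pi * cutoff_hz / rate_hz)
    out, state = [], 0.0
    for s in samples:
        state += alpha * (s - state)
        out.append(state)
    return out


def one_pole_highpass(samples, cutoff_hz, rate_hz):
    """Complementary high-pass: the input minus its low-passed version."""
    low = one_pole_lowpass(samples, cutoff_hz, rate_hz)
    return [s - l for s, l in zip(samples, low)]


def simulate_telephone(samples, rate_hz=16000, noise_std=0.005, seed=0):
    """Hypothetical clean-to-telephone conversion: band-limit, add noise, compand."""
    rng = random.Random(seed)  # seeded so the generated data is reproducible
    band = one_pole_lowpass(samples, 3400.0, rate_hz)
    band = one_pole_highpass(band, 300.0, rate_hz)
    out = []
    for s in band:
        noisy = max(-1.0, min(1.0, s + rng.gauss(0.0, noise_std)))
        out.append(mu_law_expand(quantize(mu_law_compress(noisy))))
    return out


if __name__ == "__main__":
    # 10 ms of a 1 kHz tone at 16 kHz, standing in for clean speech.
    tone = [0.5 * math.sin(2 * math.pi * 1000 * n / 16000) for n in range(160)]
    print(simulate_telephone(tone)[:5])
```

In a setup like the one the abstract describes, output of this kind would serve as training data for the recognizer in place of scarce real telephone recordings; a realistic simulation would likely use measured channel responses rather than first-order filters.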

Authors: Zuo Guoyu (左国玉), Liu Wenju (刘文举), Ruan Xiaogang (阮晓钢)

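Affiliations: National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences; School of Electronic Information and Control Engineering, Beijing University of Technology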

Source: Journal of System Simulation (《系统仿真学报》), 2005, No. 2, pp. 448-452, 456 (6 pages). Indexed in EI, CAS, CSCD, and the Peking University Core Journals list.

Funding: National Natural Science Foundation of China (60172055, 60121302); Beijing Natural Science Foundation (4042025).

Keywords: speech conversion; generated telephone speech; speech recognition; MLLR

Classification: TN912.3 [Electronics and Telecommunications: Communication and Information Systems]
