Journal Article

A modified voice conversion algorithm using compressed sensing (Cited by: 8)

Abstract: A voice conversion algorithm that exploits the information between consecutive speech frames through compressed sensing is proposed in this paper. Based on the sparsity of the concatenated vector of several consecutive Linear Spectrum Pair (LSP) frames in the discrete cosine transform domain, compressed sensing is used to extract a compressed vector from the concatenated LSPs, which is then used as the feature vector to train the conversion function. Evaluation results show that, when an appropriate number of speech frames is chosen, this approach improves performance by 3.21% on average over the conventional algorithm based on weighted frequency warping. The experimental results also show that the performance of a voice conversion system can be improved by taking full advantage of inter-frame information, because this information helps the converted speech retain the more stable acoustic properties that are inherent across frames.
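The feature-extraction step described in the abstract can be sketched in a few lines of Python. The sketch below is an illustration only: the frame count, LSP order, measurement dimension, the Gaussian measurement matrix, and the helper name compress_lsp_frames are all assumptions, not details taken from the paper.

```python
import numpy as np

# Hypothetical parameters: the abstract does not give the frame count, the LSP
# order, the measurement dimension, or the measurement matrix, so the values
# and names below are assumptions for illustration only.
N_FRAMES = 4         # consecutive speech frames concatenated into one vector
LSP_ORDER = 16       # LSP coefficients per frame
M = 24               # length of the compressed feature vector (M < N)

rng = np.random.default_rng(0)
N = N_FRAMES * LSP_ORDER

# Gaussian random measurement matrix, a common choice in compressed sensing.
Phi = rng.standard_normal((M, N)) / np.sqrt(M)

# Orthonormal DCT-II basis: the concatenated LSP vector is assumed to be
# approximately sparse in this domain, which is what justifies compressing it.
k = np.arange(N)
Psi = np.sqrt(2.0 / N) * np.cos(np.pi * (2 * k[None, :] + 1) * k[:, None] / (2 * N))
Psi[0, :] = np.sqrt(1.0 / N)

def compress_lsp_frames(lsp_frames, measurement_matrix):
    """Concatenate consecutive LSP frames and project the long vector onto a
    low-dimensional measurement y = Phi @ x, used as the training feature."""
    x = np.concatenate(lsp_frames)             # length N = N_FRAMES * LSP_ORDER
    return measurement_matrix @ x              # length M compressed feature

# Dummy example: four frames of 16th-order LSPs (sorted frequencies in (0, pi)).
frames = [np.sort(rng.uniform(0.0, np.pi, LSP_ORDER)) for _ in range(N_FRAMES)]
theta = Psi @ np.concatenate(frames)           # DCT coefficients (ideally mostly small)
y = compress_lsp_frames(frames, Phi)
print(y.shape)                                 # -> (24,)
```

Recovering the concatenated LSPs from y at synthesis time would additionally require a sparse reconstruction step (for example an l1-minimization solver), which the abstract implies but does not spell out.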
Source: Chinese Journal of Acoustics (English edition of Acta Acustica), 2014, No. 3, pp. 323-333 (11 pages)
Funding: Supported by the National Natural Science Foundation of China (61201301) and the Program of Zhejiang Provincial Education Department (Y201016542)

References: 3

Secondary references: 89

  • 1. Zuo Guoyu, Liu Wenju, Ruan Xiaogang. Research and progress of voice conversion technology [J]. Acta Electronica Sinica, 2004, 32(7): 1165-1172. (Cited by: 32)
  • 2. H Kuwabara and Y Sagisaka. Acoustic characteristics of speaker individuality: control and conversion [J]. Speech Communication, 1995, 16(2): 165-173. (Cited by: 1)
  • 3. D Klatt and L C Klatt. Analysis, synthesis, and perception of voice quality variations among female and male talkers [J]. J Acoust Soc Am, 1990, 87(2): 820-857. (Cited by: 1)
  • 4. P H Milenkovic. Voice source model for continuous control of pitch period [J]. J Acoust Soc Am, 1993, 93(2): 1087-1096. (Cited by: 1)
  • 5. H Matsumoto, et al. Multidimensional representation of personal quality of vowels and its acoustical correlates [J]. IEEE Trans Audio and Electroacoustics, 1973, 21(5): 428-436. (Cited by: 1)
  • 6. S Furui. Research on individuality features in speech waves and automatic speaker recognition techniques [J]. Speech Communication, 1986, 5(2): 183-197. (Cited by: 1)
  • 7. K S Lee, et al. A new voice transformation based on both linear and nonlinear prediction [A]. Proc ICSLP [C]. Philadelphia, USA: ESCA, 1996. 1401-1404. (Cited by: 1)
  • 8. L M Arslan. Speaker transformation algorithm using segmental codebooks (STASC) [J]. Speech Communication, 1999, 28(3): 211-226. (Cited by: 1)
  • 9. H Mizuno and M Abe. Voice conversion algorithm based on piecewise linear conversion rules of formant frequency and spectrum tilt [J]. Speech Communication, 1995, 16(2): 165-173. (Cited by: 1)
  • 10. T Yoshimura, et al. Speaker interpolation in HMM-based speech synthesis system [A]. Proc. Eurospeech [C]. Rhodes, Greece: ESCA, 1997. 2523-2526. (Cited by: 1)

Literature sharing references with this article: 44

Literature co-cited with this article: 37

Citing literature: 8

Secondary citing literature: 30
