期刊文献+

面向窄带通信的极低速率语音编码算法研究 被引量:1

Research on Speech Coding Algorithm at Very Low Bit Rate for Narrow-Band Communication
下载PDF
导出
摘要 提出了一种面向窄带通信的极低速率参数语音编码算法。在2.4kbps MELP标准的基础上结合听觉感知,对线谱对参数进行联合矢量量化、对基音周期进行内插和非线性量化、对能量参数进行高效压缩,可以使语音数据在0.5kbps下匀速传输;线谱对参数的预测残差用于矢量量化,这是一种提高合成语音的音质的有效方法。实验结果表明,采用本文提出的语音编码算法可以使语音数据在极低码率下有效的传输,解码端合成的语音具有较高的可懂度。 A speech coding algorithm applied to narrowband transmission at very low bit rate is proposed. Based on the MELP standard working at 2.4kbps, joint vector quantization is applied to linear spectrum pair' s parameters, interpolation and nonlinear quantization are applied to pitch, gain parameters are compressed efficiently and auditory perception is con- sidered, speech data could be coded and transmitted smoothly at 0.5kbps The predicative residuals of linear spectrum pair's parameters are also used for vector quantization; it' s a suitable way of improving synthetic speech quality. The experimen- tal results reveal that the proposed algorithm can ensure speech effective transmission at very low bit rate while the intelligi- bility of received speech is acceptable.
出处 《信号处理》 CSCD 北大核心 2013年第9期1134-1141,共8页 Journal of Signal Processing
基金 国家自然科学基金(61273288,61233009,61203258,61011140075,90820303) 中国新加坡数字媒体研究院资助项目(CSIDM)
关键词 联合矢量量化 非线性量化 预测残差 听觉感知 joint vector quantization nonlinear quantization predicative residuals auditory perception
  • 相关文献

参考文献14

  • 1Tokuda K,Masuko T,Hiroi J,et al.A very low bit rate speech coder using HMM-based speech recognitiorn/synthesis techniques[C]//Acoustics,Speech and Signal Processing,1998.Proceedings of the 1998 IEEE International Conference on.IEEE,1998,2:609-612. 被引量:1
  • 2Crosmer J,Bamwell Ⅲ T.A low bit rate segment vocoder based on line spectrum pairs[C]//Acoustics,Speech,and Signal Processing,IEEE International Conference on ICASSP'85.IEEE,1985,10:240-243. 被引量:1
  • 3Wang T,Koishida K,Cuperman V,et al.A 1200 bps speech coder based on MELP[C]//Acoustics,Speech,and Signal Processing,2000.ICASSP'00.Proceedings.2000 IEEE International Conference on.IEEE,2000,3:1375-1378. 被引量:1
  • 4Guilmin G,Capman F,Ravera B,et al.New NATO STANAG narrow band voice coder at 600 bits/s[C]//Acoustics,Speech and Signal Processing,2006.ICASSP 2006Proceedings.2006 IEEE International Conference on.IEEE,2006. 被引量:1
  • 5Zou X,Wen C,Zhang X,et al.An improved 600bps speech coding based on joint quantization of pitch and gain shape[C]//Communication Technology (ICCT),2010 12th IEEE International Conference on.IEEE,2010:1303-1306. 被引量:1
  • 6Wu C,Jiang H,Li B.An improved MELP speech coder[C]//Information Technology and Computer Science,2009.ITCS 2009.International Conference on.IEEE,2009,2:130-133. 被引量:1
  • 7Unver E,Villette S,Kondoz A.Joint quantisation strategies for low bit-rate sinusoidal coding[J].Signal Processing,IET,2010,4(5):548-559. 被引量:1
  • 8季云云,杨震.基于主分量分析的语音信号压缩感知[J].信号处理,2011,27(7):1057-1062. 被引量:3
  • 9肖强,陈亮,朱涛,黄建军.基于压缩感知的线谱对参数降维量化算法[J].信号处理,2011,27(4):563-568. 被引量:1
  • 10Boucheron L E,Leon P L D,Sandoval S.Hybrid scalar/ vector quantization of mel-frequency cepstral coefficients for low bit-rate coding of speech[C]//Data Compression Conference (DCC),2011.IEEE,2011:103-112. 被引量:1

二级参考文献33

  • 1刘刚,刘晶,王泉.一种基于覆盖域密度的LBG算法[J].计算机应用,2008,28(S2):319-321. 被引量:2
  • 2张春梅,尹忠科,肖明霞.基于冗余字典的信号超完备表示与稀疏分解[J].科学通报,2006,51(6):628-633. 被引量:70
  • 3Gwenael G, Francois C, Bertrand R, et al. New NATO STANAG narrow band voice coder at 600bits/s [ C ]. In Proc. IEEE Int. Conf. Acousic Speech Signal Processing,Toulouse,France,2006:689-692. 被引量:1
  • 4Paliwal K K, Atal B S. Efficient vector quantization of LPC parameters at 24bits/frame [ J ]. IEEE Trans on speeeh and audio processing, 1993,1 (1) :3-14. 被引量:1
  • 5Leblanc W P, Bhattacharya B, Mahmoud S A, et al. Efficient search and design procedures for robust multi-stage VQ of LPC parameters for 4 Kb/s speech coding [ J ].IEEE Trans on speech and audio processing, 1993,1 (4) : 373 -385. 被引量:1
  • 6ZOU Xia, ZHANG Xiong-wei. Efficient coding of LSF parameters using multi-mode predictive multistage matrix quantization[ C ]. IEEE lnteruational Conference on Signal Processing, Beijing, China,2008:542-545. 被引量:1
  • 7Subasingha S, Murthi M N, Andersen S V. On gram kalman predictive coding of lsfs for packet loss [ C ]. In Proc. IEEE Int. Conf. Acousic Speech Signal Processing, Taibei, China, 2009 : 4105 - 4108. 被引量:1
  • 8ozaydin S, Baykal B. Matrix quantization and mixed excitation based linear predictive speech coding at very low bit rates[ J ]. Speech Communication ,2003,41:381-392. 被引量:1
  • 9Xydeas C S, Papanastasiou C. Split matrix quantization of LPC parameters [ J ]. IEEE Trans on speech and audio processing, 1999,7 ( 2 ) : 113-125. 被引量:1
  • 10ZOU Xia, ZHANG Xiong-wei, ZHANG Ya-fei. A 300bps speech coding algorithm based on multi-mode matrix quantization[ CI. IEEE International Conference on Wireless Communictions and Signal Processing, Nanjing, China ,2009 : 1-4. 被引量:1

共引文献5

同被引文献12

引证文献1

二级引证文献8

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部