Study of low bit rate speech codec algorithm in underwater acoustic communication (Cited by: 4)

Abstract: In medium- and long-range (> 10 km) underwater acoustic speech communication, the information transfer rate is constrained by the complicated, time-varying channel and the limited bandwidth, so the bit rate of speech coding must be as low as possible. The time delay of underwater acoustic wave propagation can be exploited for low bit rate speech coding. Based on a study of the Mixed Excitation Linear Prediction (MELP) standard and taking auditory perceptual features into account, a variable, adjustable bit rate speech codec algorithm is proposed, with an average bit rate of about 600 bps. The average Perceptual Evaluation of Speech Quality Mean Opinion Score (PESQ MOS) of the synthesized speech is about 2.8. Computer simulation and a sea trial show that the proposed algorithm performs well and is robust when the bit error rate is no more than 10^-3. The synthesized speech is vivid and intelligible, and retains the main individual characteristics of the speaker.
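To give a feel for the numbers in the abstract, the rough sketch below works out the one-way propagation delay at 10 km and the average bit budget per frame at 600 bps. It is illustrative arithmetic only: the sound speed of 1500 m/s and the 22.5 ms frame length (the frame length of standard 2400 bps MELP) are assumptions, not values stated in the abstract, and the function names are hypothetical. How the authors actually exploit the delay is not described here.

```python
# Illustrative arithmetic only. Both constants are assumptions,
# not values taken from the abstract.
SOUND_SPEED_M_PER_S = 1500.0  # assumed nominal speed of sound in seawater
FRAME_MS = 22.5               # assumed frame length (standard 2400 bps MELP)


def propagation_delay_s(distance_m, sound_speed=SOUND_SPEED_M_PER_S):
    """One-way acoustic propagation delay in seconds."""
    return distance_m / sound_speed


def bits_per_frame(bit_rate_bps, frame_ms=FRAME_MS):
    """Average number of bits available per frame at a given bit rate."""
    return bit_rate_bps * frame_ms / 1000.0


def frames_within(delay_s, frame_ms=FRAME_MS):
    """Number of whole frames that fit inside a given delay."""
    return int(delay_s * 1000.0 // frame_ms)


if __name__ == "__main__":
    delay = propagation_delay_s(10_000)  # 10 km link
    print(f"one-way delay at 10 km: {delay:.2f} s")
    print(f"bits per frame at 2400 bps: {bits_per_frame(2400):.1f}")
    print(f"bits per frame at 600 bps: {bits_per_frame(600):.1f}")
    print(f"whole frames within the delay: {frames_within(delay)}")
```

Under these assumptions the channel already imposes several seconds of latency, so a codec can buffer many frames before transmitting without noticeably changing the end-to-end delay.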
Source: Chinese Journal of Acoustics (声学学报(英文版)), 2013, No. 4, pp. 411-424 (14 pages)
Funding: Supported by the National Natural Science Foundation of China (61102152)