Study of low bit rate speech codec algorithm in underwater acoustic communication 被引量：4

Study of low bit rate speech codec algorithm in underwater acoustic communication

导出

摘要 At medium or long distance （〉 10 kin） underwater acoustic speech communication, information transfer rate is constrained by the complicated, time varying channel and limited bandwidth. The bit rate of speech coding is required to be as low as possible. The time delay of underwater acoustic wave propagation can be used for low bit rate speech coding. After investigating the Mixed Excitation Linear Prediction （MELP） standard and taking account of the auditory perceptual features, a variable and adjustable bit rate speech codec algorithm has been proposed, whose average bit rate is about 600 bps. The average Perceptual Evaluation of Speech Quality Mean Opinion Score （PESQ MOS） of synthesized speeches is about 2.8. It has been proved by the computer simulation and sea trial that the performance of the proposed algorithm is well and robust when bit error rate is no more than 10-3. The synthesized speech is vivid and intelligible, and keeps main individual characteristics of speaker. At medium or long distance （〉 10 kin） underwater acoustic speech communication, information transfer rate is constrained by the complicated, time varying channel and limited bandwidth. The bit rate of speech coding is required to be as low as possible. The time delay of underwater acoustic wave propagation can be used for low bit rate speech coding. After investigating the Mixed Excitation Linear Prediction （MELP） standard and taking account of the auditory perceptual features, a variable and adjustable bit rate speech codec algorithm has been proposed, whose average bit rate is about 600 bps. The average Perceptual Evaluation of Speech Quality Mean Opinion Score （PESQ MOS） of synthesized speeches is about 2.8. It has been proved by the computer simulation and sea trial that the performance of the proposed algorithm is well and robust when bit error rate is no more than 10-3. The synthesized speech is vivid and intelligible, and keeps main individual characteristics of speaker.

作者 XIAO Dong MO Fuyuan CHEN Geng GUO Shengming MA Li

机构地区 Key Laboratory of Underwater Acoustic Environment University of Chinese Academy of Sciences Institute of Acoustics

出处《Chinese Journal of Acoustics》 2013年第4期411-424,共14页 声学学报（英文版）

基金 supported by the National Natural Science Foundation of China(61102152)

关键词 MELP Study of low bit rate speech codec algorithm in underwater acoustic communication RATE

分类号 TN912.3 [电子电信—通信与信息系统] TN929.3 [电子电信—信息与通信工程]

引文网络
相关文献

参考文献2

1吴江滨,丛键,苏旸.一种600bps声码器及其与典型中低速语音编码器的音质对比[J].中国电子科学研究院学报,2007,2(1):93-96. 被引量：5
2张晴晴,潘接林,颜永红.基于发音特征的汉语普通话语音声学建模[J].声学学报,2010,35(2):254-260. 被引量：14

二级参考文献21

1钱跃良,林守勋,刘群,刘宏.2005年度863计划中文信息处理与智能人机接口技术评测回顾[J].中文信息学报,2006,20(B03):1-6. 被引量：4
2吴宗济.普通话语句中的声调变化[J].中国语文,1982,6:439-449. 被引量：25
3Kirchhoff K. Robust speech recognition using articulatory information. PhD thesis, University of Bielefeld, Germany, 1999. 被引量：1
4Livescu K et al. Articulatory feature-based methods for acoustic and audio-visual speech recognition: JHU Summer Workshop Final Report. Technical report, Johns Hopkins University Center for Language and Speech Processing, 2007. 被引量：1
5Cetin Oet al. An articulatory feature-based tandem approach and factored tandem observation modeling, in ICASSP, 2007; 4: 645-648, ISBN: 1-4244-0727-3. 被引量：1
6Cetin O, Magimai-Doss M, Livescu K, Kantor A, King S, Bartels C, Frankel J. Monolingual and crosslingual comparison of tandem features derived from articulatory and phone MLPs. in Proc. ASRU, 2007:36-41. 被引量：1
7吴宗济.试论普通话语音的“区别特征”及其相互关系[J].中国语文,1982,(6). 被引量：1
8Peter Ladefoged. A course in phonetics. Third Edition, P11-P13, University of California, Los Angeles, 1993. 被引量：1
9Papcun J, Hochberg T R, Thomas F, Larouche J, Zacks J, Levy S. In ferring articulation and recognizing gestures from acoustics with a neural network trained on X-ray microbeam data. J. Acoust. Soc. Amer., 1992; 92(2): 688- 700. 被引量：1
10Schroeter J, Sondhi M M. Techniques for estimating vocaltract shapes from the speech signal. IEEE Trans. Speech Audio Process, 1994(2): 133-150. 被引量：1

共引文献16

1邵健,赵庆卫,颜永红.基于鼻韵尾分离的汉语声韵母识别模型[J].声学学报,2010,35(5):587-592. 被引量：3
2李倩,庄琳,达钊,郭霞生,章东.讲话人识别系统的鼻腔参数研究[J].声学技术,2012,31(3):291-295.
3肖东,莫福源,陈庚,马力.码率可调节的高质量语音编码算法[J].哈尔滨工程大学学报,2012,33(8):956-965.
4秦春香,黄浩.发音特征在维汉语音识别中的应用[J].计算机工程,2012,38(23):177-180.
5肖东,莫福源,陈庚,郭圣明,马力.水声通信中低码速率语音编码算法的研究[J].声学学报,2013,38(5):589-596. 被引量：4
6晁浩,杨占磊,刘文举.汉语语音识别中声学界标点引导的随机段模型解码算法[J].计算机科学,2013,40(10):208-212. 被引量：1
7ZHANG Long,LI Haifeng,MA Lin,WANG Jianhua.Automatic detection and evaluation of Erhua in the Putonghua proficiency test[J].Chinese Journal of Acoustics,2014,33(1):83-96.
8朱敏杰,王海,梁伟.一种低速率语音编解码系统设计[J].实验室研究与探索,2014,33(1):115-118. 被引量：2
9张珑,李海峰,马琳,王建华.汉语普通话水平测试中儿化音的自动检测与评价[J].声学学报,2014,39(5):639-646. 被引量：2
10晁浩,杨占磊,刘文举.汉语语音识别中融合发音信息的随机段模型研究[J].计算机应用研究,2014,31(11):3365-3368. 被引量：1

同被引文献32

1蔡惠智,刘云涛,蔡慧,邓红超,王永丰.第八讲水声通信及其研究进展[J].物理,2006,35(12):1038-1043. 被引量：26
2Kumar M and Tiwari S.Performance evaluation of conventional and wavelet based OFDM system[J].Journal of Electronics and Communication,2013,67(4):348-354. 被引量：1
3Abdallaa A,Ferreiraa R,and Shahparia A.Improved nonlinear tolerance in ultra-dense WDM OFDM system[J].Optics Communications,2014,325(1):88-93. 被引量：1
4Huang Xiao-peng and Lawrence V B.Capacity criterion- based bit and power loading for shallow water acoustic OFDM system with limited feedback[C].2011 IEEE 73rd Vehicular Technology Conference,Budapest,2011:372-377. 被引量：1
5Xu X K,Wang Z H,Zhou S L,et al..Parameterizing both path amplitude and delay variations of underwater acoustic channels for block decoding of orthogonal frequency division multiplexing[J].Acoustical Society of America,2012,131(6) :4672-4679. 被引量：1
6Hai H N,Soh W S,and Motani M.A Bidirectional-Concur- rent MAC protocol with packet bursting for underwater acoustic networks[J].IEEE Journal of Oceanic Engineering,2013,38(3):547-565. 被引量：1
7Liao W H and Huang C C.SF-MAC:a spatially fair MAC protocol for underwater acoustic sensor networks[J].IEEE Sensors Journal,2012,12(6):1686-1694. 被引量：1
8Lei Yang and Long Zhou.Adaptive bit loading algorithm for OFDM underwater acoustic communication system[C].2011 International Conference on Electronics and Optoelectronics (CEOE 2011),Dalian,China,2011:350-352. 被引量：1
9Huang Xiao-peng.Capacity criterion-based power loading for underwater acoustic OFDM system with limited feedback [C].Wireless Communications,Networking and Information Security,Beijing,2010:54-58. 被引量：1
10杨琳,张建平,颜永红.单通道语音增强算法对汉语语音可懂度影响的研究[J].声学学报,2010,35(2):248-253. 被引量：17

引证文献4

1罗亚松,许江湖,胡洪宁,贺静波,陈占伟.正交频分复用传输速率最大化自适应水声通信算法研究[J].电子与信息学报,2015,37(12):2872-2876. 被引量：6
2路成,田猛,周健,王华彬,陶亮.L_(1/2)稀疏约束卷积非负矩阵分解的单通道语音增强方法[J].声学学报,2017,42(3):377-384. 被引量：10
3罗亚松,胡生亮,刘志坤,吕显春.正交频分复用水声通信自适应调制算法[J].国防科技大学学报,2017,39(1):153-158. 被引量：32
4江伟华,郑思远,童峰,李斌.时变水声信道的动态压缩感知估计[J].声学学报,2019,44(3):360-368. 被引量：8

二级引证文献54

1庞立臣,胡涛,鹿力成,郭圣明,马力.负梯度水文环境下海山对声传播的影响[J].声学学报,2020,45(1):45-54. 被引量：4
2林涛,郭建松.自导飞行器控制指令传输均衡技术优化设计[J].智能计算机与应用,2020,10(7):166-169.
3郭建松,林涛.基于高速DSP的水下声探测系统设计[J].智能计算机与应用,2020(7):109-112. 被引量：1
4周仰平.用ASP技术实现在WEB网页上浏览目录及文件[J].电脑编程技巧与维护,2000(5):69-71.
5陈应琪.环境标志：人类生存的天使[J].乡镇企业之家,2000(4):8-9.
6陈杰男,费超,袁建生,曾维棋,卢浩,胡剑浩.超高速全并行快速傅里叶变换器[J].电子与信息学报,2016,38(9):2410-2414. 被引量：4
7普湛清,王巍,张扬帆,李宇,黄海宁.UUV平台OFDM水声通信时变多普勒跟踪与补偿算法[J].仪器仪表学报,2017,38(7):1634-1644. 被引量：14
8周健,刘荣敏,窦云峰,路成,陶亮.采用L1/2稀疏约束的梅尔倒谱系数语音重建方法[J].声学学报,2018,43(6):991-999. 被引量：5
9荆旭龙.混合OFDM调制在强化纯电动汽车通信信号中的应用[J].长春工程学院学报（自然科学版）,2019,0(3):16-19.
10周桂莉,李有明,付彩梅,余明宸,常生明,王旭芃.OFDM水声通信系统中基于中继选择的资源分配算法[J].应用声学,2017,36(2):182-188. 被引量：3

1IP电话的通话质量评价[J].通信工程,2004(2):49-49.
2SHU Xiujun,WANG Haibin,WANG Jun,YANG Xiaoxia.A method of multichannel chaotic phase modulation spread spectrum and its application in underwater acoustic communication[J].Chinese Journal of Acoustics,2017,36(1):130-144. 被引量：7
3HUI Junying,LIU li,FENG Haihong,LIU hong(Underwater Acoustics Institute, Harbin Engineering University Harbin 150001).A study on pattern time delay shift coding underwater acoustic communication[J].Chinese Journal of Acoustics,1999,18(2):105-120.
4HUANG Yong feng,ZHANG Jiang ling National Storage System Lab, School of Computer, Huazhong University of Science & Technology, Wuhan 430074,China.Implementation of ITU-T G.729 Speech Codec in IP Telephony Gateway[J].Wuhan University Journal of Natural Sciences,2000,5(2):159-163.
5Jingwei Yin,Pengyu Du,Guang Yang,Huanling Zhou.Space-division multiple access for CDMA multiuser underwater acoustic communications[J].Journal of Systems Engineering and Electronics,2015,26(6):1184-1190.
6马振,张雄伟,杨吉斌.基于语音个人特征信息分离的语音转换方法研究[J].信号处理,2013,29(4):513-519. 被引量：3
7YANG Zhen (Nanjing University of Posts & Telecommunications, Nanjing 210003, P.R.China).A New Speech Codec Based on ANN with Low Delay[J].The Journal of China Universities of Posts and Telecommunications,2002,9(4):1-7.
8杭州VOD用户个人特征与点播偏好关系分析[J].广告大观（媒介版）,2010(1):111-111.
9张戈.探秘Vivid audio[J].视听前线,2008(11):84-86.
10实力非凡后来居上——Hi-End音箱新贵Vivid Audio[J].高保真音响,2006(4):18-21.

Chinese Journal of Acoustics

2013年第4期

浏览历史

内容加载中请稍等...