一种基于非参数贝叶斯理论的语音增强算法

Speech Enhancement Based on Nonparametric Bayesian Method

下载PDF

导出

摘要提出一种基于非参数贝叶斯理论的语音增强算法,在稀疏表示的框架下,把字典学习、稀疏系数表示和噪声方差估计融合成一个贝叶斯后验估计的过程,并利用Spike-Slab先验加强稀疏性.首先,将带噪语音分解为干净语音、高斯噪声和残余噪声3个子信号,分别对该3种子信号采用不同的先验概率模型表达,接着采用马尔科夫链-蒙特卡洛算法计算出3个模型中每个参数对应的后验概率,最后基于稀疏表示的框架重构出干净语音.实验数据使用NOIZEUS语音库,采用PESQ和SegSNR作为质量评价指标,分别在信噪比为0,5和10dB的高斯白噪声、火车噪声和街道噪声上验证了其可行性,并与多种常用语音增强方法进行对比,发现其在低信噪比非平稳噪声情况下的增强效果更为理想. A new speech enhancement strategy is proposed by utilizing a nonparametrie Bayesian method with Spike-blab priori（NBSP）.As a sparse representation framework,the dictionary learning,sparse coefficients representation and noise variance estima-tion are replaced by a single procedure of Bayesian posterior estimation.First,the noisy speech is divided into clean speech,Gaussiannoise and rest noise.Then,each part is modeled with a certain priori distribution.Finally, upon the adoption of Markov Chain MonteCarlo sampling algorithm, the posterior distribution can be obtained, as the clean speech and all other parameters.Without knowingthe noise varianee,NBSPeould be performed directly on the noisy speech to infer the sparsity of the speech.Experiments were execu-ted on NOIZEUS database.Experiments are executed on noisy speeches from NOIZEUS database with SNR ranging from 0 dB to 10dB,which contain three types of noise （white,train and street）. And the subjective and objective measures like PESQ score and theoutput SegSNR are implemented to evaluate the performance of NBSP and the other state-of-the-art methods.Corresponding resultsshow that NBSP achieves better performances,especially in conditions of non-stationary noise with low input SNR.

作者吴佳雯刘沁婷曾德炉丁兴号李琳

机构地区厦门大学信息科学与技术学院华南理工大学数学学院

出处《厦门大学学报（自然科学版）》 CAS CSCD 北大核心 2017年第3期423-428,共6页 Journal of Xiamen University：Natural Science

关键词稀疏表示非参数贝叶斯 Spike-Slab先验自适应字典语音增强 sparse representation nonparametrie Bayesian estimation Spike-Slab priori dictionary learning speech enhancement

分类号 TN912 [电子电信—通信与信息系统]

引文网络
相关文献

参考文献2

1ZHAO Nan XU Xin YANG Yi.Sparse Representations for Speech Enhancement[J].Chinese Journal of Electronics,2011,20(2):268-272. 被引量：9
2Ding Xinghao,Mi Zengyuan,Huang Yue,Jin Wenbo.ROBUST RVM BASED ON SPIKE-SLAB PRIOR[J].Journal of Electronics(China),2012,29(6):593-597. 被引量：2

二级参考文献11

1V. N. Vapnik. The nature of statistical learningtheory. 1995. 被引量：1
2M. E. Tipping. Sparse Bayesian learning and therelevance vector machine. Journal of MachineLearning Research, 1(2001), 211-244. 被引量：1
3H. Takeda, S. Farsiu, and P. Milanfar. Robust kernelregression for restoration and reconstruction ofimages from sparse noisy data. IEEE InternationalConference on Image Processing, Atlanta, GA, Oct.2006, 1257-1260. 被引量：1
4B. Demir and S. ErtUrk. Hyperspectral Imageclassification using relevance vector machines. IEEEGeoscience and Remote Sensing Letters, 4(2007)4,586—590. 被引量：1
5A. Veeraxaghavan, K. Mitra, and R. Chellappa.Robust RVM regression using sparse outlier model.IEEE Computer Vision and Pattern RecognitionConference, San Francisco, U.S., June 2010,1887-1894. 被引量：1
6Pang Xun and Jeff Gill. Spike and slab priordistributions for simultaneous bayesian hypothesistoting, model selection, and prediction, of nonlinearoutcome. Working Paper, Washington University inSt. Louis, http://poly_meth. wustl.edu/mediaDetail.php?docid=914, July 13,2009. 被引量：1
7T. J. Mitchell and J. J. Beauchamp. Bayian variableselection in linear regression. Journal of the AmericanStatistical Association, 83(1988), 1023-1036. 被引量：1
8I. G. Eduward and R. E. McCulloch. Variableselection via gibbs sampling. Journal of the AmericanStatistical Association, 88(1993), 881-889. 被引量：1
9B. Chen, J. Paisley, and L. Carin. Sparse linearregression with beta process priors. IEEE AcousticsSpeech and Signal Processing Conference, Dallas,U.S., March 2010, 1234-1237. 被引量：1
10M. Aharon, M. Elad, and A. M. Bruckstein. K-SVD:An algorithm for designing over complete dictionariesfor sparse representation. IEEE Transactions onSignal Processing、54(2006)11, 4311-4322. 被引量：1

共引文献9

1马振,张雄伟,杨吉斌.一种基于K-SVD的说话人识别方法[J].计算机工程与应用,2012,48(34):112-115. 被引量：2
2马振,张雄伟,杨吉斌.基于语音个人特征信息分离的语音转换方法研究[J].信号处理,2013,29(4):513-519. 被引量：3
3黄玲,李琳,王薇,易才钦,郭东辉.基于Sparse K-SVD学习字典的语音增强方法[J].厦门大学学报（自然科学版）,2014,53(1):36-40. 被引量：9
4胡正平,高红霄,赵淑欢.基于低秩分解的联合动态稀疏表示多观测样本分类算法[J].电子学报,2015,43(3):440-446. 被引量：3
5文元美,钟鸿科,许根鹏.基于一步字典学习的语音增强方法[J].计算机应用,2016,36(A02):214-217. 被引量：1
6周伟力,贺前华,王亚楼,庞文丰.基于自适应逼近残差的稀疏表示语音降噪方法[J].电子与信息学报,2017,39(2):309-315. 被引量：4
7彭威,张祺威.基于字典学习的碰摩声发射信号降噪算法[J].电子器件,2019,42(1):116-119. 被引量：2
8李桂会,李晋江,范辉.自适应匹配追踪图像去噪算法[J].计算机科学,2020,47(1):176-185. 被引量：4
9姜峰,霍彦明,李争.稀疏表示及区分性联合字典学习语音降噪算法[J].小型微型计算机系统,2020,41(5):985-989.

1Xie Zongbo Feng Jiuchao.AN ALGORITHM FOR DICTIONARY GENERATION IN SPARSE REPRESENTATION[J].Journal of Electronics(China),2009,26(6):836-841.
2朱思白,张燕虹.对52个子公司统一了线路科和PCB设计[J].最新电子技术应用,1997,2(4):39-41.
3沈亚强,程仲文.一种基于自适应滤波的语音增强方法[J].信号处理,1993,9(1):9-14. 被引量：5
4杨宁侠,赵红录.基于蒙特卡洛算法的WCDMA网络规划方法研究[J].科学大众（智慧教育）,2015(1).
5柯熙政,解孟其,高海涛,李铁.自由空间光通信中的空时网格码[J].红外与激光工程,2012,41(4):1022-1027. 被引量：7
6刘源,王洪先,纠博,严俊坤,赵永波,刘宏伟.米波MIMO雷达低空目标波达方向估计新方法[J].电子与信息学报,2016,38(3):622-628. 被引量：8
7张雪英,贾海蓉,靳晨升.子空间与维纳滤波相结合的语音增强方法[J].计算机工程与应用,2011,47(14):146-148. 被引量：6
8侯丽霞,曾以成,熊民权.EMD与自适应滤波相结合的语音增强法[J].计算机工程与应用,2012,48(9):104-107. 被引量：3
9李从鹤,郑辉.自适应字典压缩算法中的误码传播分析[J].电讯技术,2005,45(5):50-53. 被引量：1
10黄玲,李琳,王薇,易才钦,郭东辉.基于Sparse K-SVD学习字典的语音增强方法[J].厦门大学学报（自然科学版）,2014,53(1):36-40. 被引量：9

厦门大学学报（自然科学版）

2017年第3期

浏览历史

内容加载中请稍等...

一种基于非参数贝叶斯理论的语音增强算法

参考文献2

二级参考文献11

共引文献9

相关作者

相关机构

相关主题

浏览历史