
A single-channel speech enhancement approach using convolutive non-negative matrix factorization with an L_(1/2) sparsity constraint (cited by: 10)
Abstract: To capture the inter-frame correlation of speech signals and to represent speech features with fewer speech bases, a single-channel speech enhancement method based on convolutive non-negative matrix factorization (CNMF) with an L_(1/2) sparsity constraint is proposed. First, noise bases are learned by training on noise. Next, with the noise bases as prior information, the L_(1/2)-constrained CNMF learns the speech bases from the noisy speech. Finally, the clean speech signal is reconstructed from the learned speech bases and their corresponding coefficients. Experiments in different noise environments show that the proposed method outperforms CNMF with an L_1 sparsity constraint as well as conventional statistical speech enhancement methods.
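The three steps in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: it assumes a squared Euclidean cost (the abstract does not name the divergence), heuristic multiplicative updates, and non-negative magnitude spectrograms as input. The function names (`shift`, `reconstruct`, `cnmf`, `enhance`) and all default parameter values are hypothetical and chosen only for this example.

```python
import numpy as np

EPS = 1e-9  # small constant to avoid division by zero


def shift(X, t):
    """Shift columns of X right by t (t > 0) or left by -t (t < 0), zero-filled."""
    out = np.zeros_like(X)
    if t == 0:
        out[:] = X
    elif t > 0:
        out[:, t:] = X[:, :-t]
    else:
        out[:, :t] = X[:, -t:]
    return out


def reconstruct(W, H):
    """Convolutive model: sum_t W[t] @ shift(H, t). W is (T, F, R), H is (R, N)."""
    return sum(W[t] @ shift(H, t) for t in range(W.shape[0]))


def cnmf(V, R, T, lam=0.0, W_fixed=None, n_iter=100, seed=0):
    """CNMF with an L_(1/2) penalty lam * sum(sqrt(H)) on the free activations.

    W_fixed (T, F, R_fixed), if given, is prepended and never updated
    (assumed here to play the role of the pre-trained noise bases).
    """
    rng = np.random.default_rng(seed)
    F, N = V.shape
    W = rng.random((T, F, R)) + EPS
    n_fixed = 0
    if W_fixed is not None:
        n_fixed = W_fixed.shape[2]
        W = np.concatenate([W_fixed, W], axis=2)
    H = rng.random((W.shape[2], N)) + EPS
    for _ in range(n_iter):
        # Activation update; the penalty gradient (lam / 2) * H^(-1/2)
        # is added to the denominator for the free (speech) rows only.
        L = reconstruct(W, H)
        num = sum(W[t].T @ shift(V, -t) for t in range(T))
        den = sum(W[t].T @ shift(L, -t) for t in range(T))
        pen = np.zeros_like(H)
        pen[n_fixed:] = 0.5 * lam / np.sqrt(H[n_fixed:] + EPS)
        H *= num / (den + pen + EPS)
        # Basis update for the free bases only.
        L = reconstruct(W, H)
        for t in range(T):
            Ht = shift(H, t)
            upd = (V @ Ht.T) / (L @ Ht.T + EPS)
            W[t, :, n_fixed:] *= upd[:, n_fixed:]
        # Rescale free bases to unit norm and compensate in H, so the
        # sparsity penalty cannot be dodged by shrinking H and growing W.
        norms = np.sqrt((W[:, :, n_fixed:] ** 2).sum(axis=(0, 1))) + EPS
        W[:, :, n_fixed:] /= norms
        H[n_fixed:] *= norms[:, None]
    return W, H


def enhance(V_noisy, V_noise, R_speech=8, R_noise=4, T=3, lam=0.1, n_iter=50):
    """Step 1: learn noise bases. Step 2: learn speech bases with the noise
    bases fixed. Step 3: rebuild the speech magnitude spectrogram."""
    W_n, _ = cnmf(V_noise, R_noise, T, n_iter=n_iter)
    W, H = cnmf(V_noisy, R_speech, T, lam=lam, W_fixed=W_n, n_iter=n_iter)
    return reconstruct(W[:, :, R_noise:], H[R_noise:])
```

Here `enhance` returns an estimated speech magnitude spectrogram. Converting it back to a waveform (for example by reusing the noisy phase in an inverse STFT) is left out of the sketch, since the abstract does not describe that step.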
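Authors: LU Cheng (路成), TIAN Meng (田猛), ZHOU Jian (周健), WANG Huabin (王华彬), TAO Liang (陶亮) (Key Laboratory of Intelligent Computing and Signal Processing of Ministry of Education, Anhui University, Hefei 230031; Institute of Media Computing, Anhui University, Hefei 230601)
Source: Acta Acustica (《声学学报》), indexed in EI, CSCD, and the Peking University Core Journals list; 2017, No. 3, pp. 377-384 (8 pages)
Funding: Supported by the National Natural Science Foundation of China (61301295, 61372137), the Anhui University Doctoral Research Start-up Fund, and the Natural Science Foundation of Anhui Province (1708085MF151)
Keywords: speech enhancement; non-negative matrix factorization; noisy speech; intelligibility; sparsity; regularization; speech quality; magnitude spectrum; unsupervised; inter-frame; convolution; matrix algebra; speech; speech communication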