
Research on Speech Separation Based on GCC-NMF (cited by: 1)

Study of speech separation based on Non-Negative Matrix Factorization combined with Generalized Cross-Correlation algorithm
Abstract: To make blind source separation better suited to real-world noise conditions where training data are scarce and need not be labeled, this paper proposes an unsupervised non-negative matrix dictionary learning method. The method learns a dictionary from the mixture signal and then separates the sources by grouping the dictionary atoms, at each time frame, according to their spatial origin. Two-channel mixtures of speech and real-world noise from SiSEC serve as the dataset, and the PEASS and BSS Eval toolkits are used to quantify performance with perceptually motivated, SNR-based, and PEMO-Q-based metrics. The effect of the model parameters on separation quality is also evaluated, and the method is compared with other unsupervised and semi-supervised speech separation methods. The results show that GCC-NMF is a flexible source separation algorithm that outperforms task-specific methods on each of the three evaluation measures, including blind methods and several known methods that require prior knowledge or information.
Authors: WU Junqin, WANG Yingfu (School of Information Engineering, Jiangxi University of Science and Technology, Ganzhou 341000, Jiangxi, China)
Source: Journal of Jiangxi University of Science and Technology (CAS), 2020, No. 5, pp. 65-72 (8 pages)
Funding: National Natural Science Foundation of China (61741109).
Keywords: blind source separation; non-negative matrix factorization (NMF); computational auditory scene analysis (CASA); generalized cross-correlation (GCC); dictionary learning
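
The procedure summarized in the abstract (learn an NMF dictionary from the mixture, then assign each atom to a source at every time frame by its spatial origin) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation. It assumes Euclidean multiplicative-update NMF, a GCC-PHAT weighting, and a known target time delay; the names `nmf`, `learn_dictionary`, `gcc_nmf_mask`, `target_tau`, and `tol` are hypothetical and do not come from the paper.

```python
import numpy as np

def nmf(V, n_atoms, n_iter=100, seed=0, eps=1e-12):
    """Euclidean multiplicative-update NMF: V (F x N) ~= W (F x D) @ H (D x N)."""
    rng = np.random.default_rng(seed)
    F, N = V.shape
    W = rng.random((F, n_atoms)) + eps
    H = rng.random((n_atoms, N)) + eps
    for _ in range(n_iter):
        H *= (W.T @ V) / (W.T @ W @ H + eps)
        W *= (V @ H.T) / (W @ H @ H.T + eps)
    return W, H

def learn_dictionary(XL, XR, n_atoms, n_iter=100, seed=0):
    """Learn one dictionary from both channels' magnitude spectrograms.

    Assumption: the channels are concatenated along time, and the two
    halves of the activations are averaged into one D x T matrix.
    """
    T = XL.shape[1]
    V = np.concatenate([np.abs(XL), np.abs(XR)], axis=1)  # F x 2T
    W, H2 = nmf(V, n_atoms, n_iter=n_iter, seed=seed)
    H = 0.5 * (H2[:, :T] + H2[:, T:])
    return W, H

def gcc_nmf_mask(XL, XR, W, H, freqs, taus, target_tau, tol, eps=1e-12):
    """Soft mask (F x T) built from atoms whose estimated delay is near target_tau.

    XL, XR : complex STFTs (F x T) of the left and right channels
    freqs  : bin center frequencies in Hz (length F)
    taus   : candidate time delays in seconds (length K)
    """
    cross = XL * np.conj(XR)
    phat = cross / (np.abs(cross) + eps)                   # PHAT weighting
    steer = np.exp(-2j * np.pi * np.outer(freqs, taus))    # F x K
    # Per-atom GCC: G[d, k, t] = sum_f W[f, d] * Re(phat[f, t] * steer[f, k])
    G = np.einsum("fd,fk,ft->dkt", W, steer, phat).real
    tau_hat = taus[np.argmax(G, axis=1)]                   # D x T delay per atom
    is_target = np.abs(tau_hat - target_tau) <= tol        # D x T assignment
    target = W @ (H * is_target)                           # target-only model
    return target / (W @ H + eps)                          # values in [0, 1]
```

Under these assumptions, the target estimate is the mask multiplied elementwise with a channel's STFT, followed by an inverse STFT. With the sign convention used here, a positive `target_tau` means the right channel lags the left.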
