Speech Separation Based on Sound Localization and Auditory Masking Effect (cited by: 16)
Abstract: Humans can pick out the speech they are interested in from a noisy environment, and binaural hearing strengthens this ability. On this basis, the paper proposes a method for separating mixed speech based on sound localization and the auditory masking effect. Masking coefficients are determined for each time-frequency region from two cues of the sound arriving at the two ears: the interaural time difference (ITD) and the interaural intensity difference (IID). The coefficients are binary, so the interfering signal is removed directly and the target signal is retained, which achieves the separation. Experiments show that the proposed method is effective. It applies to unvoiced as well as voiced mixed speech, and therefore has a wider range of application than speech separation methods based on pitch extraction.
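The abstract does not give the rule that maps ITD and IID to a mask value, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes that each time-frequency unit already has an ITD estimate (seconds) and an IID estimate (dB), and that a unit is kept only when both cues lie within a tolerance of the values expected for the target direction. The function names, the tolerances and the toy numbers are all hypothetical.

```python
from typing import List

Grid = List[List[float]]  # rows = frequency channels, columns = time frames


def binary_mask(itd: Grid, iid: Grid,
                target_itd: float, target_iid: float,
                itd_tol: float, iid_tol: float) -> List[List[int]]:
    """Return 1 for units whose cues match the target direction, else 0.

    Assumption: a unit belongs to the target only if BOTH cues are
    within tolerance. The paper may combine the cues differently.
    """
    mask = []
    for itd_row, iid_row in zip(itd, iid):
        mask.append([
            1 if abs(t - target_itd) <= itd_tol and abs(l - target_iid) <= iid_tol
            else 0
            for t, l in zip(itd_row, iid_row)
        ])
    return mask


def apply_mask(tf_units: Grid, mask: List[List[int]]) -> Grid:
    """Keep target-dominated units and zero out the rest."""
    return [[x * m for x, m in zip(x_row, m_row)]
            for x_row, m_row in zip(tf_units, mask)]


# Toy 2x2 example (hypothetical values): target straight ahead,
# so its expected ITD is 0 s and its expected IID is 0 dB.
itd = [[0.0, 0.0004], [0.00005, -0.0003]]   # seconds
iid = [[0.5, 6.0], [-0.3, -5.0]]            # dB
mixture = [[1.0, 2.0], [3.0, 4.0]]          # magnitudes of the mixed signal

mask = binary_mask(itd, iid, target_itd=0.0, target_iid=0.0,
                   itd_tol=1e-4, iid_tol=1.0)
separated = apply_mask(mixture, mask)
print(mask)       # [[1, 0], [1, 0]]
print(separated)  # [[1.0, 0.0], [3.0, 0.0]]
```

Because the mask is binary, a unit is either passed unchanged or discarded entirely. No pitch estimate is involved, which is consistent with the abstract's claim that the approach also covers unvoiced speech.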
Source: Acta Electronica Sinica (《电子学报》), indexed in EI, CAS, CSCD and the Peking University Core Journals list; 2005, No. 1, pp. 158-160 (3 pages)
Funding: National Natural Science Foundation of China (No. 60172016)
Keywords: interaural time difference; interaural intensity difference; sound localization; speech separation; masking effect; algorithms; audition; estimation; signal processing; speech intelligibility
