A New Method for Speech Segmentation Using a Voting Selection Mechanism (cited by: 2)

New method for speech segmentation using candidate selection
Abstract: The performance of speech segmentation on continuous speech signals degrades markedly in noisy backgrounds. To address this problem, this paper proposes a new segmentation method for continuous speech signals. Rather than relying on a single endpoint detection method, the approach takes the endpoint detection results obtained from several different methods, including detection based on fractal dimension, on cepstral features, and on HMMs, and combines them through voting selection (candidate selection) to obtain the final endpoints, which are then used to segment the continuous speech signal. Experimental results show that the proposed method noticeably improves speech segmentation accuracy.
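The abstract does not spell out the voting rule, so the following is only a minimal sketch of one plausible reading: each detector labels every frame as speech (1) or non-speech (0), a frame-wise strict majority decides the final label, and runs of speech frames become segments. The function names `majority_vote` and `segments`, the frame-level formulation, and the toy label sequences are all assumptions made for illustration, not the authors' implementation.

```python
from typing import List, Tuple


def majority_vote(decisions: List[List[int]]) -> List[int]:
    """Frame-wise majority vote over binary speech/non-speech labels.

    `decisions` holds one label sequence per detector (hypothetical inputs;
    the paper's detectors are fractal-dimension, cepstral, and HMM based).
    """
    if not decisions:
        return []
    n_frames = len(decisions[0])
    if any(len(d) != n_frames for d in decisions):
        raise ValueError("all detectors must label the same number of frames")
    n_detectors = len(decisions)
    # A frame counts as speech when strictly more than half the detectors agree.
    return [1 if 2 * sum(col) > n_detectors else 0 for col in zip(*decisions)]


def segments(labels: List[int]) -> List[Tuple[int, int]]:
    """Return (start, end) frame indices, end exclusive, for each speech run."""
    result: List[Tuple[int, int]] = []
    start = None
    for i, lab in enumerate(labels):
        if lab and start is None:
            start = i                    # a speech run begins
        elif not lab and start is not None:
            result.append((start, i))    # a speech run ends
            start = None
    if start is not None:                # run extends to the last frame
        result.append((start, len(labels)))
    return result


# Toy per-frame outputs of three detectors (made-up values, not real data).
fractal = [0, 1, 1, 1, 0, 0, 1, 1, 0, 0]
cepstral = [0, 0, 1, 1, 1, 0, 1, 1, 1, 0]
hmm = [0, 1, 1, 0, 0, 0, 0, 1, 1, 0]

voted = majority_vote([fractal, cepstral, hmm])
print(segments(voted))  # [(1, 4), (6, 9)]
```

In this toy case no single detector yields exactly the voted boundaries; each disagrees with the majority on at least one frame, which is the intuition behind combining detectors in noise.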
Source: Computer Engineering and Applications (《计算机工程与应用》), CSCD, Peking University Core Journal, 2009, Issue 24, pp. 21-24 (4 pages)
Funding: National Natural Science Foundation of China (No. 60702053); Natural Science Foundation of Heilongjiang Province (No. F2004-08)
Keywords: speech segmentation; cepstral feature; fractal dimension; Hidden Markov Model (HMM); voting selection (candidate selection); background noise
