
A Time-Domain Algorithm for Extracting Singing-Voice Pitch from Songs with Instrumental Accompaniment (cited by: 1)
Abstract: Extracting the singer's pitch from a song has many uses, for example in content-based audio retrieval. The task raises problems that ordinary speech processing does not face: conventional time-domain algorithms struggle to extract the vocal pitch correctly under strong interference from the musical background, while recent systems designed specifically for singing voice rely on relatively complex trained models and frequency-domain algorithms. To improve on conventional time-domain algorithms and raise the accuracy of singing-pitch extraction, while keeping complexity low enough for a hardware implementation of a content-based audio retrieval system, this work proposes a time-domain algorithm that starts from the characteristics of the human voice and uses open-loop/closed-loop pitch extraction as its framework. MATLAB simulations show that the algorithm extracts singing pitch markedly more accurately than conventional time-domain algorithms and some modified variants of them.
Source: 《电子工程师》 (Electronic Engineer), 2007, No. 11, pp. 33-36, 61 (5 pages)
Keywords: voiced-sound energy decision method; open-loop/closed-loop pitch extraction; voice/music separation (speech/music discrimination)
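The abstract names an open-loop/closed-loop framework but gives no detail, so the following is only a minimal sketch of that general idea applied to a plain autocorrelation pitch detector; it is not the authors' algorithm. It assumes a coarse (open-loop) search over every second lag, followed by a fine (closed-loop) search around the coarse estimate. All function names, the 80 to 500 Hz search range, the 0.98 tie-breaking margin, and the synthetic test tone are illustrative assumptions. The voiced-sound energy decision and the voice/music separation steps are not modeled.

```python
import math


def normalized_autocorr(frame, lag):
    """Normalized autocorrelation of `frame` at an integer lag (range -1..1)."""
    n = len(frame) - lag
    if n <= 0:
        return 0.0
    num = sum(frame[i] * frame[i + lag] for i in range(n))
    e0 = sum(frame[i] ** 2 for i in range(n))
    e1 = sum(frame[i + lag] ** 2 for i in range(n))
    denom = math.sqrt(e0 * e1)
    return num / denom if denom > 0 else 0.0


def open_loop_lag(frame, lag_min, lag_max, step=2, margin=0.98):
    """Coarse search: score every `step`-th lag only.

    Assumption: among lags scoring within `margin` of the best, the shortest
    is returned, a simple guard against period-doubling (octave) errors.
    """
    scores = {lag: normalized_autocorr(frame, lag)
              for lag in range(lag_min, lag_max + 1, step)}
    best_lag = max(scores, key=scores.get)
    if scores[best_lag] <= 0:
        return best_lag
    threshold = margin * scores[best_lag]
    return min(lag for lag, s in scores.items() if s >= threshold)


def closed_loop_lag(frame, coarse, lag_min, lag_max, radius=2):
    """Fine search: score every lag within `radius` of the coarse estimate."""
    lo = max(lag_min, coarse - radius)
    hi = min(lag_max, coarse + radius)
    return max(range(lo, hi + 1), key=lambda lag: normalized_autocorr(frame, lag))


def estimate_pitch_hz(frame, fs, f_min=80.0, f_max=500.0):
    """Estimate the pitch of one frame in Hz (hypothetical search range)."""
    lag_min = int(fs / f_max)
    lag_max = int(fs / f_min)
    coarse = open_loop_lag(frame, lag_min, lag_max)
    fine = closed_loop_lag(frame, coarse, lag_min, lag_max)
    return fs / fine


def synth_frame(f0, fs, n):
    """Synthetic 'voice': a fundamental plus two weaker harmonics, no accompaniment."""
    amps = (1.0, 0.5, 0.25)
    return [sum(a * math.sin(2 * math.pi * (k + 1) * f0 * i / fs)
                for k, a in enumerate(amps))
            for i in range(n)]


if __name__ == "__main__":
    fs = 8000
    frame = synth_frame(200.0, fs, 400)
    print(estimate_pitch_hz(frame, fs))
```

The two-stage split keeps the number of scored lags small, which is the usual motivation for this structure in low-complexity or hardware-oriented designs. A clean synthetic tone says nothing about robustness to accompaniment, which is the problem the paper addresses.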


