期刊文献+

基于分层熵检测的音频分割算法 被引量:1

Audio Segmentation Based on Layer Entropy Detection
下载PDF
导出
摘要 音频分割是提取音频结构和内容语义的重要手段,是基于内容的音频分析、检索的基础。提出分层熵检测音频分割算法,采用定长分析窗分层结构遍历音频流,窗内根据熵变化趋势检测跳变点。实验结果表明,该算法避免了ΔBIC分割算法中的硬门限判决和数据累积问题,是一种更加有效的音频分割方法。 Audio segmentation is an important access to extract audio structure and content, aria is a basis for further audio retrieval and analysis. An audio segmentation algorithm is proposed based on layer entropy detection. The algorithm searches the audio stream using a size-fixed analysis window in which a top-down detection structure is employed and locates the change points according to the entropy trend. The experimental results demonstrate that algorithm avoids detection errors due to data accumulation and establishing hard threshold, it is a more effective segmentation algorithm.
出处 《科学技术与工程》 2009年第17期5012-5016,5023,共6页 Science Technology and Engineering
基金 国家863计划资助项目(2006AA01Z146)资助
关键词 音频分割 分层检测 熵变化趋势 audio segmentation layer detection the entropy trend
  • 相关文献

参考文献12

  • 1卢坚,毛兵,孙正兴,张福炎.一种改进的基于说话者的语音分割算法[J].软件学报,2002,13(2):274-279. 被引量:17
  • 2贾磊,穆向禺,徐波.广播语音的音频分割[J].中文信息学报,2002,16(1):37-42. 被引量:11
  • 3张一彬,周杰,边肇祺,张大鹏.一种新的基于分类的音频流分割方法[J].电子学报,2006,34(4):612-617. 被引量:10
  • 4Meinedo H, Neto J. Audio segmentation, classification and clustering in a broadcast news task. IEEE International Conference on Acoustics, Speech and Signal Processing. Hong Kong, China: Kluwer Academic Press,2003 ;5-8. 被引量:1
  • 5Wu Chunghsien, Hsieh Chiahsin. Multiple change-point audio segmentation and classification using an MDL-based ganssian model. IEEE Transactions on Audio, Speech and Language Processing,2006 ; 14 : 647-657. 被引量:1
  • 6Woodland P, Gales M, Pye D,et al. The development of the 1996 HTK broadcast news transcription system. In: Proc Speech Recognition Workshop, 1997 : 73-78. 被引量:1
  • 7Bakis R, Chen S, Gopalakrishnan P, et al. Transcription of broadcast news shows with the IBM large vocabulary speech recognition system. In: Proceedings of the DARPA Speech Recognition Workshop Chantilly, 1997:67-72. 被引量:1
  • 8Siegler M A, Jain U, Raj B,et al. Automatic segmentation, classification and clustering of broadcast news audio. In: Proceedings of the DARPA Speech Recognition Workshop Chantilly, 1997:97-99. 被引量:1
  • 9Cover T M, Tomas J A. Elements of information theory. New York: John Wiley&Sons, 1991 : 1197-1208. 被引量:1
  • 10Gish H, Schmidt N. Text-Independent speaker identification. IEEE Signal Processing Magazine. 1994:18-32. 被引量:1

二级参考文献29

  • 1[1]R. Bakis et al., Transcription of broadcast news shows with the IBM large vocabulary speech recognition system, proceedings of the Speech Recognition Workshop, 1997,67-72,1997 被引量:1
  • 2[2]F. Kubala et al. The 1996 BBN Byblos Hub-4 transcription system, Proceedings of the Speech Recognition Workshop, 1997,90-93 被引量:1
  • 3[3]M. Siegler, U. Jain, B. Ray and R. Stem, Automation segment, classification and clustering of broadcast news audio, Proceedings of the Speech Recognition Workshop, 1997,97-99 被引量:1
  • 4[4]S. Chen and P. S. Gopalakrishnan, Speaker, Environment and Channel Change Detection and Clustering via Bayesian Information Criterion, Proceedings of the Speech Recognition Workshop, 1998 被引量:1
  • 5[5]azumasa MORI and Seiichi NAKAGAWA, Speaker Change Detection and Speaker Clustering Using VQ Distortion For Broadcast News Recognition,Proceedings of ICASSP 2001 被引量:1
  • 6[6]V.V. Digalakis,P. Monaco,andH. Murveit,Generalized MixtureTying in Continuous Hideen Markov ModelBased Speech Recognizers, IEEE Transactions On Speech and Audio Processing,1996,4(4) :281-288 被引量:1
  • 7Chou W, Gu L. Robust singing detection in speech/music discriminator design[ A]. In. Proc ICASSP[ C ].Salt Lake City, USA : IEEE,2001,2:865 - 868. 被引量:1
  • 8Ajmera J, Mccowan I A, Bourlard H. Robust HMM-based speech/music segmentation [ A ]. In: Proc ICASSP[ C]. Orlando, USA: IEEE,2002 ,1:297 -300. 被引量:1
  • 9Sundaram H, Chang S F. Audio scene segmentation using multiple features, models and time scales [ A ]. In:IEEE Proc ICASSP [ C ]. Istanbul, Turkey: IEEE, 2000.4.2441 - 2444. 被引量:1
  • 10Foote J. Automatic audio segmentation using a measure of audio novelty [ A ]. In: IEEE Proc Multimedia and Expo [ C ]. New York, USA: IEEE, 2000.1. 452 - 455. 被引量:1

共引文献27

同被引文献9

引证文献1

二级引证文献4

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部