Audio Segmentation Based on Layer Entropy Detection
Cited by: 1

Abstract: Audio segmentation is an important means of extracting the structure and content semantics of audio, and it is the basis of content-based audio analysis and retrieval. This paper proposes an audio segmentation algorithm based on layer entropy detection. The algorithm traverses the audio stream with a fixed-length analysis window, applies a top-down layered detection structure inside the window, and locates change points according to the trend of the entropy. Experimental results show that the algorithm avoids the hard-threshold decision and the data-accumulation problem of the ΔBIC segmentation algorithm, making it a more effective audio segmentation method.
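The abstract does not give the entropy formula or the decision rule, so the following is only a minimal sketch of one plausible reading of the within-window step. It assumes that the frames in a window are feature vectors (for example MFCCs), that each segment is modelled by a single Gaussian, and that the candidate change point is the split that most reduces the length-weighted Gaussian entropy. The function names, the `margin` parameter, and the synthetic data are all hypothetical; the paper's layered traversal and its trend-based acceptance rule are not reproduced here.

```python
import numpy as np


def gaussian_entropy(frames: np.ndarray, eps: float = 1e-6) -> float:
    """Differential entropy (in nats) of a Gaussian fitted to the rows of `frames`."""
    d = frames.shape[1]
    # Small diagonal term keeps the covariance well conditioned (an assumption).
    cov = np.atleast_2d(np.cov(frames, rowvar=False)) + eps * np.eye(d)
    _, logdet = np.linalg.slogdet(cov)
    return 0.5 * (d * np.log(2.0 * np.pi * np.e) + logdet)


def entropy_gain_curve(window: np.ndarray, margin: int) -> np.ndarray:
    """Entropy reduction from splitting `window` at each candidate frame.

    Entry i corresponds to a split at frame index margin + i, so both
    sides of every split contain at least `margin` frames.
    """
    n = len(window)
    whole = gaussian_entropy(window)
    gains = []
    for t in range(margin, n - margin + 1):
        left, right = window[:t], window[t:]
        split = (t * gaussian_entropy(left) + (n - t) * gaussian_entropy(right)) / n
        gains.append(whole - split)
    return np.array(gains)


def candidate_change_point(window: np.ndarray, margin: int = 20) -> int:
    """Frame index at which the entropy-reduction curve peaks."""
    gains = entropy_gain_curve(window, margin)
    return margin + int(np.argmax(gains))


if __name__ == "__main__":
    # Synthetic window: 200 frames from one source, then 200 from another.
    rng = np.random.default_rng(0)
    segment_a = rng.normal(0.0, 1.0, size=(200, 2))
    segment_b = rng.normal(5.0, 1.0, size=(200, 2))
    window = np.vstack([segment_a, segment_b])
    # The true change is at frame 200; the peak should land close to it.
    print(candidate_change_point(window))
```

Because the sketch only inspects the shape of the entropy curve inside one fixed-length window, it needs no global threshold and no growing data buffer, which is the property the abstract contrasts with ΔBIC. Deciding whether a peak is a genuine change point would still require the paper's trend criterion.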

Authors: Wang Zhiming (王志明), Zhang Ruijie (张瑞杰), Li Bicheng (李弼程)

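Affiliations: Huaihua Vocational and Technical College (怀化职业技术学院); Institute of Information Engineering, Information Engineering University (信息工程大学信息工程学院)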

Source: Science Technology and Engineering (《科学技术与工程》), 2009, No. 17, pp. 5012-5016, 5023 (6 pages)

Funding: Supported by the National 863 Program of China (2006AA01Z146)

Keywords: audio segmentation; layer detection; entropy trend

Classification number: TP391.42 [Automation and Computer Technology: Computer Application Technology]


