摘要
提出了基于MPEG压缩域音频流的足球比赛精彩场景自动分析与提取算法 首先直接提取出压缩域音频特征 ;然后基于提取出来的压缩域特征实现解说音的检测和分割 ,并且分别识别足球比赛中解说员激动解说和观众激昂欢呼两种类型音频事件 ;最后通过概率融合生成最终结果 ,融合结果所对应的比赛片段就是提取出的足球比赛精彩场景
This paper presents an algorithm to automatically analyze and extract soccer program highlights scene based on MPEG compressed audio-track analysis. In this algorithm, audio compressed features are directly extracted first and then based on the features the algorithm detects and segments commentary speeches, to recognize in particular the events of excited commentary and crowd cheers respectively. Finally, the recognized results are integrated by probability fusion, and the corresponding video clips of fusion result are chosen as soccer highlights. Experimental data shows that the algorithm works well.
出处
《计算机辅助设计与图形学学报》
EI
CSCD
北大核心
2004年第6期856-860,共5页
Journal of Computer-Aided Design & Computer Graphics
基金
国家自然科学基金 ( 60 2 72 0 3 1)
教育部博士点科研基金( 2 0 0 10 3 3 5 0 49)
国家"十五"重大科技攻关项目 ( 2 0 0 1BA10 1A0 7 0 3 )
浙江省科技计划项目重点科研项目 ( 2 0 0 3C2 10 10 )
浙江省自然科学基金(M 60 3 2 0 2 )资助