期刊文献+

多层视频语义概念分析与理解 被引量:8

Analysis and Understanding for Multi-Level Video Semantic Concepts
下载PDF
导出
摘要 基于统计学理论,提出了一种视频多粒度语义分析的通用方法,使得多层次语义分析与多模式信息融合得到统一.为了对时域内容进行表示,首先提出一种具有时间语义语境约束的关键帧选取策略和注意力选择模型;在基本视觉语义识别后,采用一种多层视觉语义分析框架来抽取视觉语义;然后应用隐马尔可夫模型(HMM)和贝叶斯决策进行音频语义理解;最后用一种具有两层结构的仿生多模式融合方案进行语义信息融合.实验结果表明,该方法能有效融合多模式特征,并提取不同粒度的视频语义. Based on statistics theory, a generic method for video multi-granularity semantic analysis is proposed in this paper, where multi-level semantics analysis and multi-modal information fusion are unified to represent temporal content, a key-frame selection strategy with temporal semantic context restriction and an attention selection mode[ are presented firstly. After recognizing basic visual semantics, a framework for multi-level visual semantics analysis is introduced for visual semantics extraction. Then, Hidden Markov model and Bayesian decision are applied to audio semantic understanding. Finally, a bionic muhimodal fusion scheme with two level structures is used for video semantic information fusion. Experimental results demonstrate the effectiveness of the proposed method to fuse multimodal features, as well as to extract video semantics with different granularity.
出处 《计算机辅助设计与图形学学报》 EI CSCD 北大核心 2008年第1期85-92,共8页 Journal of Computer-Aided Design & Computer Graphics
基金 国家自然科学基金(60273035) 四川省教育厅青年基金(2006B063) 成都信息工程学院发展基金(KYTZ20060904)
关键词 视频语义分析 视频语义概念 层次隐马尔可夫模型 多模式融合 video semantic analysis video semantic concept HHMM multimodal fusion
  • 相关文献

参考文献18

  • 1Li B, Sezan I. Semantic sports video analysis: approaches and new applications [C] //Proceedings of 2003 International Conference on Image Processing, Barcelona, 2003, 1 : 17-20 被引量:1
  • 2Wu Chuan, Ma Yu-Fei, Zhan Hong-Jiang, et al. Events recognition by semantic inference for sports video [C] // Proceedings of International Conference on Multimedia and Expo, Tokyo, 2002, 1: 805-808 被引量:1
  • 3Babaguchi N, Nitto N. Intermodal collaboration: a strategy for semantic content analysis for broadcasted sports video [C] // Proceedings of 2003 International Conference on Image Processing, Barcelona, 2003, 1: 13-16 被引量:1
  • 4Adams W H, Iyengar Giridharan, Lin Ching-Yung, et al. Semantic indexing of multimedia content using visual, audio and text cues [J]. EURASIP Journal on Applied Signal Processing, 2003, (2): 170-185 被引量:1
  • 5Lu S, Lyu M R, King I. Semantic video summarization using mutual reinforcement principle and shot arrangement patterns [C]//Proceedings of the 11th International Multimedia Modelling Conference, Melbourne, 2005 : 60-67 被引量:1
  • 6Chen Shu-Ching, Shyu Mei-Ling, Chen Min, et al. A decision tree-based multimodal data mining framework for soccer goal detection [C]//Proceedings of IEEE International Conference on Multimedia and Expo, Taipei, 2004, 1:27-30 被引量:1
  • 7Huang C-L, Shih H-C, Chen C-L. Shot and scoring events identification of basketball videos [C] //Proceedings of IEEE International Conference on Multimedia and Expo, Toronto, 2006 : 1885-1888 被引量:1
  • 8Snoek C G M, Worring M, Geusebroek J M, et al. The semantic pathfinder for generic news video indexing [C] // Proceedings of IEEE International Conference on Multimedia and Expo, Toronto, 2006:1469-1472 被引量:1
  • 9Izquierdo E. Knowledge-based image processing for classification and recognition in surveillance applications [C] //Proceedings of IEEE International Conference on Image Processing, Atlanta, GA, 2006:2377-2380 被引量:1
  • 10Chien S-Y, Ma S-Y, Chen L-G. Efficient moving object segmentation algorithm using background registration technique [J]. IEEE Transactions on Circuits and Systems for Video Technology, 2002, 12(7): 577-586 被引量:1

二级参考文献8

  • 1Jensen R,Shen Q.Semantics-Preserving Dimensionality Reduction:Rough and Fuzzy-Rough-Based Approaches[J].IEEE Transactions on Knowledge and Data Engineering(S1041-4347),2004,16(12):1457-1471. 被引量:1
  • 2Rasheed Z,Sheikh Y,Shah M.On the Use of Computable Features for Film Classification[J].IEEE Transactions on Circuits and Systems for Video Technology(S1051-8215),2005,15(1):52-64. 被引量:1
  • 3Liu H,Setiono R.Feature selection via discretization[J].IEEE Transactions on Knowledge and Data Engineering (S1041-4347),1997,9(4):642-645. 被引量:1
  • 4Mittal A,Cheong L-F.Addressing the Problems of Bayesian Network Classification of Video Using High-dimensional Features[J].IEEE Transactions on Knowledge and Data Engineering (S1041-4347),2004,16(2):230-244. 被引量:1
  • 5JTC1/SC29/WG11 I.I.Coding of Moving Pictures and Audio,"Overview of the MPEG-7 Standard" Int'l Organization for Standariation.Oct.2000[S]. 被引量:1
  • 6Lynch R S J,P K W.Bayesian Classification and Feature Reduction Using Uniform Dirichlet Priors[J].IEEE Transactions on Systems,Man and Cybernetics(S1083-4419),Part B,2003,33(3):448-464. 被引量:1
  • 7Ji H,Bang S Y.Feature Selection for Multi-class Classification Using Pairwise Class Discriminatory Measure and Covering Concept[J].Electronics Letters (S0013-5194),2000,36(6):524-525. 被引量:1
  • 8C.Blake C M.UCI Machine Learning Repository[DB/OL].(2005)[2005].http://www.ics.uci.edu/~mlearn/MLRepository.html. 被引量:1

共引文献4

同被引文献63

  • 1胡宏宇,李志慧,曲昭伟,王殿海.基于上下文的交通事件表达与识别[J].吉林大学学报(工学版),2009,39(S2):158-162. 被引量:4
  • 2余卫宇,谢胜利,余英林,潘晓舟.语义视频检索的现状和研究进展[J].计算机应用研究,2005,22(5):1-7. 被引量:14
  • 3史迎春,周献中,方鹏飞.综合利用形状和颜色特征的台标识别[J].模式识别与人工智能,2005,18(2):216-222. 被引量:13
  • 4魏维,游静,刘凤玉,许满武.语义视频检索综述[J].计算机科学,2006,33(2):1-7. 被引量:18
  • 5Alan Hanjalic.Content-Based Analysis of Digital Video[M].Germany:Springer,2004. 被引量:1
  • 6Bai Liang,Lao Songyang,Zhang Weiming.A Semantic Event Detection Approach For Soccer Video Based On Perception Concepts And Finite State Machines[C] //Eight International Workshop on Image Analysis for Multimedia Interactive Services(WIAMIS 07).Greece:European Association for Signal Image Processing.2007:30-34. 被引量:1
  • 7William Gibson.Pattern Recognition[M].America:Berkley Publishing Group,2004. 被引量:1
  • 8HAMPAPUR A, JAIN R, WEYMOUTH T. Production Model Based Digital Video Segmentation [ J ]. Multimedia Tools and Applications, 1995, 1 ( 1 ) : 29-46. 被引量:1
  • 9LEUNG C, SO S, TAM A, et al. Semantic-based Retrieval of Visual Data [ J ]. Principles of Visual Information Retrieval, 2001, (12) : 297-318. 被引量:1
  • 10NGO C W, PONG T C. Recent Advances in Contentbased Video Analysis[ J]. International Journal of Image and Graphics, 2001, 1(3): 445-468. 被引量:1

引证文献8

二级引证文献10

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部