Abstract
Based on statistical theory, a generic method for multi-granularity video semantic analysis is proposed, unifying multi-level semantic analysis and multi-modal information fusion. To represent temporal content, a key-frame selection strategy with temporal semantic context constraints and an attention selection model are first presented. After basic visual semantics are recognized, a multi-level visual semantic analysis framework is applied to extract visual semantics. Hidden Markov models (HMMs) and Bayesian decision are then applied to audio semantic understanding. Finally, a bionic multimodal fusion scheme with a two-level structure fuses the semantic information. Experimental results demonstrate that the proposed method effectively fuses multimodal features and extracts video semantics at different granularities.
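The audio stage pairs HMM likelihood scoring with a Bayesian (MAP) decision rule. A minimal sketch of that pattern, assuming discrete-symbol HMMs (the two toy models, their parameters, and the class labels below are illustrative assumptions, not the paper's trained acoustic models):

```python
import numpy as np

def forward_log_likelihood(obs, pi, A, B):
    """Log P(obs | HMM) for a discrete-symbol HMM via the scaled
    forward algorithm (scaling avoids numerical underflow)."""
    alpha = pi * B[:, obs[0]]          # initial forward variables
    scale = alpha.sum()
    log_lik = np.log(scale)
    alpha = alpha / scale
    for o in obs[1:]:
        alpha = (alpha @ A) * B[:, o]  # propagate, then weight by emission
        scale = alpha.sum()
        log_lik += np.log(scale)
        alpha = alpha / scale
    return log_lik

def bayes_decide(obs, models, priors):
    """MAP rule: pick the class maximizing log P(obs|class) + log P(class)."""
    score = {c: forward_log_likelihood(obs, *m) + np.log(priors[c])
             for c, m in models.items()}
    return max(score, key=score.get)

# Two illustrative 2-state HMMs over a binary symbol alphabet {0, 1}:
# the "applause" class mostly emits symbol 0, the "speech" class symbol 1.
models = {
    "applause": (np.array([0.5, 0.5]),
                 np.array([[0.9, 0.1], [0.1, 0.9]]),
                 np.array([[0.8, 0.2], [0.8, 0.2]])),
    "speech":   (np.array([0.5, 0.5]),
                 np.array([[0.9, 0.1], [0.1, 0.9]]),
                 np.array([[0.2, 0.8], [0.2, 0.8]])),
}
priors = {"applause": 0.5, "speech": 0.5}

decision = bayes_decide([0, 0, 0, 1, 0, 0], models, priors)
print(decision)  # the mostly-0 sequence is classified as "applause"
```

In the paper's setting, each audio semantic class would have its own trained HMM, and the decision rule selects the class whose posterior is maximal.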
Source
Journal of Computer-Aided Design & Computer Graphics (《计算机辅助设计与图形学学报》)
Indexed in: EI, CSCD, Peking University Core Journals (北大核心)
2008, No. 1, pp. 85-92 (8 pages)
Funding
National Natural Science Foundation of China (60273035)
Youth Fund of the Sichuan Provincial Education Department (2006B063)
Development Fund of Chengdu University of Information Technology (KYTZ20060904)