期刊文献+

层次化新闻视频处理框架的设计与实现 被引量:3

The Design and Realization of Hierarchical Framework of News Video Process
下载PDF
导出
摘要 提出了一个通用的层次化新闻视频处理框架,将新闻视频处理分为句法分段、语义标注以及视频摘要三个层次,并给出了三个层次中涉及的故事单元探测、字幕探测、视频摘要等关键技术的解决方案。框架突破了传统的新闻视频处理框架仅局限于句法分段以及单媒体特征进行处理的缺陷,通过对视音频特征进行多模态的综合分析来获取新闻视频高层的语义内容。实验通过一个新闻视频处理原型系统NVPS验证了框架的可行性,重点对故事单元探测、标题探测以及口播帧探测三个算法进行了实验,实验结果分别达到88%,86%和86%的探测准确率,从而进一步证实了层次框架在新闻视频处理方面的有效性。 A general hierarchical framework of news video process is presented. It divides the news video process into three levels: syntax segmentation level, semantic labeling level and abstraction level. Some key techniques related to these levels are described and solutions of them are introduced. The proposed framework overcomes the shortcomings of traditional news video process methods, which are limited to the content-based segmentation and process based on the single media feature. It acquires the semantic content by the analysis of audio-visual features synthetically. Experiments are carried out on a news video process prototype called NVPS, which validates the feasibility of the framework. Three methods, namely story detection, caption detection and anchor detection methods are tested on NVPS. The results reach to the detection precision of 88%, 86% and 86% respectively, which prove the efficiency of the layered framework in the semantic content analysis of news videos.
出处 《国防科技大学学报》 EI CAS CSCD 北大核心 2004年第5期99-103,共5页 Journal of National University of Defense Technology
基金 国家863高技术资助项目(2001AA115123)
关键词 新闻视频 句法分段 语义标注 视频摘要 news video syntax segmentation semantic labeling video abstraction
  • 相关文献

参考文献8

  • 1Michael R L, Edward Yan, Sam Sze. A Multilingual Multimodal Digital Video Library System[A]. JCDL'02, July 13-17,2002, Portland, Oregon, USA.145-153. 被引量:1
  • 2Shin'ichi Satoh, Name-It: Naming and Detecting Faces in News Videos[J]. IEEE Multimedia, 1999:22-35. 被引量:1
  • 3Lienhart R, Pfeiffer S,Effelsberg W. Video Abstracting[J]. Communications of the ACM, 55-62, Dec. 1997. 被引量:1
  • 4Ma Y F, Lu L, Zhang H J, et al. A User Attention Model for Video Summarization[A]. Proceeding of ACM Multimedia'02, Juan-les-Pins, France, December, 2002. 被引量:1
  • 5Christel M G, Hauptmann A G, Wactlar H D, et al. Collages as Dynamic Summaries for News Video[A]. Proceeding of ACM Multimedia'02, Juan-les-Pins, France, December, 2002. 被引量:1
  • 6谢毓湘,栾悉道,吴玲达,老松杨.一种基于解压的镜头探测方法[J].系统工程与电子技术,2003,25(8):1028-1031. 被引量:8
  • 7马宇飞,白雪生,徐光祐,史元春.新闻视频中口播帧检测方法的研究[J].软件学报,2001,12(3):377-382. 被引量:24
  • 8Hua X S, Chen X R, Liu W Y, et al. Automatic Location of Text in Video Frames[A]. Proceeding of ACM Multimedia 2001 Workshops: MIR2001, Ottawa, Canada, October 5, 2001:24-27. 被引量:1

二级参考文献14

  • 1Ardizzaae E, Caseia M. Automatic Video Database Indexing and Retrieval[J]. Multimedia Tools and Applications, 1997:29 - 56. 被引量:1
  • 2Yu H, Wolf W. A Visual Search System for Video and Image Databases[C]. in Proc.IEEE Int'l Conf. on Multimedia Computing and Systems(Ottawa, Canada), June, 1997: 517-524. 被引量:1
  • 3Zabih R, Miller J, Mai K. A Feature-Based Algorithm for Detecting and Classifying Scene Breaks[ C ]. in Proc. of ACM Multimedia' 95, ( San Francisco, CA), 1995 : 189 - 200. 被引量:1
  • 4J Oh, Hua K A, Liang N. A Content-Based Scene Change Detection and Classification Technique Using Background Tracking[C]. in SPIE Conf.on Multimedia Computing and Networking 2000(San Jose, CA), Jan.2000: 254-265. 被引量:1
  • 5Oh J, Hua K A. An Efficient and Cost-Effective Technique for Browsing and Indexing Large Video Databases[C]. in Prec. of 2000 ACM SIGMOD Intl. Conf. on Management of Data(Dallas, TX), May 2000:415- 426. 被引量:1
  • 6Annan F, Hsu A, Chiu M Y. Image Processing on Compressed Data for Large Video Databases [ C ]. in Proc ACM Multimedia 93 ( Anaheim,CA), 1993:267 - 272. 被引量:1
  • 7Yeo B L, Liu B. An Unified Approach to Temporal Segmentation of Motion JPEG and MPEG Compressed Video[C]. in Proc IEEE Intl. Conf.on Multimedia Computing and Systems(Washington, DC), 1995:81 -90. 被引量:1
  • 8Zhang H J, Chien Y L, Smoliar S W. Video Parsing and Browsing Using Compressed Data[J]. Multimedia Tools and Applications, 1995 (1) : 89- 113. 被引量:1
  • 9Hampapur A, Jain R, Weym outh T E. Production Model Based Digital Video Segmentation[J]. Multimedia Tools and Applications, 1995(1) :9- 47. 被引量:1
  • 10Jongho Nang, Seungwook Hong, Youngin Ibm. An Efficient Video Segmentation Scheme for MPEG Video Stream Using Macroblock Information[C]. in Proc. of ACM Multimedia'99, 1999: 23-26. 被引量:1

共引文献28

同被引文献19

引证文献3

二级引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部