期刊文献+

网络媒体大数据流异构多模态目标识别策略 被引量:12

A Heterogeneous Multimodal Object Recognition Strategy of the Massive Network Data Flow
下载PDF
导出
摘要 如何对海量的网络媒体大数据进行准确地目标识别,是当前的一个研究热点和难点.针对此问题提出一种利用媒体流时间相关特性的异构多模态目标识别策略.首先基于媒体流中同时存在音频和视频信息的特征,建立一种异构多模态深度学习结构;结合卷积神经网络(convolutional neural network,CNN)和限制波尔兹曼机(restricted Boltzmann machine,RBM)的算法优点,对音频信息和视频信息分别并行处理,这种异构模式可以充分利用不同深度神经网络的特点;然后生成基于典型关联分析的共享特征表示,并进一步利用时间相关特性进行参数的优化.3种对比实验用来验证所提策略的效果,首先将策略与单一模态算法进行对比;然后再在复合的数据库上建立对比实验;最后在网络视频库上建立对比实验,这些对比实验验证了策略的有效性. It is a research hot to achieve the object recognition of the massive network media data nowadays.To address the problem,an object recognition strategy is proposed to handle the massive network media data flow which adopts heterogeneous multimodal structure while utilizing the temporal coherence.Firstly,based on the video and audio co-existing feature of media network data,a heterogeneous multimodal structure is constructed to incorporate the convolutional neural network(CNN)and the restricted Boltzmann machine(RBM).The audio information is processed by restricted Boltzmann machine and the video information is processed by convolutional neural network respectively.The heterogeneous multimodal structure can exploit the merits of different deep learning neural networks.After that,the share characteristic representation are generated by using the canonical correlation analysis(CCA).Then the temporal coherence of video frame is utilized to improve the recognizing accuracy further.There kinds of experiments are adopted to validate the effectiveness of the proposed strategy.The first type of experiment compares the proposed strategy with single-mode algorithm.The second type of experiment illustrates the result based on composite database.Finally the videos coming from real websites are extracted to compare the proposed strategy with other algorithms.These experiments prove the effectiveness of the proposed heterogeneous multimodal strategy.
出处 《计算机研究与发展》 EI CSCD 北大核心 2017年第1期71-79,共9页 Journal of Computer Research and Development
基金 湖南省教育科学"十二五"规划重点资助项目(XJK014AJC001) 国家自然科学基金项目(61379111 61672539 61202342)~~
关键词 目标识别 深度学习 卷积神经网络 限制玻尔兹曼机 典型关联分析 object recognition deep learning convolutional neural network(CNN) restricted Boltzmann machine(RBM) canonical correlation analysis(CCA)
  • 相关文献

参考文献4

二级参考文献126

  • 1张奥千,宋韶旭,王建民.基于数据质量规则的缺失结果解释约减[J].计算机研究与发展,2013,50(S1):221-229. 被引量:2
  • 2金连,王宏志,黄沈滨,高宏.基于Map-Reduce的大数据缺失值填充算法[J].计算机研究与发展,2013,50(S1):312-321. 被引量:18
  • 3李石君,于俊清,欧伟杰.基于HTML模式代数的Web信息提取方法[J].计算机研究与发展,2006,43(9):1644-1650. 被引量:8
  • 4Rahm E, Do H H. Data cleaning: Problems and current approaches . IEEE Data Engineering Bulletin, 2000, 23 (4) 3-13. 被引量:1
  • 5Yu Wei, Li Shijun, Zhang Yunlu, et al, Micro blog ranking for personalize recommendation [J]- Journal of Computational Information Systems, 2013, 9(20): 8243- 8250. 被引量:1
  • 6Bohannon P, Fan Wenfei, Geerts F, et at. Conditional functional dependencies for data cleaning [C] //Proc of the 23rd IEEE Int Conf on Data Engineering Alamitos, CA: IEEE, 2007:746-755. 被引量:1
  • 7Bravo L, Fan Wenfei, Ma Shuai, et al. Extending dependencies with conditions [C]//Proc of the 33rd Int Conf on Very Large Databases. Vienna, CA: CTIT, 2007: 243- 254. 被引量:1
  • 8Koudas N, Saha A, Srivastava D, et al. Metric functional dependencies [C] //Proc of the 25th IEEE Int Conf on Data Engineering. Alamitos, CA: IEEE, 2009:1275-1278. 被引量:1
  • 9Korn F, Muthukrishnan S, Zhu Y. Checks and balances: Monitoring data quality problems in network traffic databases [C]//Proe of the 29th Int Conf on Very Large Databases. San Francisco: Morgan Kaufmann, 2003:536-547. 被引量:1
  • 10Xiong Hui, Pandey G, Steinbaeh M, et al, Enhancing data analysis with noise remora [J]. IEEE Trans on Knowledge and Data Engineering, 2006, 18(3) : 304-319. 被引量:1

共引文献312

同被引文献84

引证文献12

二级引证文献27

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部