A Method for Identifying Video Frame Sequences in Wrong Temporal Order Based on 3D Convolution (基于3D卷积的视频错帧筛选方法)

Abstract: To extract better video features and improve training accuracy, this paper proposes a wrong-frame identification model based on a convolutional neural network (CNN). A wrong-frame sequence is a sequence of frames that is out of temporal order; an ordered sequence is one whose frames follow temporal order. Given several frame sequences, the task is to identify the one sequence whose order is wrong. The model is trained with unsupervised learning, so it does not depend on a labeled dataset. To suit this task and the label-free training, the model uses a multi-branch CNN architecture that is trained end to end. The input frame sequences are sampled from a single video, and each is encoded with 3D convolution to extract its temporal and spatial features. To find the wrongly ordered sequence, the model compares all the input sequences, infers the regularity they share, and picks out the one sequence that violates it. Experiments on the UCF101 dataset confirm the effectiveness of the method, which identifies wrong-order sequences with high accuracy.
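The label-free training setup described in the abstract can be illustrated with a minimal sketch of how one training sample might be built: several frame-index sequences are sampled from one video, one of them is shuffled, and its position serves as the target. The abstract does not give the number of sequences, the sequence length, or the sampling scheme, so `num_sequences`, `seq_len`, the uniform sampling, and the function name `make_odd_one_out_sample` are all assumptions made for illustration, not the authors' implementation.

```python
import random


def make_odd_one_out_sample(num_frames, num_sequences=4, seq_len=6, rng=None):
    """Build one self-labelled sample from a video with `num_frames` frames.

    Returns (sequences, odd_index). `sequences` is a list of `num_sequences`
    lists of frame indices. Every list is in ascending temporal order except
    sequences[odd_index], whose frames have been shuffled.
    All parameter values here are hypothetical defaults.
    """
    rng = rng or random.Random()
    if seq_len < 2:
        raise ValueError("a sequence needs at least two frames to be misordered")
    if num_frames < seq_len:
        raise ValueError("video is shorter than one sequence")

    # Sample distinct frame indices and sort them: a correctly ordered sequence.
    sequences = [
        sorted(rng.sample(range(num_frames), seq_len))
        for _ in range(num_sequences)
    ]

    # Pick one sequence and shuffle it until its order really differs.
    odd_index = rng.randrange(num_sequences)
    ordered = sequences[odd_index]
    shuffled = ordered[:]
    while shuffled == ordered:
        rng.shuffle(shuffled)
    sequences[odd_index] = shuffled

    # The label comes from the construction itself, not from human annotation.
    return sequences, odd_index


if __name__ == "__main__":
    seqs, label = make_odd_one_out_sample(num_frames=120, rng=random.Random(0))
    for i, seq in enumerate(seqs):
        print(i, seq, "<- shuffled" if i == label else "")
```

In a full pipeline, each index list would be used to gather the corresponding frames into a clip, each clip would pass through one 3D-convolutional branch, and the network would be trained to predict `odd_index`. Because the target is produced by the shuffling step, no semantic labels are needed.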

Authors: MIAO Yu-jie (缪宇杰); WU Zhi-jun (吴智钧); GONG Jing (宫婧) (School of Internet of Things, Nanjing University of Posts and Telecommunications, Nanjing 210003, China; School of Science, Nanjing University of Posts and Telecommunications, Nanjing 210003, China)

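Affiliations: School of Internet of Things, Nanjing University of Posts and Telecommunications; School of Science, Nanjing University of Posts and Telecommunications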

Source: Computer Technology and Development (《计算机技术与发展》), 2018, No. 5, pp. 179-181, 186 (4 pages)

Funding: National Natural Science Foundation of China (61373135); Nanjing Six Talent Peaks Funding Project (Class C)

Keywords: unsupervised learning; convolutional neural network (CNN); wrong-frame sequence identification; 3D convolution

Classification code: TP301 [Automation and Computer Technology: Computer System Architecture]
