基于稀疏编码的时空金字塔匹配的动作识别被引量：2

Spatio-temporal Pyramid Matching Using Sparse Coding for Action Recognition

下载PDF

导出

摘要针对复杂场景下的动作识别,提出一种基于稀疏编码的时空金字塔匹配的动作识别方法.通过稀疏编码的方法学习更具有判别性的码书和计算局部块(cuboids)的稀疏表示;然后基于max pooling的时空金字塔匹配进行动作分类.该方法在KTH和YouTube两大公开数据集上进行了评价,实验结果表明,与基于K-means的时空金字塔匹配方法相比,该方法提高了2%-7%左右的识别率,在复杂的视频中取得了较好的识别效果. A spatio-temporal pyramid matching （STPM ） using sparse coding is proposed for action recognition in complex environment, which learns a more discriminative codebook and computes the cuboids＇sparse representations by sparse coding followed by action classification using the STPM based on the max pooling. Experiments are evaluated on KTH and YouTube datasets. The results demonstrate that our approach achieves 2% to 7% improvement over the STPM based on k-means and obtains high recognition rate in complex videos.

作者刘长红杨杨刘应辉

机构地区江西师范大学计算机信息工程学院北京科技大学信息工程学院北京邮电大学电子工程学院

出处《小型微型计算机系统》 CSCD 北大核心 2012年第1期169-172,共4页 Journal of Chinese Computer Systems

基金国家自然科学基金项目(60873192)资助江西省教育厅科技项目(GJJ09143)资助江西师范大学青年基金项目资助

关键词动作识别稀疏编码时空金字塔匹配词袋 action recognition sparse coding spatio-temporal pyramid matching bag of words

分类号 TP319 [自动化与计算机技术—计算机软件与理论]

引文网络
相关文献

参考文献18

1H, Battle A, Raina R, ctal. Efficient sparse coding algorithms [ C ]. Proceedings of Neural Information Processing Sys, terns, 2006:801-808. 被引量：1
2DoLlar P, Rabaud V, Cottrcll G, et al. Behavior recognition via sparse spatio-temporal features[ C ]. Proceedings of IEEE International Workshop on VSPETS, 2005:65-72. 被引量：1
3Schuldt C, l.,aptev I, Caputo B. Recognizing human actions: a local SVM approach [ C ]. Proceedings of the International Conference on Patuml Recognition, Los Alan~tos, C.alifomia, 2004:32-36. 被引量：1
4Liu Jr Yang Y, Shah M. Learning semantic visual vocabularies using diffusion distance [ C ]. Proceedings of the Conference on Computer Vision and Pattern Recognition, Los Alamitos, California, 2009:461-468. 被引量：1
5Lazebnik S, Schmid C, Ponce J. Beyond bags of features: spatial pyramid matching for recognizing natural scene categories [ C ]. Proceedings of the Conference on Computer Vision and Pattern Recognition, Los Alamitos, California, 2006:2169-2178. 被引量：1
6杨跃东,郝爱民,褚庆军,赵沁平,王莉莉.基于动作图的视角无关动作识别[J].软件学报,2009,20(10):2679-2691. 被引量：5
7Liu J, Shah M. Learning human actions via information maximization[C]. Proceedings of the Conference on Computer Vision and Pattern Recognition, Los Alamitos, California, 2008 : 1-8. 被引量：1
8Laptev I. On space-time interest points[ J]. International Journal of Computer Vision, 2005,64(2-3 ) : 107-123. 被引量：1
9Wong S, Kim T, CipoUa R. Learning motion categories using both somantics and structural information[ C]. Proceedings of the Conference on Computer Vision and Pattern Recognition, Los Alamitos, California, 2007 : 1-6. 被引量：1
10Donoho D. For most large underdetermined systems of linear quation the minimal ll-norm solution is also the sparsest solution[ J]. Comm. on Pure and Applied Mathematics, 2006,59(6) :797-829. 被引量：1

二级参考文献15

1Turaga P, Chellappa R, Subrahmanian V S, Udrea O. Machine recognition of human activities: A survey. IEEE Transactions on Circuits and Systems for Video Technology, 2008, 18(11): 1473-1488. 被引量：1
2Niebles J C, Wang H, Li Fei-Fei. Unsupervised learning of human action categories using spatial-temporal words. International Journal of Computer Vision, 2008, 79(3): 299-318. 被引量：1
3Oliver N M, Rosario B, Pentland A P. A Bayesian computer vision system for modeling human interactions. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2000, 22(8) : 831-843. 被引量：1
4Xiang T, Gong S. Beyond tracking: Modeling activity and understanding behavior. International Journal of Computer Vision, 2006, 67(1): 21-51. 被引量：1
5Ivanov Y A, Bobick A F. Recognition of visual activities and interactions by stochastic parsing. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2000, 22 (8): 852-872. 被引量：1
6Park S, Aggarwal J K. A hierarchical Bayesian network for event recognition of human action and interaction. ACM Journal of Multimedia Systems, Special Issue on Video Surveillance, 2004, 10(2): 164-179. 被引量：1
7Ryoo M S, Aggarwal J K. Recognition of composite human activities through context-free grammar based representation//Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. NY, USA, 2006, 1709-1719. 被引量：1
8Du Y, Chen F, Xu W, Zhang W. Activity recognition through multi-scale motion detail analysis. Neurocomputing, 2008, 71(16-18): 3561-3574. 被引量：1
9Hongeng S, Nevatia R, Bremond F. Video-hased event recognition: Activity representation and probabilistie reeognition methods. Computer Vision and Image Understanding, 2004, 96(2) : 129-162. 被引量：1
10Hakeem A, Shah M. Learning, detection and representation of multi-agent event in videos. Artificial Intelligence, 2007, 171(8-9): 586-605. 被引量：1

共引文献27

1徐从富,郝春亮,苏保君,楼俊杰.马尔可夫逻辑网络研究[J].软件学报,2011,22(8):1699-1713. 被引量：8
2吴联世,夏利民,罗大庸.人的交互行为识别与理解研究综述[J].计算机应用与软件,2011,28(11):60-63. 被引量：9
3汪超,黄东晋,王冠,丁友东.增强现实系统中基于先验知识的虚拟抓取识别[J].传感器与微系统,2011,30(12):28-31. 被引量：1
4金标,胡文龙,王宏琦.基于时空语义信息的视频运动目标交互行为识别方法[J].光学学报,2012,32(5):145-151. 被引量：6
5谌先敢,刘娟,高智勇,刘海华.基于累积边缘图像的现实人体动作识别[J].自动化学报,2012,38(8):1380-1384. 被引量：15
6刘海华,程志君,谌先敢,高智勇.基于注意机制的仿生人体动作识别[J].中南民族大学学报（自然科学版）,2012,31(2):64-70. 被引量：3
7赵海勇,李成友.基于多特征融合的运动人体行为识别[J].计算机应用研究,2012,29(8):3169-3172. 被引量：6
8余瑞星,袁博,宋军艳.一种新的时空局部特征提取方法及在目标识别中的应用[J].西北工业大学学报,2012,30(6):886-891.
9王科俊,吕卓纹,孙国振,阎涛.基于分层分数条件随机场的行为识别[J].计算机应用,2013,33(4):957-959. 被引量：3
10鲁统伟,任莹.基于光流的人体行为识别[J].电脑知识与技术,2013,9(3):1610-1612. 被引量：1

同被引文献7

1陈世佳,尹东,张荣,王德建.认知推理的家庭服务机器人演示学习研究[J].小型微型计算机系统,2013,34(6):1441-1445. 被引量：3
2谭论正,夏利民.基于协方差描述子和LogitBoost的交通场景图像分割[J].湖南大学学报（自然科学版）,2013,40(8):58-63. 被引量：1
3谭论正,夏利民,黄金霞,夏胜平.基于pLSA模型的人体动作识别[J].国防科技大学学报,2013,35(5):102-108. 被引量：4
4秦磊,胡琼,黄庆明,田琦.基于特征点轨迹的动作识别[J].计算机学报,2014,37(6):1281-1288. 被引量：18
5李鸿利,单征,郭浩然.基于MDTW的飞行动作识别算法[J].计算机工程与应用,2015,51(9):267-270. 被引量：19
6冯铭,陈军.基于局部二进制描述符的高效动作识别方法[J].小型微型计算机系统,2016,37(6):1289-1292. 被引量：3
7Xiao-Fei Ji,Qian-Qian Wu,Zhao-Jie Ju,Yang-Yang Wang.Study of Human Action Recognition Based on Improved Spatio-temporal Features[J].International Journal of Automation and computing,2014,11(5):500-509. 被引量：7

引证文献2

1谭论正,丁锐,夏利民.基于光流关键点多尺度轨迹的人体动作识别[J].计算机工程与设计,2017,38(9):2546-2550. 被引量：6
2刘沿,姚亚强,陈欢欢.一种使用3D骨架片段表示的人体动作识别方法[J].小型微型计算机系统,2018,39(3):508-514. 被引量：5

二级引证文献11

1赵雪章,居华倩,席运江.时空兴趣点结合HMM的人体动作识别方法[J].微型电脑应用,2018,34(12):1-4. 被引量：4
2肖瑜.多媒体视觉图像运动轨迹标识仿真研究[J].计算机仿真,2018,35(10):242-245. 被引量：1
3王婧,谷林.一种优化动作特征表示的动作姿态评测模型[J].西安工程大学学报,2019,33(5):562-567. 被引量：5
4张慧智,曹运华,田士合.基于红外激光器的人体动作捕获技术[J].激光杂志,2020,41(6):155-159.
5华钢,曹青峰,朱艾春,张赛,唐士宇,崔冉.多流卷积神经网络的骨架行为识别[J].小型微型计算机系统,2020,41(6):1286-1290. 被引量：4
6张继凯,顾兰君.基于骨架信息的人体动作识别与实时交互技术[J].内蒙古科技大学学报,2020,39(3):266-272. 被引量：4
7姜灵芝,李士途.一种应用深度知觉测试仪的人体行为预判检测方法研究[J].自动化与仪器仪表,2020(9):40-43.
8王彩玲.不同监控视频条件下行人动作特征三维识别方法[J].信息工程大学学报,2020,21(5):565-568.
9李丽,庄庆华.基于时域分割的人体行为连续性动作预测仿真[J].计算机仿真,2021,38(5):339-343. 被引量：1
10于燕山,郭鹏.基于改进坐标转换的人体运动轨迹识别方法[J].微型电脑应用,2021,37(7):111-115. 被引量：3

1王崇科,卫娟,王少东.视频剪辑查询结合时空金字塔匹配的视频检索方法[J].重庆邮电大学学报（自然科学版）,2015,27(3):411-417. 被引量：2
2刘硕明,刘佳.基于生成/判别混合模型的动作识别[J].电子技术与软件工程,2014(12):212-213.
3孟明,罗志增.基于眼动辅助脑电信号的手部动作分类方法[J].模式识别与人工智能,2012,25(6):1007-1012. 被引量：1
4刘硕明,刘佳.基于动作——身份模型的动作分类[J].中国新技术新产品,2014(8):16-16.
5李奇,李木子.基于KINECT的人机交互系统的研究[J].电脑知识与技术,2014,10(9X):6469-6471.
6丁承君,李根,苑光明,申敏,崔超.面向消防、救援、治安的无盲区定位系统的研究[J].天津工业大学学报,2014,33(1):59-64. 被引量：4
7王斌,刘煜,王炜,徐玮,张茂军.面向人体动作识别的局部特征时空编码方法[J].四川大学学报（工程科学版）,2014,46(2):72-78. 被引量：4
8胡斐,罗立民,刘佳,左欣.基于时空兴趣点和主题模型的动作识别[J].东南大学学报（自然科学版）,2011,41(5):962-966. 被引量：3
9禹继国,马炳先,曹宝香,刘桂真.多主体行为模拟的层次Petri网方法[J].计算机工程,2006,32(16):26-28. 被引量：4
10丁其川,赵新刚,韩建达.基于肌电信号容错分类的手部动作识别[J].机器人,2015,37(1):9-16. 被引量：12

小型微型计算机系统

2012年第1期

浏览历史

内容加载中请稍等...

基于稀疏编码的时空金字塔匹配的动作识别被引量：2

参考文献18

二级参考文献15

共引文献27

同被引文献7

引证文献2

二级引证文献11

相关作者

相关机构

相关主题

浏览历史

基于稀疏编码的时空金字塔匹配的动作识别 被引量：2

参考文献18

二级参考文献15

共引文献27

同被引文献7

引证文献2

二级引证文献11

相关作者

相关机构

相关主题

浏览历史

基于稀疏编码的时空金字塔匹配的动作识别被引量：2