基于视觉词汇形状描述的图像表示方法被引量：1

Image representation based on visual vocabulary shape description

下载PDF

导出

摘要针对目前图像表示中引入空间位置信息的空间金字塔匹配方法缺乏对图像中视觉物体平移、缩放和旋转的考虑,提出一种基于视觉词汇形状描述模型的图像表示方法。该方法相对于每个视觉单词的几何中心建立空间几何模型,保证平移不变性;给出对数极坐标空间金字塔匹配,对对数极半径做归一化,保证缩放不变性;在空间金字塔划分过程中确定极角的主方向,从而保证旋转不变性。分别在Caltech-101数据集和自建图像数据集上对该方法进行了验证和比较。实验结果表明,该方法提高了分类识别准确率,特别是对于包含明显平移、缩放和旋转变化的图像数据集;该方法的方差较小,说明其鲁棒性更强。 The Spatial Pyramid Matching（SPM）approach,which is based on approximate global geometric correspondence,disregards invariance to translation,scale and rotation of visual objects in images.This paper proposes an image representation method based on visual vocabulary shape description model.According to this method,spatial geometric model relative to the geometric center of each visual word is constructed to guarantee translation invariance;this paper presents log polar spatial pyramid matching,log polar radius is normalized and a consistent orientation to visual word is assigned in order to achieve scaling and rotation invariance.Experiments have been conducted for comparing and evaluating the proposed method utilizing the Caltech-101 dataset and this paper’s dataset.Experimental results show that the proposed method improves the classification accuracy,especially for the dataset containing images with obvious translation,scaling and rotation changes,and is more robust because of its smaller variance.

作者王红霞杨克俭张敏艾浩军陈先桥

机构地区武汉理工大学计算机科学与技术学院武汉大学计算机学院

出处《计算机工程与应用》 CSCD 2012年第21期191-196,204,共7页 Computer Engineering and Applications

基金国家自然科学基金(No.51179146) 武汉市科学技术局科技攻关计划项目(No.201010621208)

关键词物体分类视觉词袋模型图像表示空间金字塔匹配视觉词汇形状描述模型 object categorization bag-of-visual-words image representation spatial pyramid matching visual vocabulary shape description model

分类号 TP391.4 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献10

1Leibe B,Leonardis A, Schiele B.Combined object catego- rization and segmentation with an implicit shape model[C]// Workshop on Statistical Learning in Computer Vision, ECCV, Prague, 2004. 被引量：1
2Lazebnik S, Schmid C, Ponce J.Beyond bags of features: spatial pyramid matching for recognizing natural scene categories[C]//Proceedings-2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR, 2006: 2169-2178. 被引量：1
3Salti S,Tombari F.On the use of implicit shape models for recognition of object categories in 3D data[C]//Lec- ture Notes in Computer Science, 2011,6494(3 ) : 653-666. 被引量：1
4Pan Hong, Zhu Yaping, Xia Liangzheng, et al.Combining generic and class-specific codebooks for object categori- zation and detection[C]//IEEE International Conference on Acoustics, Speech and Signal Processing-Proceedings, ICASSP, 2011 : 2264-2267. 被引量：1
5Lampert C H,Blaschko M B,Hofmann T.Efficient sub- window search: a branch and bound framework for ob- ject localization[C]//IEEE Transactions on Pattern Analy- sis and Machine Intelligence,2009,31 (12) :2129-2142. 被引量：1
6Yang Jianchao, Yu Kai, Gong Yihong, et al.Linear spatial pyramid matching using sparse coding for image classi- fication[C]//2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, CVPR, 2009: 1794-1801. 被引量：1
7Wang Jinjun, Yang Jianchao, Yu Kai, et al.Locality-con- strained linear coding for image classification[C]//Pro- ceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR, 2010: 3360-3367. 被引量：1
8Perronnin F, Sanchez J, Mensink T.Improving the Fisher kernel for large-scale image classification[C]//Lecture Notes in Computer Science,2010,6314(4) : 143-156. 被引量：1
9Zhou Xi, Yu Kai, Zhang Tong, et al.Image classification using super-vector coding of local image descriptors[C]// Lecture Notes in Computer Science, 2010, 6315(5): 141-154. 被引量：1
10Li Feifei,Rob F,Pietro ELearning generative visual mod- els from few training examples: an incremental bayesian approach tested on 101 object categories[J].Computer Vision and Image Understanding, 2007,106( 1 ) : 59-70. 被引量：1

同被引文献2

1王欢,汪同庆,李阳.利用Kinect深度信息的三维点云配准方法研究[J].计算机工程与应用,2016,52(12):153-157. 被引量：13
2彭天强,栗芳.哈希编码结合空间金字塔的图像分类[J].中国图象图形学报,2016,21(9):1138-1146. 被引量：8

引证文献1

1向程谕,王冬丽,周彦,李雅芳.基于RGB-D融合特征的图像分类[J].计算机工程与应用,2018,54(8):178-182. 被引量：7

二级引证文献7

1朱峰山.丛林式盆景的制作[J].花卉,2000(3):21-21.
2彭媛,段先华,王万耀,鲁文超.基于多线索特征融合的图像分类方法[J].计算机工程与应用,2019,55(20):164-169. 被引量：2
3王灿,李凤莲,胡风云,张雪英,贾文辉.面向特征融合的脑卒中脑电信号分类方法[J].计算机工程与应用,2019,55(24):154-158.
4李凌乐,李瑞华.基于RGB-D弹性可形变物体跟踪识别控制披萨厨师机器人方法研究[J].食品与机械,2020,36(2):100-104.
5陈卓然,丛飚,张会萍.基于视觉词典的多目标截面投影图像特征分割[J].计算机仿真,2020,37(6):347-351. 被引量：2
6李珣,李林鹏,Alexander Lazovik,王文杰,王晓华.基于改进双流卷积递归神经网络的RGB-D物体识别方法[J].光电工程,2021,48(2):21-30. 被引量：7
7李珣,王高平,李林鹏,王晓华,景军锋,张凯兵.基于RGB-D图像的物体识别方法[J].西安工程大学学报,2021,35(4):55-70. 被引量：9

1张启忠,杨纪春,罗志增.用于物体分类的多传感器集成与信息融合系统[J].模式识别与人工智能,1998,11(1):112-117. 被引量：11
2张立和,潘磊,刘涛,马臣.基于核拉普拉斯稀疏编码的图像分类[J].大连理工大学学报,2015,55(2):192-197. 被引量：2
3刘栋,李素,曹志冬.深度学习及其在图像物体分类与检测中的应用综述[J].计算机科学,2016,43(12):13-23. 被引量：31
4徐涛,庹红娅,方正,刘力,敬忠良.基于特征筛选的码本区分性增强方法[J].计算机应用研究,2014,31(5):1597-1600.
5江悦,王润生,王程.采用上下文金字塔特征的场景分类[J].计算机辅助设计与图形学学报,2010,22(8):1366-1373. 被引量：14
6黄凯奇,任伟强,谭铁牛.图像物体分类与检测算法综述[J].计算机学报,2014,37(6):1225-1240. 被引量：195
7付毅,田畅,吴泽民,曾明勇,胡银记.一种快速的全局场景分类算法[J].红外与激光工程,2013,42(S01):242-248. 被引量：1
8李娜.全局多阶统计中混合应用局部多核度量学习的实验分析[J].电脑迷,2016(10).
9李建生,李艺.基于形状的图书馆图像特征提取模型[J].情报杂志,2006,25(12):30-31.
10赵嵩,冯湘.一种基于稀疏编码空间金字塔匹配的图像分类算法[J].应用光学,2016,37(5):706-711. 被引量：2

计算机工程与应用

2012年第21期

浏览历史

内容加载中请稍等...

基于视觉词汇形状描述的图像表示方法被引量：1

参考文献10

同被引文献2

引证文献1

二级引证文献7

相关作者

相关机构

相关主题

浏览历史

基于视觉词汇形状描述的图像表示方法 被引量：1

参考文献10

同被引文献2

引证文献1

二级引证文献7

相关作者

相关机构

相关主题

浏览历史

基于视觉词汇形状描述的图像表示方法被引量：1