期刊文献+

融合策略优选和双注意力的单阶段目标检测 被引量:4

Single stage object detection algorithm based on fusing strategy optimization selection and dual attention mechanism
原文传递
导出
摘要 目的 特征融合是改善模糊图像、小目标以及受遮挡物体等目标检测困难的有效手段之一,为了更有效地利用特征融合来整合不同网络层次的特征信息,显著表达其中的重要特征,本文提出一种基于融合策略优选和双注意力机制的单阶段目标检测算法FDA-SSD(fusion double attention single shot multibox detector)。方法 设计融合策略优化选择方法,结合特征金字塔(feature pyramid network, FPN)来确定最优的多层特征图组合及融合过程,之后连接双注意力模块,通过对各个通道和空间特征的权重再分配,提升模型对通道特征和空间信息的敏感性,最终产生包含丰富语义信息和凸显重要特征的特征图组。结果 本文在公开数据集PASCAL VOC2007(pattern analysis, statistical modelling and computational learning visual object classes)和TGRS-HRRSD-Dataset(high resolution remote sensing detection)上进行对比实验,结果表明,在输入为300×300像素的PASCAL VOC2007测试集上,FDA-SSD模型的精度达到79.8%,比SSD(single shot multibox detector)、RSSD(rainbow SSD)、DSSD(de-convolution SSD)、FSSD(feature fusion SSD)模型分别高了2.6%、1.3%、1.2%、1.0%,在Titan X上的检测速度为47帧/s(frame per second, FPS),与SSD算法相当,分别高于RSSD和DSSD模型12 FPS和37.5 FPS。在输入像素为300×300的TGRS-HRRSD-Dataset测试集上的精度为84.2%,在Tesla V100上的检测速度高于SSD模型10%的情况下,准确率提高了1.5%。结论 通过在单阶段目标检测模型中引入融合策略选择和双注意力机制,使得预测的速度和准确率同时得到提升,并且对于小目标、受遮挡以及模糊图像等难目标的检测能力也得到较大提升。 Objective Object detection is essential to computer vision and in-depth learning recently. It has been widely used in industrial detection, intelligent transportation, human facial recognition and contexts. There are two main categories of recognized target detection algorithms. One of current target detection algorithms is two-stage algorithm, such as region-based convolution neural network(R-CNN), Fast R-CNN, online hard example mining(OHEM), Faster R-CNN, Mask R-CNN etc. The methods generate target candidate boxes first, and implement the candidate boxes classification and regression following. The other one is single-stage algorithms, such as you only look once(YOLO), single shot multibox detector(SSD) etc. In addition, the demonstrated corner network(CornerNet) & center network(CenterNet)-anchor free models have tried to ignore the anchor frame and conduct detection and matching based on key points, which has achieved quite good results, but there is still a little gap from the detection method based on anchor frame. In the practical application of single-stage target detection, a main challenging issue is target detection like blurred image, small target and occluded object, and the predicted performance and efficiency. Feature fusion can improve the detection ability of difficult targets effectively by fusing different deep and shallow features of the network, which has been used in many improved SSD models in common. However, most of the improved models use feature fusion methods directly, and the specific fusion strategies like the issues of fused graphs option and fused graphs processing. In addition, current attention mechanism can make the feature graph have a certain “focus” effect by giving dimension weight. The issue of combining attention mechanism to single-stage target detection effectively has its potentials. Method The shallow Visual Geometry Group(VGG) network in the original SSD algorithm is replaced by the deep residual network as the backbone network. First, an optimized selection meth
作者 戴坤 许立波 黄世旸 李鋆铃 Dai Kun;Xu Libo;Huang Shiyang;Li Yunling(School of Computer and Data Enginering,NingboTech Unixersity,Ningbo 315000,China)
出处 《中国图象图形学报》 CSCD 北大核心 2022年第8期2430-2443,共14页 Journal of Image and Graphics
基金 国家自然科学基金项目(61872321) 宁波市科技创新2025重大专项项目(2019B10036,2020Z005)。
关键词 单阶段目标检测 SSD算法 特征金字塔(FPN) 特征融合 注意力机制 single-stage object detection single shot multibox detector(SSD) feature pyramid network(FPN) feature fusion attention mechanism
  • 相关文献

参考文献7

二级参考文献133

  • 1侯志强,韩崇昭.视觉跟踪技术综述[J].自动化学报,2006,32(4):603-617. 被引量:254
  • 2万缨,韩毅,卢汉清.运动目标检测算法的探讨[J].计算机仿真,2006,23(10):221-226. 被引量:121
  • 3王永忠,潘泉,赵春晖,程咏梅.一种对光照变化鲁棒的均值漂移跟踪方法[J].电子与信息学报,2007,29(10):2287-2291. 被引量:5
  • 4王震宇,张可黛,吴毅,卢汉清.基于SVM和AdaBoost的红外目标跟踪[J].中国图象图形学报,2007,12(11):2052-2057. 被引量:11
  • 5Adam A,Rivlin E,Shimshoni I.Robust fragments-basedtracking using theintegral histogram[C]// Proc of the 19th IEEE Computer Vision and Pattern Recognition.LosAlamitos,CA:IEEE Computer Society,2006;798-805. 被引量:1
  • 6Comaniciu D,Ramesh V,Meer P.Kernel-based objecttracking[J],IEEE Trans on Pattern Analysis and Machine Intelligence,2003,25(5):564-575. 被引量:1
  • 7Liang D,Huang Q,Jiang S,et al.Mean-shift blob trackingwith adaptive feature selection and scale adaptation[C]//Proc of the 11th IEEE Int Conf on Computer Vision.LosAlamitos,CA:IEEE Computer Society,2007:369-372. 被引量:1
  • 8Ning J,Zhang L,Zhang D,et al.Scale and orientationadaptive mean shift tracking[J].Computer Vision,IET,2012,6(1);52-61. 被引量:1
  • 9Yu T,Wu Y.Differential tracking based on spatial-appearance model (SAM)[C]// Proc of the 19th IEEE Computer Vision and Pattern Recognition.Los Alamitos,CA:IEEE Computer Society,2006:720-727. 被引量:1
  • 10Han B,Davis L.On-line density-based appearance modeling for object tracking[C]// Proc of the 10th IEEE Int Conf onComputer Vision.Los Alamitos,CA:IEEE Computer Society,2005:1492-1499. 被引量:1

共引文献538

同被引文献18

引证文献4

二级引证文献6

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部