摘要
针对一般跟踪算法不能很好地解决航拍视频下目标分辨率低、视场大、视角变化多等特殊难点,该文提出一种融合目标显著性和在线学习干扰因子的无人机(UAV)跟踪算法。通用模型预训练的深层特征无法有效地识别航拍目标,该文跟踪算法能根据反向传播梯度识别每个卷积滤波器的重要性来更好地选择目标显著性特征,以此凸显航拍目标特性。另外充分利用连续视频丰富的上下文信息,通过引导目标外观模型与当前帧尽可能相似地来在线学习动态目标的干扰因子,从而实现可靠的自适应匹配跟踪。实验证明:该算法在跟踪难点更多的UAV123数据集上跟踪成功率和准确率分别比孪生网络基准算法高5.3%和3.6%,同时速度达到平均28.7帧/s,基本满足航拍目标跟踪准确性和实时性需求。
In view of the fact that the general tracking algorithm can not solve the special problems such as low resolution,large field of view and many changes of view angle,a Unmanned Aerial Vehicle(UAV)tracking algorithm combining target saliency and online learning interference factor is proposed.The deep feature that the general model pre-trained can not effectively identify the aerial target,the tracking algorithm can better select the salient feature of each convolution filter according to the importance of the back propagation gradient,so as to highlight the aerial target feature.In addition,it makes full use of the rich context information of the continuous video,and learn the interference factor of the dynamic target online by guiding the target appearance model as similar as possible to the current frame,so as to achieve reliable adaptive matching tracking.It is proved that the tracking success rate and accuracy rate of the algorithm are 5.3%and 3.6%higher than that of the siamese network benchmark algorithm on the more difficult UAV123 dataset,respectively,and the speed reaches an average of 28.7 frames per second,which basically meet the aerial target tracking accuracy and real-time requirements.
作者
孙锐
方林凤
梁启丽
张旭东
SUN Rui;FANG Linfeng;LIANG Qili;ZHANG Xudong(School of Computer and Information,Hefei University of Technology,Hefei 230009,China;Anhui Province Key Laboratory of Industry Safety and Emergency Technology,Hefei 230009,China)
出处
《电子与信息学报》
EI
CSCD
北大核心
2021年第5期1414-1423,共10页
Journal of Electronics & Information Technology
基金
国家自然科学基金(61471154,61876057)
安徽省重点研发计划-科技强警专项(202004d07020012)。
关键词
目标跟踪
无人机航拍场景
孪生网络
目标显著性
在线学习干扰因子
Object tracking
Unmanned Aerial Vehicle(UAV)scene
Siamese network
Target saliency
Online leaning interference factor