Abstract
Objective Visible light and thermal infrared data are highly complementary, so RGBT (RGB-thermal) tracking has attracted increasing attention. Conventional RGBT trackers simply fuse the features of the two modalities, which limits tracking performance to some extent. This paper proposes a dynamic interaction and fusion method that collaboratively learns modality-specific and complementary representations for RGBT tracking. Method First, the features of the two modalities interact to generate multi-modality features, and an attention mechanism is applied in the modality-specific feature learning of each modality to enhance discriminability. Second, multi-modality features from different layers are fused to obtain rich spatial and semantic information, and a complementary feature learning module is designed to learn the complementary features of the two modalities. Finally, a dynamic weight loss function is proposed, which constrains the consistency and uncertainty of the predictions of the two modality-specific branches to adaptively optimize the parameters of the whole network. Result Experiments on two benchmark RGBT tracking datasets show that the proposed method achieves a precision rate (PR) of 79.2% and a success rate (SR) of 55.8% on RGBT234, and a PR of 86.1% and an SR of 70.9% on GTOT (grayscale-thermal object tracking). Comparative experiments on RGBT234 and GTOT further verify the effectiveness of the algorithm and show that it improves RGBT tracking results. Conclusion The proposed RGBT tracking algorithm effectively exploits the complementarity between the two modalities and achieves good tracking accuracy.
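To make the dynamic weight loss concrete, the following is a minimal PyTorch-style sketch of one plausible formulation, in which per-sample branch losses are modulated by prediction consistency and entropy-based uncertainty. The function name, the cross-entropy branch losses, and the exact weighting rule are illustrative assumptions, not the paper's actual definition.

import torch
import torch.nn.functional as F

def dynamic_weight_loss(logits_rgb, logits_tir, target):
    # Per-sample losses of the two modality-specific branches.
    loss_rgb = F.cross_entropy(logits_rgb, target, reduction="none")
    loss_tir = F.cross_entropy(logits_tir, target, reduction="none")
    p_rgb = F.softmax(logits_rgb, dim=1)
    p_tir = F.softmax(logits_tir, dim=1)
    # Consistency: agreement between the two predictive distributions.
    consistency = (p_rgb * p_tir).sum(dim=1)
    # Uncertainty: entropy of each branch; lower entropy -> larger weight.
    ent_rgb = -(p_rgb * p_rgb.clamp_min(1e-8).log()).sum(dim=1)
    ent_tir = -(p_tir * p_tir.clamp_min(1e-8).log()).sum(dim=1)
    w_rgb = torch.exp(-ent_rgb)
    w_tir = torch.exp(-ent_tir)
    # Detach the weights so they modulate the loss without becoming
    # additional optimization targets themselves.
    weighted = consistency.detach() * (w_rgb.detach() * loss_rgb
                                       + w_tir.detach() * loss_tir)
    return weighted.mean()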
Objective Visual object tracking is applied in computer vision tasks such as video surveillance, autonomous driving systems, and human-computer interaction. Thermal infrared cameras have the advantages of a long sensing range, strong penetrating ability, and the capability to reveal hidden objects. As a branch of visual tracking, RGBT (RGB-thermal) tracking aims to estimate the state of the target in a video sequence by aggregating complementary data from two different modalities, given the ground-truth bounding box in the first frame of the sequence. Previous RGBT tracking algorithms are constrained by traditional handcrafted features or are insufficient to explore and utilize the complementary information from different modalities. To explore the complementary information between the two modalities, we propose a dynamic interaction and fusion method for RGBT tracking. Method Generally, RGB images capture the visual appearance of the target (e.g., colors and textures), while thermal images acquire temperature information that is robust to lighting conditions and background clutter. To obtain more powerful representations, the useful information of the other modality can be introduced. However, because the obtained modality features carry some noise, the fusion of different modalities is commonly limited to simple addition or concatenation. First, a modality interaction module is presented to suppress clutter noise based on a multiplication operation. Second, a fusion module is designed to gather cross-modality features from all layers; it captures different abstractions of the target representation for more accurate localization. Third, a gate-guided complementary learning structure computes the complementary features of the different modalities. The gate takes as input the modality-specific features and the cross-modality features obtained from the fusion module, and its output is a numerical value; the complementary features are obtained by a dot product between this value and the cross-modality features. Finally, a dynamic weight loss function is proposed, which constrains the consistency and uncertainty of the predictions of the two modality-specific branches so that the parameters of the whole network are optimized adaptively. Result On the RGBT234 dataset, the proposed method achieves a precision rate of 79.2% and a success rate of 55.8%; on the GTOT (grayscale-thermal object tracking) dataset, it achieves a precision rate of 86.1% and a success rate of 70.9%, improving over the compared RGBT trackers. Conclusion The proposed algorithm effectively mines the complementarity between the two modalities and achieves good tracking accuracy.
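The interaction and gate mechanism described above can be read, under assumptions, as the following minimal PyTorch sketch. The element-wise multiplication, the pooled-feature gate, and all shapes and layer choices are hypothetical stand-ins for the paper's actual modules.

import torch
import torch.nn as nn

def modality_interaction(f_rgb, f_tir):
    # Element-wise multiplication keeps responses shared by both modalities
    # and suppresses clutter present in only one of them (assumed reading
    # of the multiplication-based interaction module).
    return f_rgb * f_tir

class ComplementaryGate(nn.Module):
    # Hypothetical gate: maps pooled modality-specific and cross-modality
    # features to a single value, then scales the cross-modality features.
    def __init__(self, channels):
        super().__init__()
        self.fc = nn.Linear(3 * channels, 1)

    def forward(self, f_rgb, f_tir, f_cross):
        # All inputs: (B, C, H, W). Global-average-pool each, then concatenate.
        pooled = torch.cat([f.mean(dim=(2, 3)) for f in (f_rgb, f_tir, f_cross)], dim=1)
        g = torch.sigmoid(self.fc(pooled))  # (B, 1), the gate's numerical output
        # "Dot product" of the gate value with the cross-modality features,
        # i.e., a broadcast scaling that yields the complementary features.
        return f_cross * g.view(-1, 1, 1, 1)

For example, with channels=512 the gate scales a (B, 512, H, W) cross-modality map by a per-sample scalar in (0, 1), so unreliable fused evidence can be downweighted.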
Authors
王福田
张淑云
李成龙
罗斌
Wang Futian; Zhang Shuyun; Li Chenglong; Luo Bin (Anhui Provincial Key Laboratory of Multimodal Cognitive Computation, School of Computer Science and Technology, Anhui University, Hefei 230000, China; Institute of Artificial Intelligence, Hefei Comprehensive National Science Center, Hefei 230000, China)
Source
《中国图象图形学报》
CSCD
Peking University Core Journals
2022, No. 10, pp. 3010-3021 (12 pages)
Journal of Image and Graphics
Funding
National Natural Science Foundation of China (62076003)
Collaborative Innovation Project of Anhui Universities (GXXT-2019-007)
Natural Science Foundation of Anhui Province (1908085MF206)
Keywords
modality interaction
modality fusion
complementary feature learning
modality-specific information
RGBT object tracking