图像与点云多重信息感知关联的三维多目标跟踪被引量：1

3D multi-object tracking based on image and point cloud multi-information perception association

导出

摘要目的三维多目标跟踪是一项极具挑战性的任务,图像和点云的多模态融合能够提升多目标跟踪性能,但由于场景的复杂性以及多模态数据类型的不同,融合的充分性和关联的鲁棒性仍是亟待解决的问题。因此,提出图像与点云多重信息感知关联的三维多目标跟踪方法。方法首先,提出混合软注意力模块,采用通道分离技术对图像语义特征进行增强,更好地实现通道和空间注意力之间的信息交互。然后,提出语义特征引导的多模态融合网络,将点云特征、图像特征以及逐点图像特征进行深度自适应持续融合,抑制不同模态的干扰信息,提高网络对远距离小目标以及被遮挡目标的跟踪效果。最后,构建多重信息感知亲和矩阵,利用交并比、欧氏距离、外观信息和方向相似性等多重信息进行数据关联,增加轨迹和检测的匹配率,提升跟踪性能。结果在KITTI和NuScenes两个基准数据集上进行评估并与较先进跟踪方法进行对比。KITTI数据集上,HOTA(higher order tracking accuracy)和MOTA(multi-object tracking accuracy)指标分别达到76.94%和88.12%,相比于对比方法中性能最好的模型,分别提升1.48%和3.49%。NuScenes数据集上,AMOTA(average multi-object tracking accuracy)和MOTA指标分别达到68.3%和57.9%,相比于对比方法中性能最好的模型,分别提升0.6%和1.1%,两个数据集上的整体性能均优于先进的跟踪方法。结论提出的方法能够准确地跟踪复杂场景下的目标,具有更好的跟踪鲁棒性,更适合处理自动驾驶场景中的三维多目标跟踪任务。 Objective 3D multi object tracking is a challenging task in autonomous driving,which plays a crucial role in improving the safety and reliability of the perception system.RGB cameras and LiDAR sensors are the most commonly used sensors for this task.While RGB cameras can provide rich semantic feature information,they lack depth information.LiDAR point clouds can provide accurate position and geometric information,but they suffer from problems such as dense near distance and sparse far distance,disorder,and uneven distribution.The multimodal fusion of images and point clouds can improve multi object tracking performance,but due to the complexity of the scene and multimodal data types,the existing fusion methods are less effective and cannot obtain rich fusion features.In addition,existing methods use the intersection ratio or Euclidean distance between the predicted and detected bounding boxes of objects to calculate the similarity between objects,which can easily cause problems such as trajectory fragmentation and identity switching.Therefore,the adequacy of multimodal data fusion and the robustness of data association are still urgent problems to be solved.To this end,a 3D multi object tracking method based on image and point cloud multi-information perception association is proposed.Method First,a hybrid soft attention module is proposed to enhance the image semantic features using channel separation techniques to improve the information interaction between channel and spatial attention.The module includes two submodules.The first one is the soft channel attention submodule,which first compresses the spatial information of image features into the channel feature vector after the global average pooling layer,followed by two fully connected layers to capture the correlation between channels,followed by the Sigmoid function processing to obtain the channel attention map,and finally multiplies with the original features to obtain the channel enhancement features.The second is the soft spatial attention submodule.To

作者刘祥李辉程远志孔祥振陈双敏 Liu Xiang;Li Hui;Cheng Yuanzhi;Kong Xiangzhen;Chen Shuangmin(School of Information Science and Technology,Qingdao University of Science and Technology,Qingdao 266061,China;School of Computer Science and Technology,Harbin Institute of Technology,Harbin 150006,China;Department of Industrial Engineering and Innovation Sciences,Eindhoven University of Technology,Eindhoven 5612,the Netherlands)

机构地区青岛科技大学信息科学技术学院哈尔滨工业大学计算机科学与技术学院荷兰埃因霍芬理工大学工业工程学院

出处《中国图象图形学报》 CSCD 北大核心 2024年第1期163-178,共16页 Journal of Image and Graphics

基金国家自然科学基金项目(62002190,61702295) 山东省高等学校青创科技支持计划项目(2019KJN047) 山东省自然科学基金项目(ZR2020MF036)。

关键词点云三维多目标跟踪注意力多模态融合数据关联 point cloud 3D multi-object tracking attention multimodal fusion data association

分类号 TP391 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献8

1李功,赵巍,刘鹏,唐降龙.一种用于目标跟踪边界框回归的光滑IoU损失[J].自动化学报,2023,49(2):288-306. 被引量：10
2李娇娇,孙红岩,董雨,张若晗,孙晓鹏.基于深度学习的3维点云处理综述[J].计算机研究与发展,2022,59(5):1160-1179. 被引量：9
3李宗民,姚纯纯,刘玉杰,李华.点云场景下基于结构感知的车辆检测[J].计算机辅助设计与图形学学报,2021,33(3):405-412. 被引量：5
4刘旖菲,胡学敏,陈国文,刘士豪,陈龙.视觉感知的端到端自动驾驶运动规划综述[J].中国图象图形学报,2021,26(1):49-66. 被引量：7
5A.A.M.Muzahid,Wanggen Wan,Ferdous Sohel,Lianyao Wu,Li Hou.CurveNet:Curvature-Based Multitask Learning Deep Networks for 3D Object Recognition[J].IEEE/CAA Journal of Automatica Sinica,2021,8(6):1177-1187. 被引量：2
6张珂,冯晓晗,郭玉荣,苏昱坤,赵凯,赵振兵,马占宇,丁巧林.图像分类的深度卷积神经网络模型综述[J].中国图象图形学报,2021,26(10):2305-2325. 被引量：99
7张燕咏,张莎,张昱,吉建民,段逸凡,黄奕桐,彭杰,张宇翔.基于多模态融合的自动驾驶感知及计算[J].计算机研究与发展,2020,57(9):1781-1799. 被引量：21
8朱向雷,王海弛,尤翰墨,张蔚珩,张颖异,刘爽,陈俊洁,王赞,李克秋.自动驾驶智能系统测试研究综述[J].软件学报,2021,32(7):2056-2077. 被引量：27

二级参考文献23

1姚建华,吴加敏,杨勇,施祖贤.全卷积神经网络下的多光谱遥感影像分割[J].中国图象图形学报,2020,0(1):180-192. 被引量：16
2徐琳琳,张树美,赵俊莉.构建并行卷积神经网络的表情识别算法[J].中国图象图形学报,2019,24(2):227-236. 被引量：50
3穆春迪,谢剑斌,闫玮,刘通,李沛秦.面向动摄像机的高速运动目标检测[J].中国图象图形学报,2015,20(3):349-356. 被引量：4
4李军,吕绍和,陈飞,阳国贵,窦勇.结合视觉注意机制与递归神经网络的图像检索[J].中国图象图形学报,2017,22(2):241-248. 被引量：7
5Zhiling Cai,William Zhu.Feature Selection for Multi-label Classification Using Neighborhood Preservation[J].IEEE/CAA Journal of Automatica Sinica,2018,5(1):320-330. 被引量：10
6Jianquan Gu,Haifeng Hu,Haoxi Li.Local Robust Sparse Representation for Face Recognition With Single Sample per Person[J].IEEE/CAA Journal of Automatica Sinica,2018,5(2):547-554. 被引量：5
7Yang Xing,Chen Lv,Long Chen,Huaji Wang,Hong Wang,Dongpu Cao,Efstathios Velenis,Fei-Yue Wang.Advances in Vision-Based Lane Detection：Algorithms,Integration,Assessment,and Perspectives on ACP-Based Parallel Vision[J].IEEE/CAA Journal of Automatica Sinica,2018,5(3):645-661. 被引量：16
8Xiang FENG,Wanggen WAN,Richard Yi Da XU,Haoyu CHEN,Pengfei LI,J. Alfredo SANCHEZ.A perceptual quality metric for 3D triangle meshes based on spatial pooling[J].Frontiers of Computer Science,2018,12(4):798-812. 被引量：1
9谌华,郭伟,闫敬文.综合边界和纹理信息的合成孔径雷达图像目标分割[J].中国图象图形学报,2019,24(6):882-889. 被引量：7
10白静,徐浩钧.MSP-Net:多尺度点云分类网络[J].计算机辅助设计与图形学学报,2019,31(11):1917-1924. 被引量：13

共引文献171

1陈慧娴,吴一全,张耀.基于深度学习的三维点云分析方法研究进展[J].仪器仪表学报,2023,44(11):130-158. 被引量：3
2李莉,陈心宇,高文斌.一种基于FPGA的卷积神经网络加速器实现方案[J].北京电子科技学院学报,2022,30(4):96-104. 被引量：1
3钱多,殷俊.基于俯视角融合的多模态三维目标检测[J].南京大学学报（自然科学版）,2023,59(6):996-1002. 被引量：1
4张银胜,杨宇龙,吉茹,蓝天鹤,单慧琳.改进YOLOv5s的风力涡轮机表面缺陷检测[J].电子测量与仪器学报,2023,37(1):40-49. 被引量：12
5刘斌,贾浩强,杨一,申佳,盖美辰,宋天霖.基于改进OpenPose算法的矿工危险行为识别研究[J].电视技术,2023,47(2):20-23. 被引量：1
6朱洪波,张在岩,秦育罗,宋伟东,张晋赫.农村路面多类型病害检测方法研究[J].测绘科学,2022,47(9):170-180. 被引量：2
7王嘉凯,刘艾杉,李思民,刘祥龙,吴文峻.智能系统全生命周期安全测试理论与方法[J].智能安全,2023,2(1):27-36.
8杨子勋,陈广新,李长荣,曹文超.基于计算机辅助诊断的皮肤癌良恶性诊断研究[J].新一代信息技术,2022,5(8):134-138.
9管淑贤,葛万成.基于ResNet18的减速带识别及其环境影响研究[J].通信技术,2021,54(3):597-603. 被引量：6
10吕品,许嘉,李陶深,徐文彪.面向自动驾驶的边缘计算技术研究综述[J].通信学报,2021,42(3):190-208. 被引量：18

同被引文献4

1Hantong XU,Jiamin XU,Weiwei XU.Survey of 3D modeling using depth cameras[J].Virtual Reality & Intelligent Hardware,2019,1(5):483-499. 被引量：4
2曹家乐,李亚利,孙汉卿,谢今,黄凯奇,庞彦伟.基于深度学习的视觉目标检测技术综述[J].中国图象图形学报,2022,27(6):1697-1722. 被引量：68
3黄哲,王永才,李德英.3D目标检测方法研究综述[J].智能科学与技术学报,2023,5(1):7-31. 被引量：2
4晋帅,李煊鹏,杨凤,张为公.伪激光点云增强的道路场景三维目标检测[J].中国图象图形学报,2023,28(11):3520-3535. 被引量：2

引证文献1

1贾明达,杨金明,孟维亮,郭建伟,张吉光,张晓鹏.融合点云与图像的环境目标检测研究进展[J].中国图象图形学报,2024,29(6):1765-1784.

1李驰,周颖玥,姚韩敏,李小霞,秦佳敏,庄鸣,文黎明.食道病灶检测的多尺度细节增强金字塔网络[J].计算机工程与应用,2024,60(4):229-236.
2赵伟,刘磊,王鲲鹏,涂铮铮,罗斌.基于多模态双向信息增强的RGBT跟踪网络[J].北京航空航天大学学报,2024,50(2):596-605.
3陈章聪,蔡兴博,沈自启,娄忠骑,吴莹,黄俊深,陆声.统计形状模型在脊柱外科的应用进展[J].骨科临床与研究杂志,2024,9(2):106-109.
4Zhenzhen SU,Hongbing JI,Cong TIAN,Yongquan ZHANG.Performance evaluation for multi-target tracking with temporal dimension specifics[J].Chinese Journal of Aeronautics,2024,37(2):446-458.
5LI Zhao,WANG Yidi,ZHENG Wei.Accurately tracking hypersonic gliding vehicles via an LEO mega-constellation in relay tracking mode[J].Journal of Systems Engineering and Electronics,2024,35(1):211-221.
6Jing LI,Weipeng LI,Xiaoyan ZHANG,Hai HUANG.Design optimization of a hexapod vibration isolation system for electro-optical payload[J].Chinese Journal of Aeronautics,2024,37(2):330-342.
7Wenxue Chen,Yudong Hu,Changsheng Gao,Ruoming An.Trajectory tracking guidance of interceptor via prescribed performance integral sliding mode with neural network disturbance observer[J].Defence Technology（防务技术）,2024,32(2):412-429. 被引量：1

中国图象图形学报

2024年第1期

浏览历史

内容加载中请稍等...

图像与点云多重信息感知关联的三维多目标跟踪被引量：1

参考文献8

二级参考文献23

共引文献171

同被引文献4

引证文献1

相关作者

相关机构

相关主题

浏览历史

图像与点云多重信息感知关联的三维多目标跟踪 被引量：1

参考文献8

二级参考文献23

共引文献171

同被引文献4

引证文献1

相关作者

相关机构

相关主题

浏览历史

图像与点云多重信息感知关联的三维多目标跟踪被引量：1