基于单目视觉惯性的同步定位与地图构建方法综述

A review of monocular visual-inertial SLAM

导出

摘要单目视觉惯性同步定位与地图构建(visual-inertial simultaneous localization and mapping,VI-SLAM)技术因具有硬件成本低、无需对外部环境进行布置等优点,得到了广泛关注,在过去的十多年里取得了长足的进步,涌现出诸多优秀的方法和系统。由于实际场景的复杂性,不同方法难免有各自的局限性。虽然已经有一些工作对VISLAM方法进行了综述和评测,但大多只针对经典的VI-SLAM方法,已不能充分反映最新的VI-SLAM技术发展现状。本文首先对基于单目VI-SLAM方法的基本原理进行阐述,然后对单目VI-SLAM方法进行分类分析。为了综合全面地对比不同方法之间的优劣势,本文特别选取3个公开数据集对代表性的单目VI-SLAM方法从多个维度上进行定量评测,全面系统地分析了各类方法在实际场景尤其是增强现实应用场景中的性能。实验结果表明,基于优化或滤波和优化相结合的方法一般在跟踪精度和鲁棒性上比基于滤波的方法有优势,直接法/半直接法在全局快门拍摄的情况下精度较高,但容易受卷帘快门和光照变化的影响,尤其是大场景下误差累积较快;结合深度学习可以提高极端情况下的鲁棒性。最后,针对深度学习与V-SLAM/VI-SLAM结合、多传感器融合以及端云协同这3个研究热点,对SLAM的发展趋势进行讨论和展望。 Monocular visual-inertial simultaneous localization and mapping(VI-SLAM)is an important research topic in computer vision and robotics.It aims to estimate the pose(i.e.,the position and orientation)of the device in real-time using a monocular camera with an inertial sensor while constructing the map of the environment.With the rapid development of various fields,such as augmented/virtual reality(AR/VR),robotics,and autonomous driving,monocular VISLAM has received widespread attention due to its advantages,including low hardware cost and no requirement for an external environment setup,among others.Over the past decade or so,monocular VI-SLAM has made significant progress and spawned many excellent methods and systems.However,because of the complexity of real-world scenarios,different methods have also shown distinct limitations.Although some works have reviewed and evaluated VI-SLAM methods,most of them only focus on classic methods,which cannot fully reflect the latest development status of VI-SLAM technology.Based on optimization type,VI-SLAM can be divided into filtering-and optimization-based methods.Filtering-based methods use filters to fuse observations from visual and inertial sensors,continuously updating the device’s state information for localization and mapping.Additionally,depending on whether visual data association(or feature matching)is performed separately,existing methods can be divided into indirect methods(or feature-based methods)and direct methods.Furthermore,with the development and widespread application of deep learning technology,researchers have started to incorporate deep learning methods into VI-SLAM to enhance robustness in extreme conditions or perform dense reconstruction.This paper first elaborates on the basic principles of monocular VI-SLAM methods and then classifies them analytically into direct and filtering-,optimization-,feature-,and deep learning-based methods.However,most of the existing datasets and benchmarks are focused on applications like autonomous driving and dro

作者章国锋黄赣谢卫健陈丹鹏王楠刘浩敏鲍虎军 Zhang Guofeng;Huang Gan;Xie Weijian;Chen Danpeng;Wang Nan;Liu Haomin;Bao Hujun(State Key Laboratory of CAD&CG,Zhejiang University,Hangzhou 310058,China;SenseTime Research,Hangzhou 311215,China)

机构地区浙江大学计算机辅助设计与图形系统全国重点实验室商汤研究院

出处《中国图象图形学报》 CSCD 北大核心 2024年第10期2839-2858,共20页 Journal of Image and Graphics

基金国家自然科学基金项目(61932003)。

关键词视觉惯性同步定位与地图构建(VI-SLAM) 增强现实(AR) 视觉惯性数据集多视图几何多传感器融合 visual-inertial SLAM(VI-SLAM) augmented reality(AR) visual-inertial dataset multiple-view geometry multi-sensor fusion

分类号 TP391.4 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献5

1潘小鹍,刘浩敏,方铭,王政,张涌,章国锋.基于语义概率预测的动态场景单目视觉SLAM[J].中国图象图形学报,2023,28(7):2151-2166. 被引量：4
2曾庆化,罗怡雪,孙克诚,李一能,刘建业.视觉及其融合惯性的SLAM技术发展综述[J].南京航空航天大学学报,2022,54(6):1007-1020. 被引量：11
3王金科,左星星,赵祥瑞,吕佳俊,刘勇.多源融合SLAM的现状与挑战[J].中国图象图形学报,2022,27(2):368-389. 被引量：26
4刘浩敏,章国锋,鲍虎军.基于单目视觉的同时定位与地图构建方法综述[J].计算机辅助设计与图形学学报,2016,28(6):855-868. 被引量：168
5Jinyu LI,Bangbang YANG,Danpeng CHEN,Nan WANG,Guofeng ZHANG,Hujun BAO.Survey and evaluation of monocular visual-inertial SLAM algorithms for augmented reality[J].Virtual Reality & Intelligent Hardware,2019,1(4):386-410. 被引量：5

二级参考文献131

1盛超,潘树国,赵涛,曽攀,黄砺枭.基于图像语义分割的动态场景下的单目SLAM算法[J].测绘通报,2020(1):40-44. 被引量：5
2Smith R C, Cheeseman P. On the representation and estimationof spatial uncertainty[J]. International Journal of Robotics Research,1986, 5(4):56-68. 被引量：1
3Smith R, Self M, Cheeseman P. Estimating uncertain spatialrelationships in robotics[M] //Autonomous Robot Vehicles. NewYork: Springer, 1990: 167-193. 被引量：1
4Durrant-Whyte H, Bailey T. Simultaneous localization andmapping: Part I[J]. IEEE Robotics & Automation Magazine,2006, 13(2): 99-110. 被引量：1
5Bailey T, Durrant-Whyte H. Simultaneous localization andmapping(SLAM): Part II[J]. IEEE Robotics & AutomationMagazine, 2006, 13(3): 108-117. 被引量：1
6Hartley R, Zisserman A. Multiple view geometry in computervision[M]. Cambridge: Cambridge University Press, 2004. 被引量：1
7Aulinas J, Petillot Y R, Salvi J, et al. The SLAM problem: asurvey[J]. CCIA, 2008, 184(1): 363-371. 被引量：1
8Ros G, Sappa A, Ponsa D, et al. Visual SLAM for driverlesscars: a brief survey[C] //Proceedings of IEEE Workshop onNavigation, Perception, Accurate Positioning and Mapping forIntelligent Vehicles. Los Alamitos: IEEE Computer SocietyPress, 2012: Article No.3. 被引量：1
9Triggs B, Mclauchlan P F, Hartley R I, et al. Bundle adjustment -a modern synthesis[C] //Proceedings of International Workshopon Vision Algorithms: Theory and Practice. Heidelberg: Springer,1999: 298-372. 被引量：1
10Indelman V, Williams S, Kaess M, et al. Information fusion innavigation systems via factor graph based incremental smoothing[J]. Robotics and Autonomous Systems, 2013, 61(8): 721-738. 被引量：1

共引文献208

1蒋济州,徐文福,潘尔振.仿生扑翼飞行机器人自主导航系统研究进展[J].仪器仪表学报,2023,44(11):66-84. 被引量：1
2刘万元,何俐萍.基于YOLOv5神经网络检测模型的动态局部地图构建[J].机械设计,2023,40(S02):7-13. 被引量：1
3赵玉琛,叶海峰,林靖宇.结合DBSCAN与PTAM算法的室内家居无标记增强现实系统[J].计算机应用研究,2020,37(S02):302-304.
4危双丰,庞帆,刘振彬,师现杰.基于激光雷达的同时定位与地图构建方法综述[J].计算机应用研究,2020,37(2):327-332. 被引量：76
5王录涛,吴林峰.基于图优化的视觉SLAM研究进展与应用分析[J].计算机应用研究,2020,37(1):9-15. 被引量：6
6潘锡英,何元烈,孙盛,陈佳腾.基于图像感兴趣区域的机器人闭环检测算法[J].机器人,2019,41(5):676-682. 被引量：3
7朱锋,吴长水,茅健.基于ICP和NDT的激光点云匹配方法研究[J].智能计算机与应用,2022,12(4):140-145. 被引量：2
8陈文佑,章伟,胡陟,史晓帆.一种融合深度相机与激光雷达的室内移动机器人建图与导航方法[J].智能计算机与应用,2021,11(4):159-163. 被引量：5
9危双丰,李澔,刘光祖,刘畅畅,王尚兴.一种结合动态目标检测的视觉SLAM算法[J].测绘科学,2022,47(7):93-103. 被引量：1
10李歆,张国良,谢波.一种基于VINS的视觉里程计改进方法[J].国外电子测量技术,2023,42(1):20-27. 被引量：1

1付钰涓,杨悦,蒲馨怡,徐广宇.基于网络药理学和分子对接探讨马齿苋改善肝损伤的分子作用机制[J].吉林医药学院学报,2024,45(5):321-329.
2江跃龙,陈伟迅,孟思明,唐鹤芳,柯旭能.YOLOv8-LPRNet模型在智能交通中的车牌识别[J].中国科技信息,2024(21):115-117.
3吴晓雯,易雅琴,林东铨.多源数据自动构建三维地理场景技术[J].北京测绘,2024,38(9):1300-1305.
4高本锋,刘王锋,丁雨晴,吴林林,孙大卫,邓鹏程.基于惯性同步的构网型光伏并网系统次同步振荡特性分析[J].太阳能学报,2024,45(8):398-406. 被引量：1
5王红星,杨亚萍,王璟源,张勃阳.基于YOLOv5与多视图几何联合的动态V-SLAM[J].河南理工大学学报（自然科学版）,2024,43(6):129-138.
6蒋经纬,吉月辉,刘俊杰,高强.基于轻量级CNN的视觉SLAM快速回环检测算法[J].计算机仿真,2024,41(8):182-188.
7王文润,党建武,王阳萍,梁超.基于复杂背景的多尺度特征融合手-物交互检测方法[J].兰州交通大学学报,2024,43(5):94-102.
8雷富强,张博雅,张一帆,刘识灏.基于RetinaFace和ERT的眼部疲劳检测方法[J].计算机应用与软件,2024,41(10):227-232.
9赵雪维,董翠粉,王明远,许洋嘉.汽车电子技术应用与发展分析[J].汽车测试报告,2024(13):17-19.
10刘敬东,李旭,于凤启,苟丙荣,贺国庆,巩泽文.激光SLAM技术在巷道精细建模的应用研究[J].煤矿机械,2024,45(10):199-202.

中国图象图形学报

2024年第10期

浏览历史

内容加载中请稍等...

基于单目视觉惯性的同步定位与地图构建方法综述

参考文献5

二级参考文献131

共引文献208

相关作者

相关机构

相关主题

浏览历史