基于金字塔型残差神经网络的红外图像深度估计被引量：5

Depth Estimation of Infrared Image Based on Pyramid Residual Neural Networks

下载PDF

导出

摘要对车载红外图像进行深度估计,可应用于车辆的夜间辅助驾驶系统(Driver Assistant Systems,DAS),本文提出了一种新型的神经网络结构来估计红外图像的深度。受景物分类思想的启发,将传统深度估计方法中的回归问题转化为分类问题。首先,对红外图像进行归一化预处理,并将深度图置于自然对数空间对距离进行远近分类。其次,设计了一种新型的金字塔输入残差神经网络(Pyramid Residual Neural Networks,PRN),将红外图像以金字塔型结构作为网络输入,网络结构分为粗略特征提取和精细特征提取两部分。最后,将全连接层改为全卷积层,大大减少了网络中的参数个数,降低计算复杂度。金字塔型结构的输入使得网络能够多尺度提取特征,这使得估计出的深度图场景中的对象轮廓比同一网络单一红外图像输入估计出的景物轮廓更清晰。此外,通过计算错误和准确性评价指标,证明本文的提出方法能够很好地估计红外图像的深度,对比实验验证了本文方法更具优势。 Depth estimation of vehicle infrared images can be applied to a vehicle＇s night-assisted driving system（driver assistant system, DAS）. This paper presents a novel type of neural network structure to estimate the depth of infrared images. Inspired by the idea of classification of scenes, the regression problem proposed in the traditional depth estimation of images is transformed into the classification problem in this study. Firstly, the normalization of the infrared image is carried out, and the depth map is placed in a natural logarithmic space to classify the distance. Secondly, a new pyramid residual neural network（pyramid residual neural network, PRN） is designed, which uses the pyramid structure as the network input, and the network structure is divided into coarse and refined feature extractions. Fully connected layers are converted to fully convolutional layers, which greatly reduces the number of parameters in the network and the computational complexity compared to fully connected networks. The input of the pyramid structure allows the networks to extract features at multiple scales. This makes the contours of the objects in the depth map scene clearer than in the same network without a pyramid input structure. In addition, by calculating the error and accuracy evaluation index, it is proved that the method proposed in this paper can estimate the depth of the infrared images well. Moreover, the comparison experiments prove that the proposed method is more advantageous.

作者顾婷婷赵海涛孙韶媛 GU Tingting;ZHAO Haitao;SUN Shaoyuan(School of Information Science and Engineering, East China University of Science and Technology, Shanghai 200237, China;School of Information Science and Technology, Donghua University, Shanghai 201620, China)

机构地区华东理工大学信息科学与工程学院东华大学信息科学与技术学院

出处《红外技术》 CSCD 北大核心 2018年第5期417-423,共7页 Infrared Technology

基金国家自然科学基金(61375007) 上海市科委基础研究项目(15JC1400600)

关键词深度估计车载红外图像金字塔型输入残差网络多尺度特征 depth estimation vehicle infrared images pyramid input residual networks multi-scale features

分类号 TP391.9 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献1

1许路,赵海涛,孙韶媛.基于深层卷积神经网络的单目红外图像深度估计[J].光学学报,2016,36(7):188-197. 被引量：26

二级参考文献21

1Saxena A, Chung S H, Ng A Y. 3-D depth reconstruction from a single still image[J]. International Journal of Computer Vision, 2008, 76(1): 53-69. 被引量：1
2Horn B K P. Obtaining shape from shading information[M]. New York: MIT Press, 1989: 123-171. 被引量：1
3Saxena A, Chung S H, Ng A Y. Learning depth from single monocular images [C]. Advances in Neural Information Processing Systems, 2005: 1161-1168. 被引量：1
4Saxena A, Sun M, Ng A Y. Make 3D: Learning 3D scene structure from a single still image[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2009, 31(5): 824-840. 被引量：1
5Saxena A, Schulte J, Ng A Y. Depth estimation using monocular and stereo cues [C] . International Joint Conference on Artificial Intelligence, 2007: 2197-2203. 被引量：1
6Krizhevsky A, Sutskever I, Hinton G E. ImageNet classification with deep convolutional neural networks[C] Advances in Neural Information Processing Systems, 2012 : 1106-1114. 被引量：1
7Karpathy A, Toderici G, Shetty S, et ai. Large-scale video classification with convolutional neural networks[C] . IEEE Conference on Computer Vision and Pattern Recognition, 2014: 1725-1732. 被引量：1
8Liang M, Hu X. Recurrent convolutional neural network for object recognition[C]. IEEE Conference on Computer Vision and Pattern Recognition, 2015: 3367-3375. 被引量：1
9Lee S C, Nevatia R. Extraction and integration of window in a 3D building model from ground view images [C]. IEEE Computer Conference on Computer Vision and Pattern Recognition, 2004: 113-120. 被引量：1
10Liu L, Yu G, Zokai S, et al. Multiview geometry for texture mapping 2D images onto 3D range data [C]. IEEE Conference on Computer Vision and Pattern Recognition, 2006, 2: 2293-2300. 被引量：1

共引文献25

1何建争,简慧杰,马孟超,王克逸.基于虚拟双球面的仿生复眼系统标定[J].光学学报,2017,37(7):220-230. 被引量：3
2叶国林,孙韶媛,高凯珺,赵海涛.基于加速区域卷积神经网络的夜间行人检测研究[J].激光与光电子学进展,2017,54(8):117-123. 被引量：25
3高琳,王俊峰,范勇,陈念年.基于卷积神经网络与一致性预测器的稳健视觉跟踪[J].光学学报,2017,37(8):222-231. 被引量：8
4姚广顺,孙韶媛,方建安,赵海涛.基于红外与雷达的夜间无人车场景深度估计[J].激光与光电子学进展,2017,54(12):158-164. 被引量：9
5吴寿川,赵海涛,孙韶媛.基于双向递归卷积神经网络的单目红外视频深度估计[J].光学学报,2017,37(12):246-254. 被引量：11
6侯聪聪,何宇清,姜晓恒,潘静.基于二分支卷积单元的深度卷积神经网络[J].激光与光电子学进展,2018,55(2):186-192. 被引量：4
7吴桐,陈平.基于X射线的复杂结构件内部零件装配正确性检测[J].激光与光电子学进展,2018,55(4):168-176. 被引量：5
8鲍振强,李艾华,崔智高,袁梦.深度学习在视觉定位与三维结构恢复中的研究进展[J].激光与光电子学进展,2018,55(5):62-70. 被引量：2
9顾婷婷,赵海涛,孙韶媛.基于帧间信息提取的单幅红外图像深度估计[J].激光与光电子学进展,2018,55(6):163-172. 被引量：8
10安喆,徐熙平,杨进华,乔杨,刘洋.结合图像语义分割的增强现实型平视显示系统设计与研究[J].光学学报,2018,38(7):77-83. 被引量：21

同被引文献38

1张蓓蕾,孙韶媛,武江伟,谷小婧.基于DRF-MAP模型的单目图像深度估计的改进算法[J].红外技术,2009,31(12):712-715. 被引量：3
2席林,孙韶媛,李琳娜,邹芳喻.基于SVM模型的单目红外图像深度估计[J].激光与红外,2012,42(11):1311-1315. 被引量：12
3郭连朋,陈向宁,刘彬,刘田间.基于Kinect传感器多深度图像融合的物体三维重建[J].应用光学,2014,35(5):811-816. 被引量：20
4杨新锋,胡旭诺,粘永健.基于分类的高光谱图像压缩算法（英文）[J].红外与激光工程,2016,45(2):263-266. 被引量：6
5许路,赵海涛,孙韶媛.基于深层卷积神经网络的单目红外图像深度估计[J].光学学报,2016,36(7):188-197. 被引量：26
6胡晓芳.一种云平台下高识别率的手写汉字光学图像识别系统[J].量子电子学报,2016,33(5):530-536. 被引量：3
7黄风山,秦亚敏,任玉松.成捆圆钢机器人贴标系统图像识别方法[J].光电工程,2016,43(12):168-174. 被引量：13
8徐伟,陈彦彤,朴永杰,王绍举.基于吉林一号遥感图像的星载目标快速识别系统[J].光学精密工程,2017,25(1):255-262. 被引量：17
9曹永峰,赵燕君.基于GA-BP神经网络的计算机智能化图像识别技术探究[J].应用激光,2017,37(1):139-143. 被引量：25
10傅志中,王雪,李晓峰,徐进.基于视觉显著性和NSCT的红外与可见光图像融合[J].电子科技大学学报,2017,46(2):357-362. 被引量：37

引证文献5

1袁浩,毛颖颖.基于残差神经网络的不均衡纹理图像分类方法研究[J].新一代信息技术,2019,2(16):89-93.
2赵栓峰,黄涛,许倩,耿龙龙.面向无人机自主飞行的无监督单目视觉深度估计[J].激光与光电子学进展,2020,57(2):137-146. 被引量：7
3陈裕如,赵海涛.基于自适应像素级注意力模型的场景深度估计[J].应用光学,2020,41(3):490-499. 被引量：4
4王倩倩,赵海涛.基于深度CRF网络的单目红外场景深度估计[J].红外技术,2020,42(6):580-588. 被引量：2
5张源峰,程恩.光学图像信息多标记特征分层识别系统设计[J].激光杂志,2020,41(7):209-212. 被引量：1

二级引证文献11

1朱思敏,赵海涛.基于注意力机制与图卷积神经网络的单目红外图像深度估计[J].应用光学,2021,42(1):49-56. 被引量：2
2方琪,王晓华,苏杰.基于分组策略的点线特征融合同步定位与地图构建算法[J].激光与光电子学进展,2021,58(14):397-405. 被引量：10
3李航宇,黄翔,褚文敏,周蒯,赵子越.一种面向齿形结构装配的视觉测量方法[J].激光与光电子学进展,2021,58(16):172-181. 被引量：1
4江俊君,李震宇,刘贤明.基于深度学习的单目深度估计方法综述[J].计算机学报,2022,45(6):1276-1307. 被引量：18
5白琳,刘林军,李轩昂,吴沙,刘汝庆.基于自监督学习的单目图像深度估计算法[J].吉林大学学报（工学版）,2023,53(4):1139-1145.
6熊强强,赵旭.一种基于激光传感器的双目无人机室外场景视觉深度估计方法[J].应用激光,2023,43(5):94-98. 被引量：1
7周泓智.三维动画视频关键帧图像信息提取方法研究[J].贵阳学院学报（自然科学版）,2023,18(2):101-105. 被引量：1
8李恩华,闫梦若,张佃君.基于改进GhostNet模型的快速单目图像深度估计[J].信息记录材料,2023,24(6):137-140.
9裴慧华,韦小铃,甘运刚.有限角度下的实验室隐蔽固件非视域激光检测方法[J].激光杂志,2023,44(9):209-214.
10孟祥瑞,李成良,文继权.基于局部梯度的红外线列扫描图像小目标检测[J].激光杂志,2023,44(10):52-56. 被引量：1

1林春.3D Steerable Pyramid分解域地震资料随机噪声衰减[J].地球物理学进展,2018,33(3):1081-1087. 被引量：2
2俞庆华.罗德与施瓦茨展示为车载雷达回波生成和雷达罩测量提供的全新解决方案[J].汽车零部件,2017(10):79-79.
3Susana Olmedillas-López,Dennis César Lévano-Linares,Carmen Laura Aúz Alexandre,Luz Vega-Clemente,Edurne León Sánchez,Alejandro Villagrasa,Jaime Ruíz-Tovar,Mariano García-Arranz,Damián García-Olmo.Detection of KRAS G12D in colorectal cancer stool by droplet digital PCR[J].World Journal of Gastroenterology,2017,23(39):7087-7097. 被引量：1
4Zhi-Jun Zhu,Lin Wei,Wei Qu,Li-Ying Sun,Ying Liu,Zhi-Gui Zeng,Liang Zhang,En-Hui He,Hai-Ming Zhang,Ji-Dong Jia,Zhong-Tao Zhang.First case of cross-auxiliary double domino donor liver transplantation[J].World Journal of Gastroenterology,2017,23(44):7939-7944. 被引量：7
5Vivian Wu,René F. M. van Oers,Engelbert A. J. M. Schulten,Marco No Helder,Rommel G. Bacabac,Jenneke Klein-Nulend.Osteocyte morphology and orientation in relation to strain in the jaw bone[J].International Journal of Oral Science,2018,10(1):36-43.

红外技术

2018年第5期

浏览历史

内容加载中请稍等...

基于金字塔型残差神经网络的红外图像深度估计被引量：5

参考文献1

二级参考文献21

共引文献25

同被引文献38

引证文献5

二级引证文献11

相关作者

相关机构

相关主题

浏览历史

基于金字塔型残差神经网络的红外图像深度估计 被引量：5

参考文献1

二级参考文献21

共引文献25

同被引文献38

引证文献5

二级引证文献11

相关作者

相关机构

相关主题

浏览历史

基于金字塔型残差神经网络的红外图像深度估计被引量：5