Abstract
Most deep-learning-based stereo matching networks upsample the cost volume by linear interpolation, which cannot fully exploit the texture information of neighboring pixels. To address this, a lightweight adaptive upsampling module (LAUM) is proposed. LAUM first learns an adaptive weight window for each pixel position in the high-resolution output. It then upsamples the low-resolution input by nearest-neighbor interpolation and convolves the result with the learned weights at the corresponding position to obtain the high-resolution output value. LAUM has three properties: (1) a large receptive field, obtained from stacked dilated convolutions and multi-scale windows that improve each pixel's awareness of neighborhood texture; (2) light weight, adding little computation compared with linear interpolation; (3) generality, as it can be ported to existing networks to replace their interpolation step. Experiments on the SceneFlow and KITTI2015 datasets show that replacing the trilinear interpolation in PSMNet and AANet with LAUM reduces their errors by 26.4% and 10.3%, respectively, on SceneFlow and by 15.4% and 18.9%, respectively, on KITTI2015.
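The two steps described in the abstract (nearest-neighbor upsampling, then a per-pixel weighted window) can be illustrated with a minimal pure-Python sketch on a single 2D map. Everything here is an assumption made for illustration and is not the authors' implementation: the function names are hypothetical, the per-pixel window scores (`logits`) are passed in rather than learned by stacked dilated convolutions, the weights are normalized with a softmax, borders are handled by clamping, and only one window scale is used.

```python
import math


def nearest_upsample(low, scale):
    """Nearest-neighbor upsampling of a 2D list by an integer factor."""
    return [[low[y // scale][x // scale]
             for x in range(len(low[0]) * scale)]
            for y in range(len(low) * scale)]


def softmax(values):
    """Numerically stable softmax over a flat list of scores."""
    m = max(values)
    exps = [math.exp(v - m) for v in values]
    total = sum(exps)
    return [e / total for e in exps]


def adaptive_upsample(low, logits, scale, k=3):
    """Apply a per-pixel k x k weight window to the upsampled map.

    low:    H x W nested list (low-resolution input).
    logits: (H*scale) x (W*scale) x (k*k) nested list of window scores,
            one window per high-resolution pixel. In the paper these
            weights are learned; here they are simply given (assumption).
    """
    up = nearest_upsample(low, scale)
    height, width, r = len(up), len(up[0]), k // 2
    out = [[0.0] * width for _ in range(height)]
    for y in range(height):
        for x in range(width):
            weights = softmax(logits[y][x])  # assumed normalization
            acc = 0.0
            for i, w in enumerate(weights):
                dy, dx = i // k - r, i % k - r
                # Clamp coordinates at the border (assumed padding rule).
                yy = min(max(y + dy, 0), height - 1)
                xx = min(max(x + dx, 0), width - 1)
                acc += w * up[yy][xx]
            out[y][x] = acc
    return out


low = [[0.0, 1.0], [2.0, 3.0]]
uniform = [[[0.0] * 9 for _ in range(4)] for _ in range(4)]
smoothed = adaptive_upsample(low, uniform, scale=2)
```

With uniform scores, each output pixel is the mean of its 3 x 3 neighborhood in the nearest-upsampled map. With scores that strongly favor the window center, the result reduces to plain nearest-neighbor upsampling, which shows how the weights control how much neighborhood context each output pixel aggregates.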
Authors
宋嘉菲
张浩东
SONG Jiafei; ZHANG Haodong (Bionic Vision System Laboratory, Shanghai Institute of Microsystem and Information Technology, Chinese Academy of Sciences, Shanghai 200050, China; School of Information Science and Technology, ShanghaiTech University, Shanghai 201210, China; University of Chinese Academy of Sciences, Beijing 100049, China)
Source
《计算机工程与应用》
CSCD
PKU Core Journal (北大核心)
2022, No. 16, pp. 139-146 (8 pages)
Computer Engineering and Applications
Funding
Key Research Program of Frontier Sciences, Chinese Academy of Sciences (QYZDY-SSW-JSC034).
Keywords
deep learning
stereo matching
cost volume
upsampling
lightweight