期刊文献+

融合注意力和多尺度的优化立体匹配算法研究 被引量:3

Research on optimal stereo matching algorithm combining attention and multi-scale
下载PDF
导出
摘要 当前基于卷积神经网络的立体匹配方法未充分利用图像中各个层级的特征图信息,造成图像在不适定区域的特征提取能力较差,因此,提出了一种基于PSMNet改进的优化立体匹配算法。在特征提取阶段,全新的特征金字塔模块(SPP)能更好的聚合不同尺度和不同位置的环境信息构建代价体,从而充分利用全局环境信息;在构建匹配代价体时,提出组相关的策略来充分地利用特征中的全局和局部信息;在代价聚合阶段,优化沙漏结构并引入通道注意力机制以便网络来提取具有高表示能力和高质量通道注意力向量的信息特征;为了进一步优化视差图,设计视差优化网络来改善初始的视差估计。在Scene Flow、KITTI 2012和KITTI 2015立体数据集上评估,所提模型在Scene Flow数据集上平均预测误差EPE降低到0.71 pixels,在KITTI 2012和KITTI 2015立体数据集上的误匹配率分别下降到1.20%和1.86%,在实验结果表明,方法取得了较优越的性能。 This paper presents an improved stereo matching algorithm based on PSMNet.In the feature extraction stage,the new SPP feature pyramid module can better aggregate the environmental information of different scales and different locations to construct cost volume,in order to make full use of the global environmental information.When constructing the matching cost volume,the group correlation strategy is proposed to make full use of the global and local information in features.In the cost aggregation stage,the hourglass structure is optimized and the channel attention mechanism is introduced so that the network can extract the information features with high representation ability and high quality channel attention vector.In order to further optimize the disparity map,a disparity optimization network is designed to improve the initial disparity estimation.The method in this paper is evaluated on Scene Flow,KITTI 2012 and KITTI 2015 stereo datasets,and the average prediction error EPE of the proposed model on Scene Flow dataset is reduced to 0.71 pixels.The mismatching rates on KITTI 2012 and KITTI 2015 stereo datasets decreased to 1.20%and 1.86%,respectively.The experimental results show that the proposed method achieves superior performance.
作者 谢鑫 张博 张美灵 朱磊 Xie Xin;Zhang Bo;Zhang Meiling;Zhu Lei(School of Electronics and Information,Xi'an Polytechnic University,Xi'an 710048,China)
出处 《国外电子测量技术》 北大核心 2023年第1期89-99,共11页 Foreign Electronic Measurement Technology
基金 国家自然科学基金(61971339) 陕西省重点研发计划(2019GY-113) 陕西省自然科学基础研究计划(2019JQ-361)项目资助
关键词 立体匹配 深度学习 注意力机制 卷积神经网络 分组相关量 视差优化 stereo matching deep learning attention mechanism convolution neural network group correlation quantity parallax optimization
  • 相关文献

参考文献7

二级参考文献73

  • 1王大海,梁宏光,邱娜,徐世录.红外探测技术的应用与分析[J].红外与激光工程,2007,36(z2):107-112. 被引量:7
  • 2王向军,郭文佳,韩双来,彭明,魏旭宾.基于计算机视觉的弹着点坐标远程测量系统[J].红外与激光工程,2006,35(5):624-628. 被引量:8
  • 3Scharstein D, Szeliski R. A Taxonomy and Evalua tion of Dense Two-frame Stereo Correspondence A1 gorithms[J]. International Journal of Computer Vi sion, 2001, 47(1/2/3):7-42. 被引量:1
  • 4Brown M Z, Burschka D, Hager G D. Advances in Computational Stereo [J]. IEEE Trans Pattern Anal. Mach Intell, 2003, 25(8): 993-1 008. 被引量:1
  • 5Kande T, Okutomi M. A Stereo Matching Algo- rithm with an Adaptive Window, Theroy and Ex- periment[J]. IEEE Transaction on Pattern Analysisand Machine Itelligence, 1994,16(9) : 920-932. 被引量:1
  • 6Hong L, Chen G. Segment-based Stereo Matching Using Graph Cut[C]. Coference on Computer Vi- sion and Pattern Recognition, Washington, D C, 2004. 被引量:1
  • 7Boykov Y, Veksler O, Zabih R. Fast Approximate Energy Minimization Vis Graph Cuts[R]. IEEE Conference on Computer Vision and Pattern Recog- nition, Tech Rep, 2003. 被引量:1
  • 8Sun J, Shum Y H, Zheng N N. Stereo Matching Using Belief Propagation[J]. IEEE Transaction on Pattern Ana[ysls and Maching Intelligence, 2003,25 (7) :787-800. 被引量:1
  • 9Bobick A F, Intile S S. Large Occlusion Stereo[J]. International Journal of Computer Vision, 1999,33 (3) : 181-200. 被引量:1
  • 10Abdollahifard M, Faez K, Pourfard M. Fast Stereo Matching Using Two Stage Color-based Segmenta- tion And Dynamic Programming[C]. The 6th Inter- national Symposium on Mechatronics and Its Appli- cation, Tehran, Iran, 2009. 被引量:1

共引文献94

同被引文献12

引证文献3

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部