
Lightweight Semantic Segmentation by Combining Multi-Link Feature Codec with Wavelet Pooling
Abstract: Semantic segmentation is one of the fundamental techniques in scene understanding. Existing semantic segmentation networks typically suffer from complex structures, large parameter counts, excessive loss of image feature information, and low computational efficiency. To address these problems, this work proposes MLWP-Net (Multi-Link Wavelet-Pooled Network), a lightweight semantic segmentation network that combines multi-link feature encoding and decoding with wavelet pooling, built on the encoder-decoder framework and the Discrete Wavelet Transform (DWT). In the encoding stage, a lightweight feature extraction bottleneck is designed using a multi-link strategy together with depthwise separable convolution, dilated convolution, and channel compression; in addition, a low-frequency-mixed wavelet pooling operation replaces the traditional downsampling operation, effectively reducing information loss during encoding. In the decoding stage, a multi-branch parallel dilated convolutional decoder fuses multi-level features from different encoder layers and recovers the image resolution in parallel. Experimental results show that MLWP-Net achieves 74.1% and 68.2% mIoU on the Cityscapes and CamVid datasets, respectively, with only 0.74M parameters, demonstrating its effectiveness for semantic segmentation.
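The abstract does not define the low-frequency-mixed wavelet pooling operation, so the following is only a minimal sketch of the general idea behind wavelet pooling, not the authors' method. It assumes a single-level orthonormal 2D Haar transform and keeps only the low-frequency (LL) subband as the downsampled output. The function names `haar_dwt2` and `wavelet_pool` are hypothetical, and plain Python lists are used to keep the example dependency-free.

```python
def haar_dwt2(x):
    """Single-level orthonormal 2D Haar DWT of a 2D list with even height and width.

    Returns four half-resolution subbands (LL, LH, HL, HH).
    Subband naming conventions vary between libraries; here LH holds
    vertical differences and HL holds horizontal differences.
    """
    h, w = len(x), len(x[0])
    if h % 2 or w % 2:
        raise ValueError("height and width must be even")
    ll, lh, hl, hh = [], [], [], []
    for i in range(0, h, 2):
        row_ll, row_lh, row_hl, row_hh = [], [], [], []
        for j in range(0, w, 2):
            # Each non-overlapping 2x2 block [[a, b], [c, d]] yields one
            # coefficient per subband.
            a, b = x[i][j], x[i][j + 1]
            c, d = x[i + 1][j], x[i + 1][j + 1]
            row_ll.append((a + b + c + d) / 2)  # low-pass in both directions
            row_lh.append((a + b - c - d) / 2)  # top row minus bottom row
            row_hl.append((a - b + c - d) / 2)  # left column minus right column
            row_hh.append((a - b - c + d) / 2)  # diagonal difference
        ll.append(row_ll)
        lh.append(row_lh)
        hl.append(row_hl)
        hh.append(row_hh)
    return ll, lh, hl, hh


def wavelet_pool(x):
    """Downsample by 2 in each dimension, keeping only the LL subband."""
    return haar_dwt2(x)[0]


feature_map = [
    [1, 2, 3, 4],
    [5, 6, 7, 8],
    [9, 10, 11, 12],
    [13, 14, 15, 16],
]
pooled = wavelet_pool(feature_map)
print(pooled)  # [[7.0, 11.0], [23.0, 27.0]]
```

Unlike max pooling, the transform is orthonormal: the four subbands together preserve the energy of the input, so the detail discarded by keeping only LL is still available in LH, HL, and HH if a network wants to reuse it.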
Authors: YI Qingming (易清明); WANG Yu (王渝); SHI Min (石敏); LUO Aiwen (骆爱文) (School of Information Science and Technology, Jinan University, Guangzhou 510632, China; Taidou Microelectronic Science and Technology Co., Ltd., Guangzhou 510663, China)
Source: Journal of University of Electronic Science and Technology of China (《电子科技大学学报》), 2024, No. 3, pp. 366-375 (10 pages). Indexed in EI, CAS, CSCD, and the Peking University Core Journals list.
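Funding: National Natural Science Foundation of China (62002134); Guangdong Basic and Applied Basic Research Foundation (2020A1515110645, 2023A1515010834); Key Laboratory of New Semiconductors and Devices of Guangdong Higher Education Institutes Project (2021KSY001); Yangcheng Innovation and Entrepreneurship Leading Talent Support Program (2019019); Guangdong Science and Technology Innovation Strategy Special Fund (University Student Science and Technology Innovation Cultivation) (pdjh2023b0061).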
Keywords: real-time semantic segmentation; lightweight neural network; multi-link feature fusion; wavelet pooling; multi-branch dilated convolution