摘要
针对水下目标检测中的目标模糊和遮挡问题,提出基于YOLOX改进的RepViTS-YOLOX水下目标检测方法。该方法采用RepViTS作为特征提取网络并通过结构重参数化,有效提升了对水下目标的特征提取能力和模型推理速度。引入空间和通道重构(spatial and channel reconstruction convolution,SCConv)模块,增强网络对模糊目标的关注。改进特征融合网络,通过跨尺度连接和多尺度融合,加强不同层次特征间的信息交流,使模型更好理解遮挡目标特征。针对定位和分类任务对特征的不同需求,引入上下文解耦头(task-specific context decoupling head,TSCODE),对遮挡目标更精准地定位和分类。实验结果证明,RepViTS-YOLOX方法在RUOD数据集上取得了85.7%的检测效果,较YOLOX提高了3.8个百分点。检测结果显示,该方法可以有效改善水下模糊和遮挡目标的检测情况,提高水下目标检测精度。
The RepViTS-YOLOX method is proposed to address target blurring and occlusion in underwater target detection,improving upon the YOLOX framework.This approach employs RepViTS for feature extraction and incorporates structural reparameterization to enhance underwater target feature extraction and model inference speed.It features a spatial and channel reconstruction convolution(SCConv)module to better focus on blurred targets.The feature fusion network uses cross-scale and multi-scale fusion to better interpret occluded target characteristics.Additionally,a task-specific context decoupling head(TSCODE)is introduced for more accurate localization and classification of occluded targets.Experiments on the RUOD dataset show that RepViTS-YOLOX achieves 85.7%detection accuracy,surpassing YOLOX by 3.8 percentage points.These findings indicate that the method effectively improves the detection of blurred and occluded underwater targets,thus enhancing the precision of underwater target detection.
作者
陶洋
朱腾
钟邦乾
周昆
周立群
TAO Yang;ZHU Teng;ZHONG Bangqian;ZHOU Kun;ZHOU Liqun(School of Communications and Information Engineering,Chongqing University of Posts and Telecommunications,Chongqing 400065,China)
出处
《计算机工程与应用》
CSCD
北大核心
2024年第13期200-208,共9页
Computer Engineering and Applications
基金
国家重点研发计划(2019YFB2102001)。
关键词
YOLOX
目标检测
结构重参数化
解耦检测头
注意力机制
YOLOX
target detection
structural re-parameterization
decouple detection head
attention mechanism