Abstract
The rapid movement and frequent occlusion of athletes during a competition make athlete detection in video prone to missed detections, spurious detections, and reduced detection accuracy. Current mainstream detection methods perform poorly when athletes are moving or occluded, and the scale variation of the detection bounding box increases once an athlete is occluded. In this study, cutout is introduced as a data augmentation method to simulate occlusion, and an athlete detection algorithm based on a multi-scale linear global attention EfficientViT module is proposed. Specifically, the linear global attention module is used to reduce the amount of computation and is supplemented with a convolution module to enhance its local feature extraction capability. The tokens of different attention heads are aggregated through lightweight small convolutions to obtain multi-scale information and enhance the global feature extraction capability. For the loss function, EIoU is selected as the bounding-box loss; it adds the width and height distances between the predicted box and the target box, so that the detected and ground-truth boxes match more closely in scale. Experimental results on four publicly available basketball game video datasets from the SportsMOT dataset show that the proposed algorithm achieves a precision of 98.0% and a mean Average Precision (mAP) of 98.2%; its precision and high-confidence mAP are 4% and 8.7% higher, respectively, than those of the original YOLOv5 algorithm.
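As a reading aid for the cutout augmentation mentioned above, the sketch below masks out random square patches so that training images contain artificial occlusions. This is a minimal illustration under assumed parameters (function name, hole count, patch size, NumPy arrays), not the authors' implementation.

import numpy as np

def cutout(image, n_holes=1, length=64, fill_value=0):
    # Mask out `n_holes` square patches of side `length` to simulate occlusion.
    # Illustrative sketch; parameter values are assumptions.
    img = image.copy()
    h, w = img.shape[:2]
    for _ in range(n_holes):
        # Sample a random patch centre and clip the square to the image borders.
        cy, cx = np.random.randint(h), np.random.randint(w)
        y1, y2 = max(cy - length // 2, 0), min(cy + length // 2, h)
        x1, x2 = max(cx - length // 2, 0), min(cx + length // 2, w)
        img[y1:y2, x1:x2] = fill_value
    return img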
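The linear global attention in EfficientViT replaces the softmax with a ReLU feature map, so keys and values can be contracted first and the cost grows linearly with the number of tokens; the multi-scale aggregation of head tokens via small convolutions is omitted here for brevity. The following single-head sketch assumes PyTorch tensors of shape (batch, tokens, dim) and is illustrative rather than the paper's module.

import torch

def relu_linear_attention(q, k, v, eps=1e-6):
    # Linear global attention with a ReLU kernel (single head, sketch).
    # q, k, v: tensors of shape (batch, tokens, dim).
    q, k = torch.relu(q), torch.relu(k)
    # Contract keys with values once: (batch, dim, dim), independent of token count.
    kv = torch.einsum('bnd,bne->bde', k, v)
    # Normaliser: each query against the summed keys.
    z = torch.einsum('bnd,bd->bn', q, k.sum(dim=1)) + eps
    # Apply queries to the aggregated key-value matrix.
    return torch.einsum('bnd,bde->bne', q, kv) / z.unsqueeze(-1)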
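The EIoU loss mentioned above extends 1 - IoU with a centre-distance penalty plus separate width and height penalties, each normalised by the smallest enclosing box, which is what pulls the predicted box towards the ground-truth box in scale. The sketch below assumes PyTorch box tensors in (x1, y1, x2, y2) format and is not the paper's training code.

import torch

def eiou_loss(pred, target, eps=1e-7):
    # EIoU loss for box tensors of shape (N, 4) in (x1, y1, x2, y2) format (sketch).
    # IoU term.
    ix1 = torch.max(pred[:, 0], target[:, 0])
    iy1 = torch.max(pred[:, 1], target[:, 1])
    ix2 = torch.min(pred[:, 2], target[:, 2])
    iy2 = torch.min(pred[:, 3], target[:, 3])
    inter = (ix2 - ix1).clamp(0) * (iy2 - iy1).clamp(0)
    w_p, h_p = pred[:, 2] - pred[:, 0], pred[:, 3] - pred[:, 1]
    w_t, h_t = target[:, 2] - target[:, 0], target[:, 3] - target[:, 1]
    iou = inter / (w_p * h_p + w_t * h_t - inter + eps)
    # Smallest enclosing box and its squared diagonal.
    cw = torch.max(pred[:, 2], target[:, 2]) - torch.min(pred[:, 0], target[:, 0])
    ch = torch.max(pred[:, 3], target[:, 3]) - torch.min(pred[:, 1], target[:, 1])
    c2 = cw ** 2 + ch ** 2 + eps
    # Squared distance between box centres.
    rho2 = ((pred[:, 0] + pred[:, 2] - target[:, 0] - target[:, 2]) ** 2
            + (pred[:, 1] + pred[:, 3] - target[:, 1] - target[:, 3]) ** 2) / 4
    # Width and height penalties normalised by the enclosing box dimensions.
    return 1 - iou + rho2 / c2 + (w_p - w_t) ** 2 / (cw ** 2 + eps) + (h_p - h_t) ** 2 / (ch ** 2 + eps)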
Authors
LIN Zhiwei, YANG Zuyuan, WANG Siqiu, YANG Chao
(Guangdong Key Laboratory of IoT Information Technology, School of Automation, Guangdong University of Technology, Guangzhou 510006, Guangdong, China)
Source
Computer Engineering (《计算机工程》)
Indexed in: CAS; CSCD; Peking University Core Journals (北大核心)
2024, No. 7, pp. 352-359 (8 pages)
Funding
National Natural Science Foundation of China (U1911401)
Guangdong Basic and Applied Basic Research Foundation, Joint Fund General Program (2022A1515010688).