This study explores the challenges posed by pedestrian detection and occlusion in AR applications, employing a novel approach that utilizes RGB-D-based skeleton reconstruction to reduce the overhead of classical pedes...This study explores the challenges posed by pedestrian detection and occlusion in AR applications, employing a novel approach that utilizes RGB-D-based skeleton reconstruction to reduce the overhead of classical pedestrian detection algorithms during training. Furthermore, it is dedicated to addressing occlusion issues in pedestrian detection by using Azure Kinect for body tracking and integrating a robust occlusion management algorithm, significantly enhancing detection efficiency. In experiments, an average latency of 204 milliseconds was measured, and the detection accuracy reached an outstanding level of 97%. Additionally, this approach has been successfully applied in creating a simple yet captivating augmented reality game, demonstrating the practical application of the algorithm.展开更多
针对目标检测算法对小目标行人识别率低、对监控远处视野目标检测精度不理想的问题,提出了改进YOLOv5高效多尺度特征利用的行人检测算法。首先,通过在原网络中改进高效的特征融合结构,提高模型对深层特征的感知力来提高模型精度;其次,采...针对目标检测算法对小目标行人识别率低、对监控远处视野目标检测精度不理想的问题,提出了改进YOLOv5高效多尺度特征利用的行人检测算法。首先,通过在原网络中改进高效的特征融合结构,提高模型对深层特征的感知力来提高模型精度;其次,采用Res2Net Block重构骨干网络,加强对细粒度特征信息的利用;最后,加入改进的空间金字塔注意力池化网络,强化模型的多层次特征表达能力。在CrowdHuman数据集进行训练和验证,YOLOv5-SA的平均检测精度达到了85.6%,相比原算法提高了3.8%,检测速度可以达到51 FPS(frames per second),识别精度和检测速度均具有较好的效果,可以有效应用于密集目标行人检测任务。展开更多
文摘This study explores the challenges posed by pedestrian detection and occlusion in AR applications, employing a novel approach that utilizes RGB-D-based skeleton reconstruction to reduce the overhead of classical pedestrian detection algorithms during training. Furthermore, it is dedicated to addressing occlusion issues in pedestrian detection by using Azure Kinect for body tracking and integrating a robust occlusion management algorithm, significantly enhancing detection efficiency. In experiments, an average latency of 204 milliseconds was measured, and the detection accuracy reached an outstanding level of 97%. Additionally, this approach has been successfully applied in creating a simple yet captivating augmented reality game, demonstrating the practical application of the algorithm.
文摘针对目标检测算法对小目标行人识别率低、对监控远处视野目标检测精度不理想的问题,提出了改进YOLOv5高效多尺度特征利用的行人检测算法。首先,通过在原网络中改进高效的特征融合结构,提高模型对深层特征的感知力来提高模型精度;其次,采用Res2Net Block重构骨干网络,加强对细粒度特征信息的利用;最后,加入改进的空间金字塔注意力池化网络,强化模型的多层次特征表达能力。在CrowdHuman数据集进行训练和验证,YOLOv5-SA的平均检测精度达到了85.6%,相比原算法提高了3.8%,检测速度可以达到51 FPS(frames per second),识别精度和检测速度均具有较好的效果,可以有效应用于密集目标行人检测任务。