针对人脸检测中小尺度人脸和遮挡人脸的漏检问题,提出了一种基于改进YOLOv5s-face(you only look once version 5 small-face)的Face5系列人脸检测算法Face5S(face5 small)和Face5M(face5 medium)。使用马赛克(mosaic)和图像混合(mixup...针对人脸检测中小尺度人脸和遮挡人脸的漏检问题,提出了一种基于改进YOLOv5s-face(you only look once version 5 small-face)的Face5系列人脸检测算法Face5S(face5 small)和Face5M(face5 medium)。使用马赛克(mosaic)和图像混合(mixup)数据增强方法,提升算法在复杂场景下检测人脸的泛化性和稳定性;通过改进C3的网络结构和引入可变形卷积(DCNv2)降低算法的参数量,提高算法提取特征的灵活性;通过引入特征的内容感知重组上采样算子(CARAFE),提高多尺度人脸的检测性能;引入损失函数WIoUV3(wise intersection over union version 3),提升算法的小尺度人脸检测性能。实验结果表明,在WIDER FACE验证集上,相较于YOLOv5s-face算法,Face5S算法的平均mAP@0.5提升了1.03%;相较于先进的人脸检测算法ASFD-D3(automatic and scalable face detector-D3)和TinaFace,Face5M算法的平均mAP@0.5分别提升了1.07%和2.11%,提出的Face5系列算法能够有效提升算法对小尺度和部分遮挡人脸的检测性能,同时具有实时性。展开更多
In order to classify nonlinear features with a linear classifier and improve the classification accuracy, a deep learning network named kernel principal component analysis network( KPCANet) is proposed. First, the d...In order to classify nonlinear features with a linear classifier and improve the classification accuracy, a deep learning network named kernel principal component analysis network( KPCANet) is proposed. First, the data is mapped into a higher-dimensional space with kernel principal component analysis to make the data linearly separable. Then a two-layer KPCANet is built to obtain the principal components of the image. Finally, the principal components are classified with a linear classifier. Experimental results showthat the proposed KPCANet is effective in face recognition, object recognition and handwritten digit recognition. It also outperforms principal component analysis network( PCANet) generally. Besides, KPCANet is invariant to illumination and stable to occlusion and slight deformation.展开更多
Effective strategies to control COVID-19 pandemic need high attention to mitigate negatively impacted communal health and global economy,with the brim-full horizon yet to unfold.In the absence of effective antiviral a...Effective strategies to control COVID-19 pandemic need high attention to mitigate negatively impacted communal health and global economy,with the brim-full horizon yet to unfold.In the absence of effective antiviral and limited medical resources,many measures are recommended by WHO to control the infection rate and avoid exhausting the limited medical resources.Wearing mask is among the non-pharmaceutical intervention measures that can be used as barrier to primary route of SARS-CoV2 droplets expelled by presymptomatic or asymptomatic individuals.Regardless of discourse on medical resources and diversities in masks,all countries are mandating coverings over nose and mouth in public areas.Towards contribution of public health,the aim of the paper is to devise a real-time technique that can efficiently detect non mask faces in public and thus enforce to wear mask.The proposed technique is ensemble of one stage and two stage detectors to achieve low inference time and high accuracy.We took ResNet50 as a baseline model and applied the concept of transfer learning to fuse high level semantic information in multiple feature maps.In addition,we also propose a bounding box transformation to improve localization performance during mask detection.The experiments are conducted with three popular baseline models namely ResNet50,AlexNet and MobileNet.We explored the possibility of these models to plug-in with the proposed model,so that highly accurate results can be achieved in less inference time.It is observed that the proposed technique can achieve high accuracy(98.2%)when implemented with ResNet50.Besides,the proposed model can generate 11.07%and 6.44%higher precision and recall respectively in mask detection when compared to RetinaFaceMask detector.展开更多
文摘针对人脸检测中小尺度人脸和遮挡人脸的漏检问题,提出了一种基于改进YOLOv5s-face(you only look once version 5 small-face)的Face5系列人脸检测算法Face5S(face5 small)和Face5M(face5 medium)。使用马赛克(mosaic)和图像混合(mixup)数据增强方法,提升算法在复杂场景下检测人脸的泛化性和稳定性;通过改进C3的网络结构和引入可变形卷积(DCNv2)降低算法的参数量,提高算法提取特征的灵活性;通过引入特征的内容感知重组上采样算子(CARAFE),提高多尺度人脸的检测性能;引入损失函数WIoUV3(wise intersection over union version 3),提升算法的小尺度人脸检测性能。实验结果表明,在WIDER FACE验证集上,相较于YOLOv5s-face算法,Face5S算法的平均mAP@0.5提升了1.03%;相较于先进的人脸检测算法ASFD-D3(automatic and scalable face detector-D3)和TinaFace,Face5M算法的平均mAP@0.5分别提升了1.07%和2.11%,提出的Face5系列算法能够有效提升算法对小尺度和部分遮挡人脸的检测性能,同时具有实时性。
基金The National Natural Science Foundation of China(No.6120134461271312+7 种基金6140108511301074)the Research Fund for the Doctoral Program of Higher Education(No.20120092120036)the Program for Special Talents in Six Fields of Jiangsu Province(No.DZXX-031)Industry-University-Research Cooperation Project of Jiangsu Province(No.BY2014127-11)"333"Project(No.BRA2015288)High-End Foreign Experts Recruitment Program(No.GDT20153200043)Open Fund of Jiangsu Engineering Center of Network Monitoring(No.KJR1404)
文摘In order to classify nonlinear features with a linear classifier and improve the classification accuracy, a deep learning network named kernel principal component analysis network( KPCANet) is proposed. First, the data is mapped into a higher-dimensional space with kernel principal component analysis to make the data linearly separable. Then a two-layer KPCANet is built to obtain the principal components of the image. Finally, the principal components are classified with a linear classifier. Experimental results showthat the proposed KPCANet is effective in face recognition, object recognition and handwritten digit recognition. It also outperforms principal component analysis network( PCANet) generally. Besides, KPCANet is invariant to illumination and stable to occlusion and slight deformation.
文摘Effective strategies to control COVID-19 pandemic need high attention to mitigate negatively impacted communal health and global economy,with the brim-full horizon yet to unfold.In the absence of effective antiviral and limited medical resources,many measures are recommended by WHO to control the infection rate and avoid exhausting the limited medical resources.Wearing mask is among the non-pharmaceutical intervention measures that can be used as barrier to primary route of SARS-CoV2 droplets expelled by presymptomatic or asymptomatic individuals.Regardless of discourse on medical resources and diversities in masks,all countries are mandating coverings over nose and mouth in public areas.Towards contribution of public health,the aim of the paper is to devise a real-time technique that can efficiently detect non mask faces in public and thus enforce to wear mask.The proposed technique is ensemble of one stage and two stage detectors to achieve low inference time and high accuracy.We took ResNet50 as a baseline model and applied the concept of transfer learning to fuse high level semantic information in multiple feature maps.In addition,we also propose a bounding box transformation to improve localization performance during mask detection.The experiments are conducted with three popular baseline models namely ResNet50,AlexNet and MobileNet.We explored the possibility of these models to plug-in with the proposed model,so that highly accurate results can be achieved in less inference time.It is observed that the proposed technique can achieve high accuracy(98.2%)when implemented with ResNet50.Besides,the proposed model can generate 11.07%and 6.44%higher precision and recall respectively in mask detection when compared to RetinaFaceMask detector.