由于背景环境复杂,检测物体易受部分遮挡、天气以及光线变化等因素的影响,传统目标检测方法存在提取特征难、检测准确率低、检测耗时长等缺陷.为了改善传统目标检测方法存在的缺陷,实现快速准确的目标检测,提出了一种基于快速区域卷积...由于背景环境复杂,检测物体易受部分遮挡、天气以及光线变化等因素的影响,传统目标检测方法存在提取特征难、检测准确率低、检测耗时长等缺陷.为了改善传统目标检测方法存在的缺陷,实现快速准确的目标检测,提出了一种基于快速区域卷积神经网络(faster regions with convolutional neural network,Faster-RCNN)算法的轻量化改进方法,即针对算法Inception-V2特征提取网络进行轻量化改进,并以带泄露线性整流(leaky rectified linear unit,Leaky ReLU)作为激活函数,解决使用线性整流(rectified linear unit,ReLU)激活函数存在的神经元输入为负数时输出为0的问题.基于上述改进方法,选择沙滩废弃物的检测为案例以验证方法的有效性,并且结合不同特征提取网络在检测沙滩废弃物时的表现,对比了SSD(single shot multibox detector)与Faster-RCNN算法.实验结果表明:所提改进算法在实际检测中有较好的综合性能,且相比原算法Faster-RCNN_Inception-V2,轻量化改进后的Inception-V2特征提取网络卷积计算量减少51.8%,模型训练耗时缩短了9.1%,检测耗时减少了10.9%,各类别AP的平均值(mean average precision,mAP)增加了1.02%,可见所提的改进方法能够有效提高目标检测的准确率,减少检测耗时,并在沙滩废弃物检测上得到成功应用,为海滨城市的沙滩清理维护提供了技术支持与保障.展开更多
Identifying faces in non-frontal poses presents a significant challenge for face recognition(FR)systems.In this study,we delved into the impact of yaw pose variations on these systems and devised a robust method for d...Identifying faces in non-frontal poses presents a significant challenge for face recognition(FR)systems.In this study,we delved into the impact of yaw pose variations on these systems and devised a robust method for detecting faces across a wide range of angles from 0°to±90°.We initially selected the most suitable feature vector size by integrating the Dlib,FaceNet(Inception-v2),and“Support Vector Machines(SVM)”+“K-nearest neighbors(KNN)”algorithms.To train and evaluate this feature vector,we used two datasets:the“Labeled Faces in the Wild(LFW)”benchmark data and the“Robust Shape-Based FR System(RSBFRS)”real-time data,which contained face images with varying yaw poses.After selecting the best feature vector,we developed a real-time FR system to handle yaw poses.The proposed FaceNet architecture achieved recognition accuracies of 99.7%and 99.8%for the LFW and RSBFRS datasets,respectively,with 128 feature vector dimensions and minimum Euclidean distance thresholds of 0.06 and 0.12.The FaceNet+SVM and FaceNet+KNN classifiers achieved classification accuracies of 99.26%and 99.44%,respectively.The 128-dimensional embedding vector showed the highest recognition rate among all dimensions.These results demonstrate the effectiveness of our proposed approach in enhancing FR accuracy,particularly in real-world scenarios with varying yaw poses.展开更多
The automated interpretation of rock structure can improve the efficiency,accuracy,and consistency of the geological risk assessment of tunnel face.Because of the high uncertainties in the geological images as a resul...The automated interpretation of rock structure can improve the efficiency,accuracy,and consistency of the geological risk assessment of tunnel face.Because of the high uncertainties in the geological images as a result of different regional rock types,as well as in-situ conditions(e.g.,temperature,humidity,and construction procedure),previous automated methods have limited performance in classification of rock structure of tunnel face during construction.This paper presents a framework for classifying multiple rock structures based on the geological images of tunnel face using convolutional neural networks(CNN),namely Inception-ResNet-V2(IRV2).A prototype recognition system is implemented to classify 5 types of rock structures including mosaic,granular,layered,block,and fragmentation structures.The proposed IRV2 network is trained by over 35,000 out of 42,400 images extracted from over 150 sections of tunnel faces and tested by the remaining 7400 images.Furthermore,different hyperparameters of the CNN model are introduced to optimize the most efficient algorithm parameter.Among all the discussed models,i.e.,ResNet-50,ResNet-101,and Inception-v4,Inception-ResNet-V2 exhibits the best performance in terms of various indicators,such as precision,recall,F-score,and testing time per image.Meanwhile,the model trained by a large database can obtain the object features more comprehensively,leading to higher accuracy.Compared with the original image classification method,the sub-image method is closer to the reality considering both the accuracy and the perspective of error divergence.The experimental results reveal that the proposed method is optimal and efficient for automated classification of rock structure using the geological images of the tunnel face.展开更多
文摘由于背景环境复杂,检测物体易受部分遮挡、天气以及光线变化等因素的影响,传统目标检测方法存在提取特征难、检测准确率低、检测耗时长等缺陷.为了改善传统目标检测方法存在的缺陷,实现快速准确的目标检测,提出了一种基于快速区域卷积神经网络(faster regions with convolutional neural network,Faster-RCNN)算法的轻量化改进方法,即针对算法Inception-V2特征提取网络进行轻量化改进,并以带泄露线性整流(leaky rectified linear unit,Leaky ReLU)作为激活函数,解决使用线性整流(rectified linear unit,ReLU)激活函数存在的神经元输入为负数时输出为0的问题.基于上述改进方法,选择沙滩废弃物的检测为案例以验证方法的有效性,并且结合不同特征提取网络在检测沙滩废弃物时的表现,对比了SSD(single shot multibox detector)与Faster-RCNN算法.实验结果表明:所提改进算法在实际检测中有较好的综合性能,且相比原算法Faster-RCNN_Inception-V2,轻量化改进后的Inception-V2特征提取网络卷积计算量减少51.8%,模型训练耗时缩短了9.1%,检测耗时减少了10.9%,各类别AP的平均值(mean average precision,mAP)增加了1.02%,可见所提的改进方法能够有效提高目标检测的准确率,减少检测耗时,并在沙滩废弃物检测上得到成功应用,为海滨城市的沙滩清理维护提供了技术支持与保障.
基金funding for the project,excluding research publication,from the Board of Research in Nuclear Sciences(BRNS)under Grant Number 59/14/05/2019/BRNS.
文摘Identifying faces in non-frontal poses presents a significant challenge for face recognition(FR)systems.In this study,we delved into the impact of yaw pose variations on these systems and devised a robust method for detecting faces across a wide range of angles from 0°to±90°.We initially selected the most suitable feature vector size by integrating the Dlib,FaceNet(Inception-v2),and“Support Vector Machines(SVM)”+“K-nearest neighbors(KNN)”algorithms.To train and evaluate this feature vector,we used two datasets:the“Labeled Faces in the Wild(LFW)”benchmark data and the“Robust Shape-Based FR System(RSBFRS)”real-time data,which contained face images with varying yaw poses.After selecting the best feature vector,we developed a real-time FR system to handle yaw poses.The proposed FaceNet architecture achieved recognition accuracies of 99.7%and 99.8%for the LFW and RSBFRS datasets,respectively,with 128 feature vector dimensions and minimum Euclidean distance thresholds of 0.06 and 0.12.The FaceNet+SVM and FaceNet+KNN classifiers achieved classification accuracies of 99.26%and 99.44%,respectively.The 128-dimensional embedding vector showed the highest recognition rate among all dimensions.These results demonstrate the effectiveness of our proposed approach in enhancing FR accuracy,particularly in real-world scenarios with varying yaw poses.
基金supported by the Natural Science Foundation Committee Program of China(Grant Nos.1538009 and 51778474)Science and Technology Project of Yunnan Provincial Transportation Department(Grant No.25 of 2018)+1 种基金the Fundamental Research Funds for the Central Universities in China(Grant No.0200219129)Key innovation team program of innovation talents promotion plan by MOST of China(Grant No.2016RA4059)。
文摘The automated interpretation of rock structure can improve the efficiency,accuracy,and consistency of the geological risk assessment of tunnel face.Because of the high uncertainties in the geological images as a result of different regional rock types,as well as in-situ conditions(e.g.,temperature,humidity,and construction procedure),previous automated methods have limited performance in classification of rock structure of tunnel face during construction.This paper presents a framework for classifying multiple rock structures based on the geological images of tunnel face using convolutional neural networks(CNN),namely Inception-ResNet-V2(IRV2).A prototype recognition system is implemented to classify 5 types of rock structures including mosaic,granular,layered,block,and fragmentation structures.The proposed IRV2 network is trained by over 35,000 out of 42,400 images extracted from over 150 sections of tunnel faces and tested by the remaining 7400 images.Furthermore,different hyperparameters of the CNN model are introduced to optimize the most efficient algorithm parameter.Among all the discussed models,i.e.,ResNet-50,ResNet-101,and Inception-v4,Inception-ResNet-V2 exhibits the best performance in terms of various indicators,such as precision,recall,F-score,and testing time per image.Meanwhile,the model trained by a large database can obtain the object features more comprehensively,leading to higher accuracy.Compared with the original image classification method,the sub-image method is closer to the reality considering both the accuracy and the perspective of error divergence.The experimental results reveal that the proposed method is optimal and efficient for automated classification of rock structure using the geological images of the tunnel face.