针对计算机视觉领域的人脸图像检索问题,提出了一种基于深度特征的快速人脸图像检索方法。该方法首先使用人脸图像训练集对深度卷积神经网络模型进行人脸分类训练;在此基础上采用三元组损失方法对已训练好的人脸分类网络模型进行微调,...针对计算机视觉领域的人脸图像检索问题,提出了一种基于深度特征的快速人脸图像检索方法。该方法首先使用人脸图像训练集对深度卷积神经网络模型进行人脸分类训练;在此基础上采用三元组损失方法对已训练好的人脸分类网络模型进行微调,使得网络能够更加有效地提取人脸特征构建高效的特征向量进行人脸检索初步过滤;最后,为了进一步提高系统检索性能,提出一阶段查询扩展方法对待检索人脸图像特征向量进行融合加强。在两个公用人脸数据集(CASIA-3DFaceV1和Labeled Faces in the Wild dataset)上进行详尽的实验验证,结果表明,基于深度特征的人脸图像检索方法不仅能够显著提高检索结果的准确率,而且该方法简单可靠,能够快速地实现人脸检索任务。展开更多
The utilization of digital picture search and retrieval has grown substantially in numerous fields for different purposes during the last decade,owing to the continuing advances in image processing and computer vision...The utilization of digital picture search and retrieval has grown substantially in numerous fields for different purposes during the last decade,owing to the continuing advances in image processing and computer vision approaches.In multiple real-life applications,for example,social media,content-based face picture retrieval is a well-invested technique for large-scale databases,where there is a significant necessity for reliable retrieval capabilities enabling quick search in a vast number of pictures.Humans widely employ faces for recognizing and identifying people.Thus,face recognition through formal or personal pictures is increasingly used in various real-life applications,such as helping crime investigators retrieve matching images from face image databases to identify victims and criminals.However,such face image retrieval becomes more challenging in large-scale databases,where traditional vision-based face analysis requires ample additional storage space than the raw face images already occupied to store extracted lengthy feature vectors and takes much longer to process and match thousands of face images.This work mainly contributes to enhancing face image retrieval performance in large-scale databases using hash codes inferred by locality-sensitive hashing(LSH)for facial hard and soft biometrics as(Hard BioHash)and(Soft BioHash),respectively,to be used as a search input for retrieving the top-k matching faces.Moreover,we propose the multi-biometric score-level fusion of both face hard and soft BioHashes(Hard-Soft BioHash Fusion)for further augmented face image retrieval.The experimental outcomes applied on the Labeled Faces in the Wild(LFW)dataset and the related attributes dataset(LFW-attributes),demonstrate that the retrieval performance of the suggested fusion approach(Hard-Soft BioHash Fusion)significantly improved the retrieval performance compared to solely using Hard BioHash or Soft BioHash in isolation,where the suggested method provides an augmented accuracy of 87%when executed on 1000 specimens 展开更多
Humongous amounts of data bring various challenges to face image retrieval. This paper proposes an efficient method to solve those problems. Firstly,we use accurate facial landmark locations as shape features. Secondl...Humongous amounts of data bring various challenges to face image retrieval. This paper proposes an efficient method to solve those problems. Firstly,we use accurate facial landmark locations as shape features. Secondly, we utilise shape priors to provide discriminative texture features for convolutional neural networks. These shape and texture features are fused to make the learned representation more robust.Finally, in order to increase efficiency, a coarse-tofine search mechanism is exploited to efficiently find similar objects. Extensive experiments on the CASIAWeb Face, MSRA-CFW, and LFW datasets illustrate the superiority of our method.展开更多
作为非线性降维的有效算法,局部线性嵌入(Locally linear embedding)(LLE)和深度自编码网络,被广泛应用于数据挖掘、故障诊断、模式识别等多种领域。本文采用定性与定量相结合的方法,对两种算法进行了对比研究。对LLE算法的基本原理进...作为非线性降维的有效算法,局部线性嵌入(Locally linear embedding)(LLE)和深度自编码网络,被广泛应用于数据挖掘、故障诊断、模式识别等多种领域。本文采用定性与定量相结合的方法,对两种算法进行了对比研究。对LLE算法的基本原理进行了简单介绍。描述了深度自编码网络的理论与模型。提供了数值实验分析,在可视化,人脸识别以及文本检索方面,对两种降维方法进行比较,得到各自适用的优缺点。展开更多
Similarity learning has always been a popular topic in computer vision research. Among this, facial similarity is especially important and difficult due to its wide applications and the nonrigid nature of human faces....Similarity learning has always been a popular topic in computer vision research. Among this, facial similarity is especially important and difficult due to its wide applications and the nonrigid nature of human faces. The large gap between feature representations and human perceptual descriptions makes the problem even harder. In this paper, we learn facial similarity through human-computer interactions. To learn perceptual similarities of faces in a gallery set, we ask users to label some candidate images with their similarities to a probe image. Based on users' responses, a sampling algorithm actively generates a probe image and a set of candidates for the next query. Assisted with human efforts, the algorithm embeds all the images into a space where the distance between two subjects conforms to their dissimilarity in human perception. We apply the learned embedding to face retrieval and compare our method with some feature-based methods on a dataset we collect from social network sites (SNS). Experimental results demonstrate that incorporating human efforts can ensure retrieval accuracy. At the same time, the active sampling algorithm reduces human efforts.展开更多
文摘针对计算机视觉领域的人脸图像检索问题,提出了一种基于深度特征的快速人脸图像检索方法。该方法首先使用人脸图像训练集对深度卷积神经网络模型进行人脸分类训练;在此基础上采用三元组损失方法对已训练好的人脸分类网络模型进行微调,使得网络能够更加有效地提取人脸特征构建高效的特征向量进行人脸检索初步过滤;最后,为了进一步提高系统检索性能,提出一阶段查询扩展方法对待检索人脸图像特征向量进行融合加强。在两个公用人脸数据集(CASIA-3DFaceV1和Labeled Faces in the Wild dataset)上进行详尽的实验验证,结果表明,基于深度特征的人脸图像检索方法不仅能够显著提高检索结果的准确率,而且该方法简单可靠,能够快速地实现人脸检索任务。
基金supported and funded by KAU Scientific Endowment,King Abdulaziz University,Jeddah,Saudi Arabia,grant number 077416-04.
文摘The utilization of digital picture search and retrieval has grown substantially in numerous fields for different purposes during the last decade,owing to the continuing advances in image processing and computer vision approaches.In multiple real-life applications,for example,social media,content-based face picture retrieval is a well-invested technique for large-scale databases,where there is a significant necessity for reliable retrieval capabilities enabling quick search in a vast number of pictures.Humans widely employ faces for recognizing and identifying people.Thus,face recognition through formal or personal pictures is increasingly used in various real-life applications,such as helping crime investigators retrieve matching images from face image databases to identify victims and criminals.However,such face image retrieval becomes more challenging in large-scale databases,where traditional vision-based face analysis requires ample additional storage space than the raw face images already occupied to store extracted lengthy feature vectors and takes much longer to process and match thousands of face images.This work mainly contributes to enhancing face image retrieval performance in large-scale databases using hash codes inferred by locality-sensitive hashing(LSH)for facial hard and soft biometrics as(Hard BioHash)and(Soft BioHash),respectively,to be used as a search input for retrieving the top-k matching faces.Moreover,we propose the multi-biometric score-level fusion of both face hard and soft BioHashes(Hard-Soft BioHash Fusion)for further augmented face image retrieval.The experimental outcomes applied on the Labeled Faces in the Wild(LFW)dataset and the related attributes dataset(LFW-attributes),demonstrate that the retrieval performance of the suggested fusion approach(Hard-Soft BioHash Fusion)significantly improved the retrieval performance compared to solely using Hard BioHash or Soft BioHash in isolation,where the suggested method provides an augmented accuracy of 87%when executed on 1000 specimens
文摘Humongous amounts of data bring various challenges to face image retrieval. This paper proposes an efficient method to solve those problems. Firstly,we use accurate facial landmark locations as shape features. Secondly, we utilise shape priors to provide discriminative texture features for convolutional neural networks. These shape and texture features are fused to make the learned representation more robust.Finally, in order to increase efficiency, a coarse-tofine search mechanism is exploited to efficiently find similar objects. Extensive experiments on the CASIAWeb Face, MSRA-CFW, and LFW datasets illustrate the superiority of our method.
文摘作为非线性降维的有效算法,局部线性嵌入(Locally linear embedding)(LLE)和深度自编码网络,被广泛应用于数据挖掘、故障诊断、模式识别等多种领域。本文采用定性与定量相结合的方法,对两种算法进行了对比研究。对LLE算法的基本原理进行了简单介绍。描述了深度自编码网络的理论与模型。提供了数值实验分析,在可视化,人脸识别以及文本检索方面,对两种降维方法进行比较,得到各自适用的优缺点。
文摘Similarity learning has always been a popular topic in computer vision research. Among this, facial similarity is especially important and difficult due to its wide applications and the nonrigid nature of human faces. The large gap between feature representations and human perceptual descriptions makes the problem even harder. In this paper, we learn facial similarity through human-computer interactions. To learn perceptual similarities of faces in a gallery set, we ask users to label some candidate images with their similarities to a probe image. Based on users' responses, a sampling algorithm actively generates a probe image and a set of candidates for the next query. Assisted with human efforts, the algorithm embeds all the images into a space where the distance between two subjects conforms to their dissimilarity in human perception. We apply the learned embedding to face retrieval and compare our method with some feature-based methods on a dataset we collect from social network sites (SNS). Experimental results demonstrate that incorporating human efforts can ensure retrieval accuracy. At the same time, the active sampling algorithm reduces human efforts.