Image matching based on scale invariant feature transform(SIFT) is one of the most popular image matching algorithms, which exhibits high robustness and accuracy. Grayscale images rather than color images are genera...Image matching based on scale invariant feature transform(SIFT) is one of the most popular image matching algorithms, which exhibits high robustness and accuracy. Grayscale images rather than color images are generally used to get SIFT descriptors in order to reduce the complexity. The regions which have a similar grayscale level but different hues tend to produce wrong matching results in this case. Therefore, the loss of color information may result in decreasing of matching ratio. An image matching algorithm based on SIFT is proposed, which adds a color offset and an exposure offset when converting color images to grayscale images in order to enhance the matching ratio. Experimental results show that the proposed algorithm can effectively differentiate the regions with different colors but the similar grayscale level, and increase the matching ratio of image matching based on SIFT. Furthermore, it does not introduce much complexity than the traditional SIFT.展开更多
The global context(GC) descriptor is improved for describing interest regions,uses gradient orientation for binning,and thus provides more robust invariance for geometric and photometric transformations.The performanc...The global context(GC) descriptor is improved for describing interest regions,uses gradient orientation for binning,and thus provides more robust invariance for geometric and photometric transformations.The performance of the improved GC(IGC) to image matching is studied through extensive experiments on the Oxford A?ne dataset.Empirical results indicate that the proposed IGC yields quite stable and robust results,signi?cantly outperforms the original GC,and also can outperform the classical scale-invariant feature transform(SIFT) in most of the test cases.By integrating the IGC to the SIFT,the resulting of hybrid SIFT+IGC performs best over all other single descriptors in these experimental evaluations with various geometric transformations.展开更多
Since the outbreak of Coronavirus Disease 2019(COVID-19),people are recommended to wear facial masks to limit the spread of the virus.Under the circumstances,traditional face recognition technologies cannot achieve sa...Since the outbreak of Coronavirus Disease 2019(COVID-19),people are recommended to wear facial masks to limit the spread of the virus.Under the circumstances,traditional face recognition technologies cannot achieve satisfactory results.In this paper,we propose a face recognition algorithm that combines the traditional features and deep features of masked faces.For traditional features,we extract Local Binary Pattern(LBP),Scale-Invariant Feature Transform(SIFT)and Histogram of Oriented Gradient(HOG)features from the periocular region,and use the Support Vector Machines(SVM)classifier to perform personal identification.We also propose an improved Convolutional Neural Network(CNN)model Angular Visual Geometry Group Network(A-VGG)to learn deep features.Then we use the decision-level fusion to combine the four features.Comprehensive experiments were carried out on databases of real masked faces and simulated masked faces,including frontal and side faces taken at different angles.Images with motion blur were also tested to evaluate the robustness of the algorithm.Besides,the experiment of matching a masked face with the corresponding full face is accomplished.The experimental results show that the proposed algorithm has state-of-the-art performance in masked face recognition,and the periocular region has rich biological features and high discrimination.展开更多
Road visual navigation relies on accurate road models.This study was aimed at proposing an improved scale-invariant feature transform(SIFT)algorithm for recovering depth information from farmland road images,which wou...Road visual navigation relies on accurate road models.This study was aimed at proposing an improved scale-invariant feature transform(SIFT)algorithm for recovering depth information from farmland road images,which would provide a reliable path for visual navigation.The mean image of pixel value in five channels(R,G,B,S and V)were treated as the inspected image and the feature points of the inspected image were extracted by the Canny algorithm,for achieving precise location of the feature points and ensuring the uniformity and density of the feature points.The mean value of the pixels in 5×5 neighborhood around the feature point at an interval of 45ºin eight directions was then treated as the feature vector,and the differences of the feature vectors were calculated for preliminary matching of the left and right image feature points.In order to achieve the depth information of farmland road images,the energy method of feature points was used for eliminating the mismatched points.Experiments with a binocular stereo vision system were conducted and the results showed that the matching accuracy and time consuming for depth recovery when using the improved SIFT algorithm were 96.48%and 5.6 s,respectively,with the accuracy for depth recovery of-7.17%-2.97%in a certain sight distance.The mean uniformity,time consuming and matching accuracy for all the 60 images under various climates and road conditions were 50%-70%,5.0-6.5 s,and higher than 88%,respectively,indicating that performance for achieving the feature points(e.g.,uniformity,matching accuracy,and algorithm real-time)of the improved SIFT algorithm were superior to that of conventional SIFT algorithm.This study provides an important reference for navigation technology of agricultural equipment based on machine vision.展开更多
Expression, occlusion, and pose variations are three main challenges for 3D face recognition. A novel method is presented to address 3D face recognition using scale-invariant feature transform(SIFT) features on 3D mes...Expression, occlusion, and pose variations are three main challenges for 3D face recognition. A novel method is presented to address 3D face recognition using scale-invariant feature transform(SIFT) features on 3D meshes. After preprocessing, shape index extrema on the 3D facial surface are selected as keypoints in the difference scale space and the unstable keypoints are removed after two screening steps. Then, a local coordinate system for each keypoint is established by principal component analysis(PCA).Next, two local geometric features are extracted around each keypoint through the local coordinate system. Additionally, the features are augmented by the symmetrization according to the approximate left-right symmetry in human face. The proposed method is evaluated on the Bosphorus, BU-3DFE, and Gavab databases, respectively. Good results are achieved on these three datasets. As a result, the proposed method proves robust to facial expression variations, partial external occlusions and large pose changes.展开更多
A super-resolution enhancement algorithm was proposed based on the combination of fractional calculus and Projection onto Convex Sets(POCS)for unmanned aerial vehicles(UAVs)images.The representative problems of UAV im...A super-resolution enhancement algorithm was proposed based on the combination of fractional calculus and Projection onto Convex Sets(POCS)for unmanned aerial vehicles(UAVs)images.The representative problems of UAV images including motion blur,fisheye effect distortion,overexposed,and so on can be improved by the proposed algorithm.The fractional calculus operator is used to enhance the high-resolution and low-resolution reference frames for POCS.The affine transformation parameters between low-resolution images and reference frame are calculated by Scale Invariant Feature Transform(SIFT)for matching.The point spread function of POCS is simulated by a fractional integral filter instead of Gaussian filter for more clarity of texture and detail.The objective indices and subjective effect are compared between the proposed and other methods.The experimental results indicate that the proposed method outperforms other algorithms in most cases,especially in the structure and detail clarity of the reconstructed images.展开更多
An Unmanned Aircraft System (UAS) is an aircraft or ground station that can be either remote controlled manually or is capable of flying autonomously under the guidance of pre-programmed Global Positioning System (...An Unmanned Aircraft System (UAS) is an aircraft or ground station that can be either remote controlled manually or is capable of flying autonomously under the guidance of pre-programmed Global Positioning System (GPS) waypoint flight plans or more complex onboard intelligent systems. The UAS aircrafts have recently found extensive applications in military reconnaissance and surveillance, homeland security, precision agriculture, fire monitoring and analysis, and other different kinds of aids needed in disasters. Through surveillance videos captured by a UAS digital imaging payload over the interest areas, the corresponding UAS missions can be conducted. In this paper, the authors present an effective method to detect and extract architectural buildings under rural environment from UAS video sequences. The SIFT points are chosen as image features. The planar homography is adopted as the motion model between different image frames. The proposed algorithm is tested on real UAS video data.展开更多
Automatic detection of a designated building area(DBA)is a research hotspot in the field of target detection using remote sensing images.Target detection is urgently needed for tasks such as illegal building monitorin...Automatic detection of a designated building area(DBA)is a research hotspot in the field of target detection using remote sensing images.Target detection is urgently needed for tasks such as illegal building monitoring,dynamic land use monitoring,antiterrorism efforts,and military reconnaissance.The existing detection methods generally have low efficiency and poor detection accuracy due to the large size and complexity of remote sensing scenes.To address the problems of the current detection methods,this paper presents a DBA detection method that uses hierarchical structural constraints in remote sensing images.Our method was conducted in two main stages.(1)During keypoint generation,we proposed a screening method based on structural pattern descriptors.The local pattern feature of the initial keypoints was described by a multilevel local pattern histogram(MLPH)feature;then,we used one-class support vector machine(OC-SVM)merely to screen those building attribute keypoints.(2)To match the screened keypoints,we proposed a reliable DBA detection method based on matching the local structural similarities of the screened keypoints.We achieved precise keypoint matching by calculating the similarities of the local skeletal structures in the neighboring areas around the roughly matched keypoints to achieve DBA detection.We tested the proposed method on building area sets of different types and at different time phases.The experimental results show that the proposed method is both highly accurate and computationally efficient.展开更多
The scale-invariant feature transform (SIFT) is often applied to extract tie-points for airborne SAR images. When a pair of airborne SAR images differs with look angles obviously, shadow sizes and shapes of same objec...The scale-invariant feature transform (SIFT) is often applied to extract tie-points for airborne SAR images. When a pair of airborne SAR images differs with look angles obviously, shadow sizes and shapes of same objects will differ obviously. In main and slave SAR images, key-points around shadows often match as tie-points, although they are not homologous points. The phenomenon worsens the performance of SIFT on SAR images. On the basis of SIFT, a modified matching method is proposed to decrease the number of incorrect tie-points. High-resolution airborne SAR images are used in Experiments. Experiment results show that the proposed method is very effective to extract correct tie-points for SAR images.展开更多
基金supported by the National Natural Science Foundation of China(61271315)the State Scholarship Fund of China
文摘Image matching based on scale invariant feature transform(SIFT) is one of the most popular image matching algorithms, which exhibits high robustness and accuracy. Grayscale images rather than color images are generally used to get SIFT descriptors in order to reduce the complexity. The regions which have a similar grayscale level but different hues tend to produce wrong matching results in this case. Therefore, the loss of color information may result in decreasing of matching ratio. An image matching algorithm based on SIFT is proposed, which adds a color offset and an exposure offset when converting color images to grayscale images in order to enhance the matching ratio. Experimental results show that the proposed algorithm can effectively differentiate the regions with different colors but the similar grayscale level, and increase the matching ratio of image matching based on SIFT. Furthermore, it does not introduce much complexity than the traditional SIFT.
基金the National Natural Science Foundation of China(Nos.60970109 and 61170228)
文摘The global context(GC) descriptor is improved for describing interest regions,uses gradient orientation for binning,and thus provides more robust invariance for geometric and photometric transformations.The performance of the improved GC(IGC) to image matching is studied through extensive experiments on the Oxford A?ne dataset.Empirical results indicate that the proposed IGC yields quite stable and robust results,signi?cantly outperforms the original GC,and also can outperform the classical scale-invariant feature transform(SIFT) in most of the test cases.By integrating the IGC to the SIFT,the resulting of hybrid SIFT+IGC performs best over all other single descriptors in these experimental evaluations with various geometric transformations.
基金Supported by the Postgraduate Research and Practice Innovation Program of Nanjing University of Aeronautics and Astronautics(XCXJH20220318)。
文摘Since the outbreak of Coronavirus Disease 2019(COVID-19),people are recommended to wear facial masks to limit the spread of the virus.Under the circumstances,traditional face recognition technologies cannot achieve satisfactory results.In this paper,we propose a face recognition algorithm that combines the traditional features and deep features of masked faces.For traditional features,we extract Local Binary Pattern(LBP),Scale-Invariant Feature Transform(SIFT)and Histogram of Oriented Gradient(HOG)features from the periocular region,and use the Support Vector Machines(SVM)classifier to perform personal identification.We also propose an improved Convolutional Neural Network(CNN)model Angular Visual Geometry Group Network(A-VGG)to learn deep features.Then we use the decision-level fusion to combine the four features.Comprehensive experiments were carried out on databases of real masked faces and simulated masked faces,including frontal and side faces taken at different angles.Images with motion blur were also tested to evaluate the robustness of the algorithm.Besides,the experiment of matching a masked face with the corresponding full face is accomplished.The experimental results show that the proposed algorithm has state-of-the-art performance in masked face recognition,and the periocular region has rich biological features and high discrimination.
基金This work was financially supported by the Zhejiang Science and Technology Department Basic Public Welfare Research Project(LGN18F030001)the Major Project of Zhejiang Science and Technology Department(2016C02G2100540).
文摘Road visual navigation relies on accurate road models.This study was aimed at proposing an improved scale-invariant feature transform(SIFT)algorithm for recovering depth information from farmland road images,which would provide a reliable path for visual navigation.The mean image of pixel value in five channels(R,G,B,S and V)were treated as the inspected image and the feature points of the inspected image were extracted by the Canny algorithm,for achieving precise location of the feature points and ensuring the uniformity and density of the feature points.The mean value of the pixels in 5×5 neighborhood around the feature point at an interval of 45ºin eight directions was then treated as the feature vector,and the differences of the feature vectors were calculated for preliminary matching of the left and right image feature points.In order to achieve the depth information of farmland road images,the energy method of feature points was used for eliminating the mismatched points.Experiments with a binocular stereo vision system were conducted and the results showed that the matching accuracy and time consuming for depth recovery when using the improved SIFT algorithm were 96.48%and 5.6 s,respectively,with the accuracy for depth recovery of-7.17%-2.97%in a certain sight distance.The mean uniformity,time consuming and matching accuracy for all the 60 images under various climates and road conditions were 50%-70%,5.0-6.5 s,and higher than 88%,respectively,indicating that performance for achieving the feature points(e.g.,uniformity,matching accuracy,and algorithm real-time)of the improved SIFT algorithm were superior to that of conventional SIFT algorithm.This study provides an important reference for navigation technology of agricultural equipment based on machine vision.
基金Project(XDA06020300)supported by the"Strategic Priority Research Program"of the Chinese Academy of SciencesProject(12511501700)supported by the Research on the Key Technology of Internet of Things for Urban Community Safety Based on Video Sensor networks
文摘Expression, occlusion, and pose variations are three main challenges for 3D face recognition. A novel method is presented to address 3D face recognition using scale-invariant feature transform(SIFT) features on 3D meshes. After preprocessing, shape index extrema on the 3D facial surface are selected as keypoints in the difference scale space and the unstable keypoints are removed after two screening steps. Then, a local coordinate system for each keypoint is established by principal component analysis(PCA).Next, two local geometric features are extracted around each keypoint through the local coordinate system. Additionally, the features are augmented by the symmetrization according to the approximate left-right symmetry in human face. The proposed method is evaluated on the Bosphorus, BU-3DFE, and Gavab databases, respectively. Good results are achieved on these three datasets. As a result, the proposed method proves robust to facial expression variations, partial external occlusions and large pose changes.
基金This work is supported by the National Key Research and Development Program of China[grant number 2016YFB0502602]the National Natural Science Foundation of China[grant number 61471272]the Natural Science Foundation of Hubei Province,China[grant number 2016CFB499].
文摘A super-resolution enhancement algorithm was proposed based on the combination of fractional calculus and Projection onto Convex Sets(POCS)for unmanned aerial vehicles(UAVs)images.The representative problems of UAV images including motion blur,fisheye effect distortion,overexposed,and so on can be improved by the proposed algorithm.The fractional calculus operator is used to enhance the high-resolution and low-resolution reference frames for POCS.The affine transformation parameters between low-resolution images and reference frame are calculated by Scale Invariant Feature Transform(SIFT)for matching.The point spread function of POCS is simulated by a fractional integral filter instead of Gaussian filter for more clarity of texture and detail.The objective indices and subjective effect are compared between the proposed and other methods.The experimental results indicate that the proposed method outperforms other algorithms in most cases,especially in the structure and detail clarity of the reconstructed images.
文摘An Unmanned Aircraft System (UAS) is an aircraft or ground station that can be either remote controlled manually or is capable of flying autonomously under the guidance of pre-programmed Global Positioning System (GPS) waypoint flight plans or more complex onboard intelligent systems. The UAS aircrafts have recently found extensive applications in military reconnaissance and surveillance, homeland security, precision agriculture, fire monitoring and analysis, and other different kinds of aids needed in disasters. Through surveillance videos captured by a UAS digital imaging payload over the interest areas, the corresponding UAS missions can be conducted. In this paper, the authors present an effective method to detect and extract architectural buildings under rural environment from UAS video sequences. The SIFT points are chosen as image features. The planar homography is adopted as the motion model between different image frames. The proposed algorithm is tested on real UAS video data.
基金This work was supported in part by the National Natural Science Foundation of China(Grant No.61601006)the Beijing Natural Science Foundation(Grant No.4192021)the Equipment Pre-Research Foundation(Grant No.61404130312).
文摘Automatic detection of a designated building area(DBA)is a research hotspot in the field of target detection using remote sensing images.Target detection is urgently needed for tasks such as illegal building monitoring,dynamic land use monitoring,antiterrorism efforts,and military reconnaissance.The existing detection methods generally have low efficiency and poor detection accuracy due to the large size and complexity of remote sensing scenes.To address the problems of the current detection methods,this paper presents a DBA detection method that uses hierarchical structural constraints in remote sensing images.Our method was conducted in two main stages.(1)During keypoint generation,we proposed a screening method based on structural pattern descriptors.The local pattern feature of the initial keypoints was described by a multilevel local pattern histogram(MLPH)feature;then,we used one-class support vector machine(OC-SVM)merely to screen those building attribute keypoints.(2)To match the screened keypoints,we proposed a reliable DBA detection method based on matching the local structural similarities of the screened keypoints.We achieved precise keypoint matching by calculating the similarities of the local skeletal structures in the neighboring areas around the roughly matched keypoints to achieve DBA detection.We tested the proposed method on building area sets of different types and at different time phases.The experimental results show that the proposed method is both highly accurate and computationally efficient.
基金Supported by the National Key Research and Development Program of China(No.2016YFB0502502)the Special Research and Trial Production Project of Sanya(No.sy17xs0113)
文摘The scale-invariant feature transform (SIFT) is often applied to extract tie-points for airborne SAR images. When a pair of airborne SAR images differs with look angles obviously, shadow sizes and shapes of same objects will differ obviously. In main and slave SAR images, key-points around shadows often match as tie-points, although they are not homologous points. The phenomenon worsens the performance of SIFT on SAR images. On the basis of SIFT, a modified matching method is proposed to decrease the number of incorrect tie-points. High-resolution airborne SAR images are used in Experiments. Experiment results show that the proposed method is very effective to extract correct tie-points for SAR images.