基于图像的二维人脸识别技术日趋成熟,但仍受光照、姿态和表情等变化的影响。利用三维人脸模型提高人脸识别性能并将其应用于实际成为近几年学术界的研究趋势。本文提出了SWJTU-MF多模人脸数据库(SWJTU multimodal face database,SWJTU-...基于图像的二维人脸识别技术日趋成熟,但仍受光照、姿态和表情等变化的影响。利用三维人脸模型提高人脸识别性能并将其应用于实际成为近几年学术界的研究趋势。本文提出了SWJTU-MF多模人脸数据库(SWJTU multimodal face database,SWJTU-MF Database),包含200个中性表情中国人的4种人脸样本数据,包括可见光图像、二维视频序列、三维人脸(高精度)和立体视频序列。本文首先分类介绍现有的三维人脸识别算法,然后概述相关的多模人脸数据库,接着提出SWJTU-MF多模人脸数据库,并说明数据库的采集装置、采集环境、采集过程及数据内容,随后简要展示数据标准化过程。最后讨论本数据库面向的应用研究,并给出SWJTU-MF建议的评测协议。展开更多
The devastating effects of wildland fire are an unsolved problem,resulting in human losses and the destruction of natural and economic resources.Convolutional neural network(CNN)is shown to perform very well in the ar...The devastating effects of wildland fire are an unsolved problem,resulting in human losses and the destruction of natural and economic resources.Convolutional neural network(CNN)is shown to perform very well in the area of object classification.This network has the ability to perform feature extraction and classification within the same architecture.In this paper,we propose a CNN for identifying fire in videos.A deep domain based method for video fire detection is proposed to extract a powerful feature representation of fire.Testing on real video sequences,the proposed approach achieves better classification performance as some of relevant conventional video based fire detection methods and indicates that using CNN to detect fire in videos is efficient.To balance the efficiency and accuracy,the model is fine-tuned considering the nature of the target problem and fire data.Experimental results on benchmark fire datasets reveal the effectiveness of the proposed framework and validate its suitability for fire detection in closed-circuit television surveillance systems compared to state-of-the-art methods.展开更多
In video sequence-based iris recognition system, the problem of making full use of relationship and correlation among frames still remains to be solved. A brand new template level multimodal fusion algorithm inspired ...In video sequence-based iris recognition system, the problem of making full use of relationship and correlation among frames still remains to be solved. A brand new template level multimodal fusion algorithm inspired by human cognition manner is proposed. In that a non-isolated geometrical manifold, named Hyper Sausage Chain due to its sausage shape, is trained using the frames from a pattern class for representing an iris class in feature space. We can classify any input iris by observing which manifold it locates in. This process is closer to the function of human being, which takes 'matter cognition' instead of 'matter classification' as its basic principle. The experiments on self-developed JLUBR-IRIS dataset with several video sequences per person demonstrate the effectiveness and usability of the proposed algorithm for video sequence-based iris recognition. Fur- thermore, the comparative experiments on public CASIA-I and CASIA-V4-Interval datasets show that our method can also achieve improved performance of image-based iris recognition system, provided enough samples are involved in training stage.展开更多
文摘基于图像的二维人脸识别技术日趋成熟,但仍受光照、姿态和表情等变化的影响。利用三维人脸模型提高人脸识别性能并将其应用于实际成为近几年学术界的研究趋势。本文提出了SWJTU-MF多模人脸数据库(SWJTU multimodal face database,SWJTU-MF Database),包含200个中性表情中国人的4种人脸样本数据,包括可见光图像、二维视频序列、三维人脸(高精度)和立体视频序列。本文首先分类介绍现有的三维人脸识别算法,然后概述相关的多模人脸数据库,接着提出SWJTU-MF多模人脸数据库,并说明数据库的采集装置、采集环境、采集过程及数据内容,随后简要展示数据标准化过程。最后讨论本数据库面向的应用研究,并给出SWJTU-MF建议的评测协议。
基金National Natural Science Foundation of China(No.61573095)Natural Science Foundation of Shanghai,China(No.6ZR1446700)
文摘The devastating effects of wildland fire are an unsolved problem,resulting in human losses and the destruction of natural and economic resources.Convolutional neural network(CNN)is shown to perform very well in the area of object classification.This network has the ability to perform feature extraction and classification within the same architecture.In this paper,we propose a CNN for identifying fire in videos.A deep domain based method for video fire detection is proposed to extract a powerful feature representation of fire.Testing on real video sequences,the proposed approach achieves better classification performance as some of relevant conventional video based fire detection methods and indicates that using CNN to detect fire in videos is efficient.To balance the efficiency and accuracy,the model is fine-tuned considering the nature of the target problem and fire data.Experimental results on benchmark fire datasets reveal the effectiveness of the proposed framework and validate its suitability for fire detection in closed-circuit television surveillance systems compared to state-of-the-art methods.
文摘In video sequence-based iris recognition system, the problem of making full use of relationship and correlation among frames still remains to be solved. A brand new template level multimodal fusion algorithm inspired by human cognition manner is proposed. In that a non-isolated geometrical manifold, named Hyper Sausage Chain due to its sausage shape, is trained using the frames from a pattern class for representing an iris class in feature space. We can classify any input iris by observing which manifold it locates in. This process is closer to the function of human being, which takes 'matter cognition' instead of 'matter classification' as its basic principle. The experiments on self-developed JLUBR-IRIS dataset with several video sequences per person demonstrate the effectiveness and usability of the proposed algorithm for video sequence-based iris recognition. Fur- thermore, the comparative experiments on public CASIA-I and CASIA-V4-Interval datasets show that our method can also achieve improved performance of image-based iris recognition system, provided enough samples are involved in training stage.