摘要
特征选择方法作为重要的数据预处理工作一直受到各个领域的关注。在分析现有的特征选择方法的基础上,针对MRMR方法中存在的冗余度和相关性评价方法单一,不能根据用户需求设置特征维度等问题进行了改进。在冗余度计算过程提出一种新的简单快速的计算方法;在计算权重过程中提出针对不同数据选用不同的特征评价方法;引入新的目标评价函数来进行特征选择。在五个经典的用于生物认证领域的特征数据库(FERET、CASIA、ORL、PIE和扩展的YaleB)上验证了算法的有效性,实验结果充分证明了改进的最大相关最小冗余算法的优势。
Feature selection as an important preliminary work has been concerned in various fields. Through analyzing the existing feature selection methods, the problem is improved that the single redundancy and relevance evaluation method and feature dimension cannot be set according to user requirements. A novel simple and fast computing method is presented in the redundant calculation process;the weight is calculated according to the data different choice of different evaluation methods;the novel evaluation function is used in feature selection. With five different databases(FERET、CASIA、ORL、CMU PIE and Extended YaleB), the effectiveness and feasibility of the algorithm are proved. The experimental results demonstrate the advantage of the MMRMR.
出处
《计算机工程与应用》
CSCD
2014年第9期116-122,共7页
Computer Engineering and Applications
基金
辽宁省社会科学规划基金项目(No.L13BXW006)
吉林省科技发展计划项目青年科研基金(No.201201070)
关键词
特征选择
最大相关最小冗余(MRMR)
生物认证
评价函数
经典数据库
feature selection
Minimal Redundancy Maximal Relevance(MRMR)
biometric identification
evaluation function
regular databases