Research and Development of Feature Dimensionality Reduction (特征降维技术的研究与进展)

Cited by: 25
Abstract: The quality of data features directly affects model accuracy, and feature dimensionality reduction has long attracted researchers' attention in pattern recognition. With the arrival of the big data era, data volumes have grown sharply and data dimensionality keeps rising. On high-dimensional data, traditional data mining methods degrade in performance or fail outright. Practice shows that reducing the dimensionality of the features before analysis is an effective way to avoid the "curse of dimensionality", and dimensionality reduction is therefore widely applied across fields. This paper describes in detail the two classes of dimensionality reduction methods, feature extraction and feature selection, and compares their characteristics. It summarizes and analyzes the most representative feature selection algorithms in terms of their two key processes: the subset search strategy and the evaluation criterion. Finally, starting from practical applications, it discusses research directions in feature dimensionality reduction that merit attention.
Author: HUANG Xuan (黄铉), School of Information Science and Technology, Southwest Jiaotong University, Chengdu 610031, China
Source: Computer Science (《计算机科学》), indexed in CSCD and the Peking University Core Journals list; 2018, Issue B06, pp. 16-21, 53 (7 pages)
Keywords: dimensionality reduction; feature selection; feature extraction; research progress
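
The abstract frames feature selection as two cooperating pieces: a subset search strategy and an evaluation criterion. The sketch below illustrates that split under stated assumptions: the search strategy is greedy sequential forward selection, and the criterion is a simple correlation-based filter score (mean relevance to the target minus mean redundancy among the chosen features). Neither is claimed to be a specific algorithm surveyed in the paper, and the names `pearson`, `criterion`, `forward_select`, and the toy data are hypothetical.

```python
from math import sqrt


def pearson(a, b):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        return 0.0
    return cov / sqrt(var_a * var_b)


def criterion(columns, target, subset):
    """Evaluation criterion (illustrative assumption): relevance minus redundancy."""
    relevance = sum(abs(pearson(columns[j], target)) for j in subset) / len(subset)
    pairs = [(i, j) for i in subset for j in subset if i < j]
    if not pairs:
        return relevance
    redundancy = sum(abs(pearson(columns[i], columns[j])) for i, j in pairs) / len(pairs)
    return relevance - redundancy


def forward_select(columns, target, k):
    """Search strategy: greedily add the feature that maximizes the criterion."""
    selected = []
    remaining = list(range(len(columns)))
    while remaining and len(selected) < k:
        best = max(remaining, key=lambda j: criterion(columns, target, selected + [j]))
        selected.append(best)
        remaining.remove(best)
    return selected


# Toy data (hypothetical): each inner list is one feature column.
columns = [
    [1.0, 2.0, 3.0, 4.0, 5.0, 6.0],    # f0: linear trend, strongly relevant
    [0.5, 2.5, 3.0, 4.0, 5.5, 5.5],    # f1: near-copy of f0, hence redundant
    [1.0, -1.0, 0.0, 0.0, -1.0, 1.0],  # f2: uncorrelated with f0, mildly relevant
    [5.0, 5.0, 5.0, 5.0, 5.0, 5.0],    # f3: constant, carries no information
]
target = [2.0, 1.0, 3.0, 4.0, 4.0, 7.0]  # equals f0 + f2 element-wise

chosen = forward_select(columns, target, k=2)
print(chosen)  # [0, 2]
```

The search first picks `f0`, the most relevant single feature, then prefers `f2` over the near-duplicate `f1` because the criterion penalizes redundancy. Swapping in a different criterion (for example, a classifier's validation accuracy) or a different search (backward elimination, branch and bound) changes only one of the two functions, which is the separation the abstract describes.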