期刊文献+

基于遗传算法的α-离群约简搜索算法 被引量:2

A Searching Algorithm for α-Outlying Reduction Based on Genetic Algorithm
下载PDF
导出
摘要 离群数据挖掘与分析在网络入侵控制、信用卡检测、通信欺诈分析等诸多领域具有十分重要的意义。结合粗糙集理论的属性约简技术,定义了α-离群约简等概念,提出了一种以属性离群贡献率和离群划分相似水平为基础的基于遗传算法的α-离群约简算法。这种方法通过维数更小的属性子空间去获得相同或相近的离群数据集,使对离群数据来源及出现原因的分析和理解更加集中于较小的目标域。通过对现实数据集的实验表明,该算法可有效地产生出约简并具有较好的规模适应性。 Mining and analyzing for outliers is of great importance in many applications, including network invasion control, credit card and teleeom fraud detection, etc. A concept of a-outlying reduction is defined in the paper based on the approach of attribute reduction in the theory of rough set. Along with the discussion of outlying contribution rate of attributes and the level of outlying partition similarity, this paper proposes a searching algorithm for α-outlying reduction based on genetic algorithm. The approach can help us obtain similar outlier sets by means of searching in an attributes subspace with lesser dimension, which leads to that analyzing for origins and appearance reasons of outliers is focused better on narrow and specific object fields. Experimental results on real world data sets show that the proposed algorithm is scalable and efficient and it can result in optimal eduction.
出处 《计算机科学》 CSCD 北大核心 2006年第10期198-201,共4页 Computer Science
基金 重庆市自然科学基金资助项目(2005BB2224)。
关键词 离群约简 遗传算法 粗糙集 离群相似水平 Outlying reduction, Genetic algorithm, Rough set, Outlying similarity level
  • 相关文献

参考文献9

  • 1Ramaswamy S,Rastogi R,Shim K.Efficient algorithms for Mining Outliers from Large Data Sets.In:Proc.of the ACM SIGMOD International Conference on Management of Data.New York:ACM Press,2000.427~438 被引量:1
  • 2Angiulli F,Pizzuti C.Outlier mining in large high dimensional data sets.IEEE Tran on Knowledge and Data Engineering,2005,17(2):203~215 被引量:1
  • 3Tolvi J.Genetic algorithms for outlier detection and variable selection in linear regression models.Soft Computing,2003 被引量:1
  • 4张文修等编著..粗糙集理论与方法[M].北京:科学出版社,2001:224.
  • 5高隽编著..智能信息处理方法导论[M].北京:机械工业出版社,2004:325.
  • 6Ware M,Wilson D,Ware A.A knowledge based genetic algorithm approach to automating cartographic generalization.Knowledge-Based Systems,2003(16):295~303 被引量:1
  • 7陶志,许宝栋,汪定伟,李冉.基于遗传算法的粗糙集知识约简方法[J].系统工程,2003,21(4):116-122. 被引量:71
  • 8Dai J,Li Y.Heuristic genetic algorithm for minimal reduction decision system based on rough set theory.In:Proc.of the First International Conference on Machine Learning and Cybernetics.IEEE Press,2002.833~836 被引量:1
  • 9王立宏,吴耿锋.基于并行协同进化的属性约简[J].计算机学报,2003,26(5):630-635. 被引量:22

二级参考文献11

共引文献90

同被引文献7

引证文献2

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部