Abstract
In practice, data stored in databases are often disturbed by various factors and are seldom complete, so the data to be processed are usually incomplete to some degree. Based on rough set theory, this paper studies how to complete an incomplete information system. An improved ROUSTIDA algorithm is proposed on the premise that filled-in values should reflect the basic characteristics and implicit internal rules of the information system. Building on the discernibility matrix, the improved algorithm adds strategies for missing values that the original algorithm cannot handle, which extends its range of application. It transforms an incomplete information system into a complete one, avoids introducing inconsistent information during filling, and makes the completion more reasonable and effective, thereby preparing the data for subsequent data mining.
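The abstract does not spell out the improved algorithm, so the following is only a minimal sketch of the basic ROUSTIDA idea as it is commonly described, not the authors' improved version. It assumes that `None` marks a missing value and that two objects count as similar when no attribute known in both of them differs (an empty discernibility-matrix entry). The names `compatible` and `roustida_fill` are hypothetical and chosen for illustration.

```python
from typing import Hashable, List, Optional

Row = List[Optional[Hashable]]  # None marks a missing value (assumption)


def compatible(a: Row, b: Row) -> bool:
    """True when no attribute known in both rows differs, i.e. the
    discernibility-matrix entry for this pair of objects is empty."""
    return all(x is None or y is None or x == y for x, y in zip(a, b))


def roustida_fill(table: List[Row]) -> List[Row]:
    """Fill missing cells from similar objects; leave conflicting cells missing."""
    rows = [list(r) for r in table]  # work on a copy, keep the input intact
    changed = True
    while changed:
        changed = False
        # Each pass reads from the table state at the start of that pass.
        snapshot = [list(r) for r in rows]
        for i, row in enumerate(snapshot):
            if None not in row:
                continue
            similar = [r for j, r in enumerate(snapshot)
                       if j != i and compatible(row, r)]
            for k, value in enumerate(row):
                if value is not None:
                    continue
                # Known values that similar objects propose for attribute k.
                candidates = {r[k] for r in similar if r[k] is not None}
                if len(candidates) == 1:  # unanimous, so filling is safe
                    rows[i][k] = next(iter(candidates))
                    changed = True
                # Zero or several candidates: the cell stays missing.
    return rows


table = [
    [1, 0, None],  # only the next row is similar, so the gap can be filled
    [1, 0, 2],
    [0, None, 3],  # the two rows below are similar but disagree on attribute 1
    [0, 1, 3],
    [0, 2, 3],
]
result = roustida_fill(table)
```

In this sketch a cell is filled only when every similar object proposes the same value, which is one way to avoid introducing inconsistent information. Cells with conflicting or no candidates stay missing and would need a separate strategy.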
Source
《计算机工程与设计》
CSCD
Peking University Core Journal (北大核心)
2009, No. 7, pp. 1681-1684 (4 pages)
Computer Engineering and Design
Funding
Natural Science Foundation Project of the Yunnan Provincial Department of Education (5Y0590D)
Keywords
incomplete information system
rough set
data mining
missing values