摘要
在统计方法的基础上提出了一种根据数据集合本身的统计特性数据挖掘算法.该算法利用数据本身的统计特性对数据仓库中的数据进行分析,在给定重要性比例程度的前提下,经过系列的统计运算,得到简化的数据仓库集,并根据分析得到了数据挖掘算法的流程图.提出了比例大小关系函数,反映了数据自身的统计特性,分析表明:这种算法极大的提高了数据的有效水平.
On the basis of statistical method, this paper put forward one data mining algorithm according to statistics characteristic of the data′s set itself. This algorithm utilizes the statistical characteristic of data to analyze the data in the data warehouse, on the premise of giving definitely degree of proportion of importance, through a series of statistics operation, get the simple data warehouse set, and gained the flow chart of data mining algorithms according to analysis. The paper proposed the relation function of the proportion that reflects the statistical characteristic of data itself. The analysis th9at shows this algorithm has improved greatly the effective level of data.
出处
《湖北民族学院学报(自然科学版)》
CAS
2005年第1期42-44,共3页
Journal of Hubei Minzu University(Natural Science Edition)
基金
交通部基础研究专项基金项目(20031981408).
关键词
数据仓库
统计特性
数据挖掘
量纲的统一
data warehouse
statistical characteristic
data mining
dimension unification