摘要
对分布式数据库进行准确分类,能够有效提高数据的利用率。对数据库的准确分类,需通过近似函数计算后验概率,根据概率结果,完成数据库的准确分类。传统方法通过构造查询矩阵和相似度矩阵,确定数据库准确分类的策略,但忽略了后验概率的计算,导致分类效果不显著。在云计算平台下,提出基于Parzen窗估计模型的分布式数据库准确分类方法,在分析分布式数据库分类系统原理模型基础上,利用Parzen窗估计模型确定分布式数据库区间样本的类别条件概率密度函数,通过插值法设计类别条件概率密度函数的近似函数,并利用此近似函数计算数据库分类样本后验概率,根据概率结果,实现分布式数据库分类。通过计算数据库分类结果的亲和力,并将分类结果亲和力与设定阈值进行对比,实现分布式数据库准确分类。实验结果表明,所提方法分类准确度较高,且分类过程较简单。
To accurately classify distributed database can effectively improve the utilization rate of data. The traditional method constructs the query matrix and similarity matrix and determines the strategy of accurate classification for database,but ignores the calculation of posterior probability,which results in the insignificant classification effect.In cloud computing platform,this article puts forward an accurate classification method of distributed database based on Parzen window estimation model. Based on the analysis of the principle model of classification system in distributed database,this research used Parzen window estimation model to determine the probability density function of class condition of interval sample in distributed database. Then,our research used the interpolation method to design the approximate function of probability density function of class condition and used this approximate function to calculate the posterior probability of classification sample in database. According to the probability result,the research realized the classification of distributed database. By calculating the affinity of database classification result and comparing the affinity of classification result with the set threshold,we achieved the accurate classification of distributed database.Simulation results show that the proposed method has high classification accuracy and simple classification process.
作者
曹曼曼
汪勉
CAO Man-man;WANG Mian(Department of Computer Science,Jining University,Qufu Shandong 273155,China;Institute of Scientific and Technical Information of Jining,Jining Shandong 272000,China)
出处
《计算机仿真》
北大核心
2019年第1期354-357,共4页
Computer Simulation
关键词
云计算平台
分布式
数据库
准确分类
Cloud computing platform
Distributed
Database
Accurate classification