摘要
现有的数据挖掘算法多是集中式环境下的数据挖掘处理,但目前的大型数据库多以分布式的形式存在,针对分布式数据挖掘算法FDM及其改进算法中存在的频繁项集丢失问题和网络通信开销过高的问题,提出了一种改进的基于关联规则的分布式数据挖掘算法LTDM,LTDM算法引入了映射标示数组机制,可以在保证频繁项集完整性的同时降低网络的通信开销。实验结果证明了算法的有效性。
Most of the existing data mining algorithms are processing in the centralized systems; however, at present large database is usually distributed. Compared with the frequent itemsets lost and high communication traffic in distributed database conventional and improved algorithm FDM, an improved distributed data mining algorithm LTDM based on association roles is proposed. LTDM algorithm introduces the mapping indicated array mechanism to keep the integrity of frequent itemsets and decrease the communication traffic. The experimental results prove the efficiency of the proposed algorithm.
出处
《计算机系统应用》
2012年第8期218-221,共4页
Computer Systems & Applications
关键词
分布式数据库
数据挖掘
关联规则
频繁项集
网络通信开销
distributed database
data mining
association rules
frequent itemsets
communication traffic