基于链表结构的频繁模式树构造

Construction of Frequent Tree Based on Linked List Structure

下载PDF

导出

摘要 FP-Growth算法在关联规则挖掘中是最经典的算法,主要通过频繁模式树(FP树)避免生成候选频繁项目集。针对FP-Growth算法中耗费内存严重的问题,采用链表存储方式,给出了FP-Growth算法的实现方法,其中单个结点采用链表形式来产生,频繁模式树采用左孩子右兄弟的存储结构来组织。在此基础上利用索引表,实现了对频繁模式树中共同前缀结点的快速查找,提高了频繁模式树构造的效率,解决了FP树构造算法中数据存储的瓶颈问题。最后以天体光谱数据和城市土壤数据作为数据集分别对该算法进行测试,实验结果表明,该方法的构造效率要明显优于基于顺序结构的FP-Growth算法。 FP-Growth algorithm is the most typical and the most commonly used construction algorithms of frequent pattern tree（FP-Tree）.This article presents a realization method of FP-Growth algorithm based on linked list structure,in which the node of tree is organized by linked list structure,and the frequent pattern tree is stored by using children-brother data structure.On this basis,the quick search for the common and former nodes in the frequent pattern tree by using indexed table becomes true.Accordingly,the time efficiency of the construction of frequent pattern tree is improved,and the bottleneck of data store in the FP-tree construction algorithm is solved.Finally,through the use of the stellar data and urban soil data as the experimental data,the experiment results show that the method is evidently prior to the construction efficiency of the FP-Growth algorithm based on the sequential structure.

作者马洋赵旭俊

机构地区太原科技大学

出处《太原科技大学学报》 2013年第2期85-90,共6页 Journal of Taiyuan University of Science and Technology

基金山西省青年基金(2012021015-4) 山西省高校高新技术产业化项目(20121011) 太原科技大学校青年基金(20123020)

关键词关联规则频繁模式链表结构索引表光谱数据 association rules frequent pattern linked list structure indexed table spectra data

分类号 TP311 [自动化与计算机技术—计算机软件与理论]

引文网络
相关文献

参考文献13

1AGRAWAL R, IMIELINSKI T, SWAMI A. Mining association rules between sets of items in large databases [ C ]//Proc of 1 th Int Conf on Management of Data, Washington DC, USA, 1993:207-216. 被引量：1
2JIAWEI Han,JIAN Pei, YIWEN Yin, et al. Mining Frequent Patterns without Candidate Generation: A Frequent-Pattern Tree Approach [ J ]. Data Mining and Knowledge Discovery ,2004,8 (1):53-87. 被引量：1
3王磊,张继福.基于属性相关分析的离群数据并行挖掘算法[J].太原科技大学学报,2011,32(5):364-369. 被引量：2
4马洋.恒星光谱数据分类规则挖掘系统研究[J].太原科技大学学报,2011,32(4):269-273. 被引量：2
5赵旭俊.基于频繁模式树的正负项目集挖掘[J].太原科技大学学报,2012,33(1):18-22. 被引量：2
6EHUD GUDES, SOLOMON EYAL SHIMONY, NATALIA VANETIK. Discovering Frequent Graph Patterns Using Disjoint Paths [ J ]. IEEE Transactions on Knowledge and Data Engineering ,2006,18 ( 11 ) : 1441-1456. 被引量：1
7CLAUDIO LUCCHESE, SALVATORE ORLANDO, RAFFAELE PEREGO. Fast and Memory Ettieient Mining of Frequent ClosedItemsets [ J ]. IEEE Transactions on Knowledge and Data Engineering,2006,18 (1) :21-36. 被引量：1
8DO TRONG. PGLCM: Efficient Parallel Mining of Closed Frequent Gradual Itemsets [ C ]//IEEE International Conference on Data Mining. Australia, Sydney ,2010 : 138-147. 被引量：1
9MESA ALEJANDRO, FEREGRINO-URIBE CLAUDIA, CUMPLIDO REN, et al. A Highly Parallel Algorithm for Frequent Itemset Mining [ C ]//MCPR' 10 Proceeding of the 2nd Mexican conference on pattern recognition. Mexico, Puebla, 2010: 291-300. 被引量：1
10韩蒙,张炜,李建中.RAKING:一种高效的不确定图K-极大频繁模式挖掘算法[J].计算机学报,2010,33(8):1387-1395. 被引量：17

二级参考文献55

1蔡江辉,张继福.基于聚类的离群数据挖掘及应用[J].太原重型机械学院学报,2004,25(4):254-258. 被引量：2
2张继福,张素兰,胡立华.约束概念格及其构造方法[J].智能系统学报,2006,1(2):31-38. 被引量：14
3胡学钢,陈慧,张玉红,马冯.基于分布式概念格的分类规则挖掘[J].合肥工业大学学报（自然科学版）,2007,30(2):132-136. 被引量：2
4Inokuchi A,Washio T,Motoda H.An apriori-based algorithm for mining frequent substructures from graph data//Proceedings of the 4th European Conference on Principles of Data Mining and Knowledge Discovery(PKDD'00).Freiburg,Germany,2000:13-23. 被引量：1
5Kuramochi M,Karypis G.Frequent subgraph discovery//Proceedings of the 2001 IEEE International Conference on Data Mining(ICDM 2001).San Jose,California,USA,2001:313-320. 被引量：1
6Yan X,Han J.gSpan:Graph-based substructure pattern mining//Proceedings of the 2001 IEEE International Conference on Data Mining(ICDM 2002).Maebashi City,Japan,2002:721-724. 被引量：1
7Tian Y,Hankins R A,Patel J M.Efficient aggregation for graph summarization//Proceedings of the ACM SIGMOD International Conference on Management of Data(SIGMOD 2008).Vancouver,BC,Canada,2008:567-580. 被引量：1
8Chen Chen,Lin Cindy X,Fredrikson Matt,Christodorescu Mihai,Yan Xifeng,Han Jiawei.Mining graph patterns efficiently via randomized summaries//Proceedings of the VLDB09.Lyon,France,2009:742-753. 被引量：1
9Yan X,Han J.Closegraph:Mining closed frequent graph patterns//Proceedings of the 9th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining(KDD').New York,NY,USA,2003:286-295. 被引量：1
10Zeng Z,Wang J,Zhang J,Zhou L.FOGGER:An algorithm for graph generator discovery//Proceedings of the 12th Inter-national Conference on Extending Database Technology(EDBT 2009).Saint-Petersburg,Russia,2009:517-528. 被引量：1

共引文献19

1韩蒙,李建中,邹兆年.从不确定图中发现K紧密子图[J].计算机科学与探索,2011,5(9):791-803. 被引量：5
2赵旭俊.基于频繁模式树的正负项目集挖掘[J].太原科技大学学报,2012,33(1):18-22. 被引量：2
3王宁,杨扬,巩华荣,赵耀培,孟坤.一种基于极大团的关键时间段挖掘方法[J].计算机科学,2012,39(6):166-169. 被引量：1
4蔡伟,张柏礼,吕建华.不确定图最可靠最大流算法研究[J].计算机学报,2012,35(11):2371-2380. 被引量：9
5张春英,张雪.Web社会网络的不确定属性图模型与应用[J].河北联合大学学报（自然科学版）,2012,34(4):56-61.
6陈捷,孟春梅.关联分析频繁模式挖掘Apriori算法简介及其应用[J].软件导刊,2012,11(11):49-50. 被引量：1
7王宁,杨扬,由海涌,赵耀培,孟坤.极大有序频繁项目集的时间属性分析方法[J].小型微型计算机系统,2013,34(1):120-124. 被引量：3
8王果,吴良峥,骆晓艳.基于时序的不同事物同属性的关联规则挖掘[J].江苏技术师范学院学报,2013,19(2):20-23.
9赵旭俊,蔡江辉,马洋.基于信息熵的加权频繁模式树构造算法研究[J].模式识别与人工智能,2014,27(1):28-34. 被引量：3
10刘爱琴,荀亚玲.基于属性熵和加权余弦相似度的离群算法[J].太原科技大学学报,2014,35(3):171-175. 被引量：5

1葛凌云,张继福,蔡江辉.基于微粒群和子空间的离群数据挖掘算法研究[J].系统仿真学报,2009,21(7):1897-1900. 被引量：2
2盛英.基于Fisher判别的半监督天体光谱数据特征降维[J].太原科技大学学报,2012,33(5):331-336.
3胡立华,张继福,张素兰.基于剪枝的概念格渐进式构造[J].计算机应用,2006,26(7):1659-1661. 被引量：3
4张继福,蔡江辉.面向LAMOST的天体光谱离群数据挖掘系统研究[J].光谱学与光谱分析,2007,27(3):606-609. 被引量：6
5韩永奇,张芸,姚玉霞.基于关联规则数据挖掘算法的应用[J].农业网络信息,2008(11):73-75. 被引量：3
6蔡江辉,杨海峰,赵旭俊,张继福.一种恒星光谱分类规则后处理方法[J].光谱学与光谱分析,2013,33(1):237-240. 被引量：2
7蔡江辉,张继福,赵旭俊.基于PSO的二阶段光谱模糊聚类研究[J].光谱学与光谱分析,2009,29(4):1137-1141. 被引量：4
8钟伟雄.一种简单方法实现自动浇花控制[J].福建电脑,2011,27(6):157-158. 被引量：1
9胡立华,张继福,张素兰.一种基于剪枝的横向分块概念格构造算法[J].小型微型计算机系统,2011,32(7):1394-1399. 被引量：4
10张仁,沈志宏,黎建辉,施建平.基于规则的土壤数据校验模型研究与实现[J].计算机系统应用,2010,19(8):78-81. 被引量：1

太原科技大学学报

2013年第2期

浏览历史

内容加载中请稍等...

基于链表结构的频繁模式树构造

参考文献13

二级参考文献55

共引文献19

相关作者

相关机构

相关主题

浏览历史