A New Algorithm for Mining Frequent Pattern (cited by: 2)


Abstract: Mining frequent patterns in transaction databases, time-series databases, and many other kinds of databases has been widely studied in data mining research. Most previous studies adopt an Apriori-like candidate set generation-and-test approach. However, candidate set generation is very costly. Han J. proposed a novel algorithm, FP growth, that generates frequent patterns without candidate set generation. Based on an analysis of FP growth, this paper proposes the concept of an equivalent FP tree and an improved algorithm, denoted FP growth*, which is much faster and easy to implement. FP growth* adopts a modified structure for the FP tree and header table; in each recursive operation it generates only a header table and projects the tree onto the original FP tree. The two algorithms produce the same frequent pattern set on the same transaction database, but the performance study shows that the improved algorithm, FP growth*, is at least twice as fast as FP growth.
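
For orientation, the baseline that the abstract builds on can be sketched in a few lines of Python. This is a minimal sketch of the classic FP growth idea only (build an FP tree with a header table, then recurse on conditional pattern bases instead of generating candidate sets). It is not the paper's FP growth*, whose modified tree and header-table structures are not specified in the abstract. The names `FPNode`, `build_tree`, `fp_growth`, and `mine` are hypothetical and chosen for illustration.

```python
from collections import defaultdict


class FPNode:
    """One node of an FP tree: an item, its count, and parent/child links."""

    def __init__(self, item, parent):
        self.item = item
        self.count = 0
        self.parent = parent
        self.children = {}


def build_tree(weighted_transactions, min_support):
    """Build an FP tree; return its header table and the frequent-item supports."""
    support = defaultdict(int)
    for items, count in weighted_transactions:
        for item in set(items):
            support[item] += count
    frequent = {i: s for i, s in support.items() if s >= min_support}

    root = FPNode(None, None)
    header = defaultdict(list)  # item -> every tree node that carries the item
    for items, count in weighted_transactions:
        # Keep frequent items only, ordered by descending support (ties by name).
        ordered = sorted((i for i in set(items) if i in frequent),
                         key=lambda i: (-frequent[i], i))
        node = root
        for item in ordered:
            child = node.children.get(item)
            if child is None:
                child = FPNode(item, node)
                node.children[item] = child
                header[item].append(child)
            child.count += count
            node = child
    return header, frequent


def fp_growth(weighted_transactions, min_support, suffix=frozenset()):
    """Return {frozenset(pattern): support} by recursive pattern growth."""
    header, frequent = build_tree(weighted_transactions, min_support)
    patterns = {}
    for item, nodes in header.items():
        pattern = suffix | {item}
        patterns[pattern] = frequent[item]
        # Conditional pattern base: the prefix path above each node holding `item`.
        conditional = []
        for node in nodes:
            path = []
            parent = node.parent
            while parent is not None and parent.item is not None:
                path.append(parent.item)
                parent = parent.parent
            if path:
                conditional.append((path, node.count))
        patterns.update(fp_growth(conditional, min_support, pattern))
    return patterns


def mine(transactions, min_support):
    """Mine plain transactions (lists of items), each counted once."""
    return fp_growth([(t, 1) for t in transactions], min_support)


# Illustrative data, not taken from the paper.
transactions = [["a", "b", "c"], ["a", "b"], ["a", "c"], ["b", "c"], ["a"]]
patterns = mine(transactions, min_support=2)
```

Note that this baseline rebuilds a conditional tree at every recursion step. According to the abstract, FP growth* avoids that cost by generating only a header table per recursive operation and projecting onto the original FP tree.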

Authors: Li Li (李力), Jin Fan (靳蕃)

Source: Journal of Southwest Jiaotong University (English Edition), 2002, No. 1, pp. 10-20 (11 pages)

Funding: the Fund of the National Management Bureau of Traditional Chinese Medicine (No. 2000JP54)

Keywords: data mining, algorithm, frequent pattern set, FP growth

Classification: TP311.131 [Automation and Computer Technology: Computer Software and Theory]

