
A New Algorithm for Mining Frequent Pattern (cited by: 2)

Abstract: Mining frequent patterns in transaction databases, time series databases, and many other kinds of databases has been widely studied in data mining research. Most previous studies adopt an Apriori-like candidate set generation-and-test approach. However, candidate set generation is very costly. Han J. proposed a novel algorithm, FP growth, that generates frequent patterns without generating candidate sets. Based on an analysis of FP growth, this paper proposes the concept of an equivalent FP tree and an improved algorithm, denoted FP growth*, which is much faster and easy to implement. FP growth* adopts a modified structure for the FP tree and header table; in each recursive operation it generates only a header table and projects the tree onto the original FP tree. The two algorithms produce the same frequent pattern set on the same transaction database, but the performance study shows that the improved algorithm, FP growth*, is at least twice as fast as FP growth.
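For orientation, the baseline that the abstract builds on can be sketched in a few lines of Python. This is a minimal sketch of classic FP growth only, not the paper's FP growth*, whose modified FP tree and header table are not specified in this record. All names (`Node`, `build_tree`, `fp_growth`, `mine`) are hypothetical, and the sketch assumes items are comparable values such as strings.

```python
from collections import defaultdict


class Node:
    """One FP-tree node: an item, its count, and links to parent and children."""

    def __init__(self, item, parent):
        self.item = item
        self.count = 0
        self.parent = parent
        self.children = {}


def build_tree(weighted_transactions, min_support):
    """Build an FP tree and header table from (items, weight) pairs."""
    support = defaultdict(int)
    for items, weight in weighted_transactions:
        for item in set(items):
            support[item] += weight
    frequent = {i: s for i, s in support.items() if s >= min_support}

    root = Node(None, None)
    header = defaultdict(list)  # item -> every tree node carrying that item
    for items, weight in weighted_transactions:
        # Keep frequent items only, ordered by descending support (ties by name).
        ordered = sorted((i for i in set(items) if i in frequent),
                         key=lambda i: (-frequent[i], i))
        node = root
        for item in ordered:
            child = node.children.get(item)
            if child is None:
                child = Node(item, node)
                node.children[item] = child
                header[item].append(child)
            child.count += weight
            node = child
    return header, frequent


def fp_growth(weighted_transactions, min_support, suffix=frozenset()):
    """Yield (itemset, support) pairs without generating candidate sets."""
    header, frequent = build_tree(weighted_transactions, min_support)
    for item, nodes in header.items():
        pattern = suffix | {item}
        yield pattern, frequent[item]
        # Conditional pattern base: the prefix path above every node of `item`.
        conditional = []
        for node in nodes:
            path = []
            parent = node.parent
            while parent.item is not None:  # stop at the root
                path.append(parent.item)
                parent = parent.parent
            if path:
                conditional.append((path, node.count))
        # Classic FP growth rebuilds a full conditional tree at each recursion.
        yield from fp_growth(conditional, min_support, pattern)


def mine(transactions, min_support):
    """Convenience wrapper: every transaction starts with weight 1."""
    return dict(fp_growth([(t, 1) for t in transactions], min_support))
```

Calling `mine(transactions, min_support)` on a list of item lists returns a dictionary that maps each frequent itemset (a `frozenset`) to its support count. The sketch rebuilds a conditional tree in every recursive call, which is the cost that, according to the abstract, FP growth* reduces by generating only a header table per recursion.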
Authors: 李力 (Li Li), 靳蕃 (Jin Fan)
Source: Journal of Southwest Jiaotong University (English Edition), 2002, No. 1, pp. 10-20 (11 pages)
Funding: The Fund of the National Management Bureau of Traditional Chinese Medicine (No. 2000JP54)
Keywords: data mining, algorithm, frequent pattern set, FP growth
