摘要
针对FBCM(基于矩阵压缩FUP(fast update algorithm))算法在项集挖掘过程中存在频繁扫描原频繁项集库,并生成大量候选集的问题,提出一种通过提取数据库中最频繁项的方法,以降低对原频繁项集库的扫描次数;并通过候选集剪枝思想,减少算法整体运行过程中的候选集生成,以提高频繁项集的挖掘速度.实验结果表明,在相同实验条件下,该算法的效率比FBCM算法效率提高15%以上,最高达60%.
Aiming at the problem that FBCM(FUP(fast update algorithm)based on matrix compression)algorithm frequently scanned the original frequent itemset library and generated a large number of candidate sets in the process of item set mining.We proposed a method to extract the most frequent items from the database to reduce the scanning times on the original frequent itemset library,and through a candidate set pruning idea,it reduced the generation of candidate sets in the whole running process of the algorithm,so as to improve the speed of frequent itemset mining.Experimental results show that the efficiency of the algorithm is 15%higher than that of FBCM algorithm,and the highest is 60%under the same experimental conditions.
作者
杨勇
张磊
曲福恒
刘俊杰
陈强
YANG Yong;ZHANG Lei;QU Fuheng;LIU Junjie;CHEN Qiang(School of Computer Science and Technology,Changchun University of Science and Technology,Changchun 130022,China;Institute of Education,Changchun Normal University,Changchun 130032,China)
出处
《吉林大学学报(理学版)》
CAS
北大核心
2021年第3期635-642,共8页
Journal of Jilin University:Science Edition
基金
国家自然科学基金(批准号:41671397)
吉林省教育科学“十三五”规划项目(批准号:GH19086)。
关键词
关联规则
增量挖掘
候选集剪枝
最频繁项
association rule
incremental mining
candidate set pruning
most frequent item