一种基于遗传算法的关联规则改进方法被引量：1

Improved mining method for association rules based on genetic algorithm

下载PDF

导出

摘要在对关联规则冗余问题产生机理分析的基础上,提出了针对于支持度阀值设置的惩罚函数和一个改进的遗传算法。该改进算法采用了频繁项分布、素因子编码、择偶和共享函数等新颖技术,使染色体总是能在频繁项密集区进行挖掘,从而对组合搜索空间进行了有效修剪。并且对事务进行了数值转换,有效地压缩了事务数据库存储空间,提高了运算速度。从实验效果来看,改进的挖掘方法在发现有价值规则的效率与精准率方面具有一定优势。 This paper proposes the penalty function by setting support threshold and an improved genetic algorithm, based on the mechanism analysis of redundancy problem production. The algorithm makes chromosome always mining in the concentrated area of frequent item by using some new technologies such as frequent item distribution, primes factor coding, spouse and sharing function, and thus combination space is validly pruned. Moreover, because the numerical conversion is used for the transaction, the storage space of transaction database is validly compressed and the operation speed is improved. Experiment results show that the improved mining method of the paper has certain advantage on the efficiency and precision of finding the valuable rules.

作者李凤营赵连朋王红雨

机构地区渤海大学国家铁路罐车容积计量站锦州分站

出处《计算机工程与应用》 CSCD 北大核心 2008年第14期155-158,165,共5页 Computer Engineering and Applications

关键词关联规则遗传算法频繁项分布素因子编码择偶 association rules genetic algorithm frequent item distribution primes factor coding spouse

分类号 TP301.6 [自动化与计算机技术—计算机系统结构]

引文网络
相关文献

参考文献15

1Agrawal R, Imielinski T, Swami A. Mining association rules between sets of items in large databases [C]//Pruceedings of the ACM SIG- MOD, Washington DC ,1993:207-216. 被引量：1
2Han J, Jian P. Mining frequent patterns without candidate generation [C]//Proceedings of ACM SIGMOD International Conference on Management of Data, Dallas,TX,2000 : 1-12. 被引量：1
3Jr Bayardo J R. Efficiently mining long patterns from databases [ C]//Ashutosh Tiwary,Boeing Co. Proc of the 1998 ACM SIGMOD International Conference on Management of Data( SIGMOD' 98 ). New York : ACM Press,1998:85-93. 被引量：1
4Hoppner F. Learning temporal rules from state sequences [ C ]//IJCAI Workshop on Learning from Temporal and Spatial Data, Seatle ,2001. 被引量：1
5曾海泉,刘永丹,宋扬,胡运发.基于互关联后继树的多时间序列关联模式挖掘[J].计算机研究与发展,2003,40(7):934-940. 被引量：5
6Chen B, Haas P, Scheuermann P. A new two-phase sampling based algorithms for discovery association rules [ C ]//Proceedings of the ACM SIGKDD' 02, Edmonton, Alberta, Canada,2002:462-468. 被引量：1
7Bronnimann H, Chen B, Dash M, et al. Efficient data reduction with EASE [ C ]//Proceedings of the ACM SIGKDD'03 ,Washington D C, 2003 : 59 -68. 被引量：1
8贺志,黄厚宽,田盛丰.一种优化相关规则的发现方法[J].计算机学报,2006,29(6):906-913. 被引量：12
9Omiecinski E R. Alternative interest measures for mining association in databases [ J ]. IEEE Transactions on Knowledge and Data Engineering,2003,15( 1 ) :57-69. 被引量：1
10Srikant R, Agrawal R. Mining generalized association rules [ C]// Proceedings of the 21 st International Conference on Very Large Data Bases ,Tokyo ,Japan, 1995:409-419. 被引量：1

二级参考文献50

1熊磊,李元香.网格计算资源调度策略的三级模式[J].计算机工程与应用,2005,41(1):96-97. 被引量：8
2袁磊,张浩,陆剑峰,陈静.数据网格中数据交互处理模式研究[J].计算机工程与应用,2005,41(15):133-137. 被引量：1
3胡运发.互关联后继树——一种新型全文数据库数学模型,技术报告:CIT-02—3[R].复旦大学计算机与信息技术系,2002.. 被引量：1
4贾彩燕倪现君.关联规则挖掘研究述评[J].计算机科学,2003,30(4):145-148. 被引量：13
5H Mannia, H Toivonen. Discovering generalized episodes using minimal occurrences. The 2nd Int'l Conf on Knowledge Discovery and Data Mining (KDD'96), Portland, Oregon, 1996. 被引量：1
6Y Huhtala, J Kaerkkaeinen, H Toivonen. Mining for similarities in aligned time series using wavelets. Data Mining and Knowledge Discovery: Theory, Tools and Technology, 1999, 3695: 150-160. 被引量：1
7T C Fu, F L Chung, R Luk et al. Pattern discovery from stock time series using self-organizing maps. Knowledge Discovery and Data Minining (KDD)2001 Workshop on Temporal Data Mining,San Francisco, 2001. 被引量：1
8D Gautam, L King, M Heikki et al. Rule discovery from time series. The 4th Int'l Conf on Knowledge Discovery and Data Mining (KDD), San Francisco, 1998. 被引量：1
9Frank Hoppner. Learning temporal rules from state sequences.IJCAI Workshop on Learning from Temporal and Spatial Data,Seatle, 2001. 被引量：1
10R Agrawal, T Imielinski, A Swami. Mining association rules sets of items in large databases. The ACM SIGMOD Conf on Management of Data (SIGMOD'93), Washington, DC, 1993. 被引量：1

共引文献22

1叶德谦,赵世磊.基于线性回归的关联规则相关性方法的研究[J].计算机研究与发展,2008,45(z1):291-294. 被引量：2
2马海兵,张成洪,张锦,胡运发.基于IS~±树模型的频繁模式挖掘[J].计算机研究与发展,2005,42(4):588-593. 被引量：3
3徐建彬,梁国光.试析电视广告视听语言的艺术性[J].齐齐哈尔大学学报（哲学社会科学版）,2005(5):140-140. 被引量：1
4秦少辉,肖辉,胡运发.互关联后继树在时间序列特征模式挖掘中的应用[J].计算机工程与设计,2006,27(8):1327-1329. 被引量：1
5梅红岩,周军,刘海霞.一种基于充要强度的优化规则发现方法[J].计算机应用,2008,28(3):761-763. 被引量：2
6赵连朋,金喜子,孙亮,姜文哲.基于小生境遗传算法的关联规则挖掘方法[J].计算机工程,2008,34(10):163-165. 被引量：5
7王二锋,崔杜武,陈皓,崔颖安,费蓉.一种新的多值属性关联规则挖掘算法[J].计算机工程,2008,34(22):77-79. 被引量：5
8詹芹,廖慧芬.基于抗体浓度和亲合度的关联规则挖掘算法[J].计算机工程与应用,2009,45(21):147-149.
9姜静逸,韩松臣,汤新民,倪金霞,崔国山.基于满意度控制的机场应急救援决策规则的数据挖掘[J].应用科学学报,2009,27(6):644-650. 被引量：3
10胡文瑜,孙志挥,吴英杰.数据挖掘取样方法研究[J].计算机研究与发展,2011,48(1):45-54. 被引量：54

同被引文献9

1刘芳,孙杨军.基于多克隆选择的多维关联规则挖掘算法[J].复旦学报（自然科学版）,2004,43(5):742-745. 被引量：9
2武兆慧,张桂娟,刘希玉.基于模拟退火遗传算法的关联规则挖掘[J].计算机应用,2005,25(5):1009-1011. 被引量：19
3任子武,伞冶.自适应遗传算法的改进及在系统辨识中应用研究[J].系统仿真学报,2006,18(1):41-43. 被引量：169
4Srinivas M,Patnaik LM.Adaptive probabilities of crossover and mutation in genetic algorithms. IEEE Transactions on Systems Man and Cybernetics . 1994 被引量：1
5Gale D.The game of Hex and Brouwer fixed point theorem. The American Mathematical Monthly . 1979 被引量：1
6朱颢东,钟勇.一种改进的模拟退火算法[J].计算机技术与发展,2009,19(6):32-35. 被引量：84
7阮光册,Ah-Hwwe Yu.基于兴趣度策略的启发式Web挖掘算法[J].计算机工程与应用,2009,45(35):148-150. 被引量：3
8符保龙.基于规则提取量的Web日志关联规则挖掘方法[J].计算机应用研究,2010,27(2):500-502. 被引量：4
9许国艳,史宇清.遗传算法在关联规则挖掘中的应用[J].计算机工程,2002,28(7):122-124. 被引量：28

引证文献1

1彭耶萍.自适应遗传模拟退火的Web日志关联挖掘[J].软件导刊,2011,10(7):57-59.

1元素.择偶的条件[J].计算机应用文摘,2009(1):85-85.
2未来择偶新标准：有房有车有IP![J].计算机应用文摘,2010(28):84-84.
3李宇凡.五凤求凰AE EVO 1“择偶”侧记[J].视听技术,2005(9):50-53.
4何世钧,杨立功,李震,张路,李照宇.VB在工业现场数据采集、处理中的应用[J].河南科学,2001,19(3):281-284. 被引量：3
5唐耀庭.数值转换器型问题的解法[J].数学大世界（初中版）,2010(3):3-4.
6Daniel D. WIEGMANN,Lisa M. ANGELONI,Steven M. SEUBERT,J. GordonWADE.Mate choice decisions by searchers[J].Current Zoology,2013,59(2):184-199.
7刘红梅.消除规则冲突和冗余的关联分类方法研究[J].电脑知识与技术,2009,5(1X):629-630.
8daigua.算算,TA值多少钱?[J].电脑爱好者,2010(11):44-44.
9多棱扫描[J].现代阅读,2013(12):5-5.
10张兴华,罗钧旻.可拓学在数据关联规则挖掘中的应用[J].微电子学与计算机,2007,24(4):35-36. 被引量：2

计算机工程与应用

2008年第14期

浏览历史

内容加载中请稍等...

一种基于遗传算法的关联规则改进方法被引量：1

参考文献15

二级参考文献50

共引文献22

同被引文献9

引证文献1

相关作者

相关机构

相关主题

浏览历史

一种基于遗传算法的关联规则改进方法 被引量：1

参考文献15

二级参考文献50

共引文献22

同被引文献9

引证文献1

相关作者

相关机构

相关主题

浏览历史

一种基于遗传算法的关联规则改进方法被引量：1