Research on an Association Model for Library Book Lending Data Based on Spark

Abstract: To help readers quickly and effectively find the books they need among a very large collection, data mining was applied to the library management system by processing the data in blocks with the MapReduce framework and combining this with the Apriori association analysis algorithm. This approach, however, has to scan the database many times and generates a large number of candidate sets, which poses a serious challenge to processing speed on the Hadoop platform. To address this limitation of the traditional Apriori algorithm, the paper proposes using the Spark platform, which is based on in-memory computing and resilient distributed datasets (RDDs), to recommend books to readers and guide their borrowing behavior.
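The cost the abstract attributes to Apriori (one full pass over the data per itemset size, plus a growing candidate set) can be seen in a minimal sketch. This is plain single-machine Python, not the paper's Spark implementation, which this record does not include; the function name `apriori`, the `min_support` parameter (an absolute count), and the sample loan records in the usage note are all assumptions made for illustration.

```python
from itertools import combinations


def apriori(transactions, min_support):
    """Return {frozenset(itemset): support_count} for all frequent itemsets.

    Illustrative sketch only: `min_support` is an absolute count (assumption).
    """
    transactions = [frozenset(t) for t in transactions]

    # Scan 1: count how many transactions contain each single item.
    counts = {}
    for t in transactions:
        for item in t:
            key = frozenset([item])
            counts[key] = counts.get(key, 0) + 1
    frequent = {s: c for s, c in counts.items() if c >= min_support}
    result = dict(frequent)

    k = 2
    while frequent:
        # Join step: merge frequent (k-1)-itemsets into k-item candidates.
        candidates = set()
        for a, b in combinations(list(frequent), 2):
            union = a | b
            if len(union) != k:
                continue
            # Prune step: every (k-1)-subset of a candidate must be frequent.
            if all(frozenset(sub) in frequent
                   for sub in combinations(union, k - 1)):
                candidates.add(union)

        # Scan k: another full pass over the data to count the candidates.
        counts = {c: 0 for c in candidates}
        for t in transactions:
            for c in candidates:
                if c <= t:
                    counts[c] += 1
        frequent = {s: c for s, c in counts.items() if c >= min_support}
        result.update(frequent)
        k += 1

    return result
```

Called on a handful of hypothetical loan records, for example `apriori([{"A", "B", "C"}, {"A", "B"}, {"A", "C"}], min_support=2)`, the function rereads every record once per level. On a platform that rereads input from disk at each pass this repeated scan is the bottleneck the abstract describes; keeping the records cached in memory between passes, as Spark's RDDs allow, is the remedy the paper proposes.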

Authors: GAO Qijuan¹, LIU Kai², CHEN Jia³ (1. School of Information and Computer Science, Anhui Agricultural University, Hefei 230036; 2. Modern Educational Technology Center of Anhui Agricultural University, Hefei 230036; 3. High-standard Clustering Service Center Department of China Telecom Co., Ltd., Wuhu 24100)

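Affiliations: School of Information and Computer Science, Anhui Agricultural University; Modern Educational Information Center, Anhui Agricultural University; High-End Clustering Service Center, China Telecom Wuhu Branch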

Source: Journal of Anhui Agricultural University (CAS, CSCD), 2018, No. 4, pp. 768-771 (4 pages)

Keywords: Apriori; association rules; Spark platform; book borrowing behavior model; frequent itemsets

Classification code: TP391.3 [Automation and Computer Technology: Computer Application Technology]
