期刊文献+

云计算下的SPRINT并行算法研究 被引量:5

Research on SPRINT Algorithm in Cloud Computing
下载PDF
导出
摘要 目前,由于云计算的出现,越来越多的中小企业在分析海量数据时能便利地找到廉价的解决方案。本文,鉴于MapReduce作为Hadoop中的重要编程模型,在介绍基于云计算的Hadoop平台和数据挖掘技术中的SPRINT分类算法的基础上,详细描述SPRINT的并行算法在MapReduce编程模型上的执行流程,并利用研究出的决策树模型对输入数据进行分类。 At present, because of the presence of cloud computing, more and more small and medium sized enterprises can find low-cost solution easily when analyzing mass data. In this paper, whereas MapReduce being the important programming model of Hadoop, in the base of introducing the Hadoop platform and SPRINT algorithm of data mining, proposes the detailed procedure of SPRINT algorithm on MapReduce,and classifies the input data by the model of decision tree.
作者 张春艳
出处 《软件》 2010年第11期57-61,共5页 Software
关键词 云计算 HADOOP MAPREDUCE 数据挖掘 SPRINT Cloud computing Hadoop MapReduce data mining SPRINT
  • 相关文献

参考文献7

二级参考文献19

  • 1栾丽华,吉根林.决策树分类技术研究[J].计算机工程,2004,30(9):94-96. 被引量:116
  • 2Frawley W J, Piatetsky-Shapiro G, Matheus C J. Knowledge Discovery in Databases: an Overview [ C]//Knowledge Discovery in Databases. California: AAAI Press, 1992 : 57-70. 被引量:1
  • 3Cheeseman P, Stutz J. Bayesian Classification (Auto Class) : Theory and Results [ C ]//Advances in Knowledge Discovery and Data Mining. California: AAAI Press, 1996: 153-180. 被引量:1
  • 4Quinlan J R. Induction of Decision Trees [J]. Machine Learning, 1986, 1 (1) : 81-106. 被引量:1
  • 5Krose B, Van Der Smagt P. An Introduction to Neural Networks [ M]. 8th ed. Amsterdam: Faculty of Mathematics and Computer Science, 1996: 11-31. 被引量:1
  • 6Swiniarski R W. Rough Sets Methods in Feature Reduction and Classification [ J ]. Appl Math Comput Sci, 2001, 11(3) : 565-582. 被引量:1
  • 7PEI Min, Goodman E D, YING Ding, et al. Genetic Algorithms for Classification and Feature Extraction [ C ]// Proceeding of Classification Society of North America Annual Meeting. Denver: [ s. n. ], 1995: 1-28. 被引量:1
  • 8Rastogi R, Shim K. PUBLIC: a Decision Tree Classifier That Integrates Building and Pruning [ J ]. Data Mining and Knowlegde Discovery, 2000, 4(4): 315-344. 被引量:1
  • 9Shafer J, Agrawal R, Mehta M. SPRINT: a Scalable Parallel Classifier for Data Mining [ C ]//Proceedings of the 22nd VLDB Conference Mumbai(Bombay). Mumbai : Morgan Kaufmann, 1996 : 544-555. 被引量:1
  • 10HAN EH, SRIVASTAVA A, KUMAR V. Parallel formulation of inductive classification learning algorithm[ R]. Minneapolis, USA: University of Minnesota, 1996. 被引量:1

共引文献28

同被引文献36

引证文献5

二级引证文献53

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部