期刊文献+

常见大数据处理框架比较研究 被引量:4

下载PDF
导出
摘要 本文主要对Hadoop、Spark两种大数据处理框架进行介绍,阐述各自的原理、生态组成及应用特点,并对两者进行了简单的比较.
作者 孙丽
出处 《电脑知识与技术》 2020年第12期3-5,共3页 Computer Knowledge and Technology
  • 相关文献

参考文献4

二级参考文献22

  • 1Wikipedia. Sina Weibo[EB/OL]. en. wikipedia, org/ wiki/Sina Weibo. 被引量:1
  • 2Andritsos P, Tsaparas P, Miller R J, et al. LIMBO: Scalable clustering of categorical data [C]//EDBT, 2004 .- 123-146. 被引量:1
  • 3Brin S, Davis J, Garcia-Molina H. Copy detection mechanisms for digital documents[C]//ACM SIGMOD Record. ACM, 1995,24(2) : 398-409. 被引量:1
  • 4Lyon C, Barrett R, Malcolm J. A theoretical basis to the automated detection of copying between texts, and its practical implementation in the Ferret plagiarism and collusion detector [D. Plagiarism: Prevention, Practice and Policies, 2004. 被引量:1
  • 5Lyon C, Barrett R, Malcolm J. Plagiarism is easy, but also easy to detect[M]. Ann Arbor, MI Scholarly Publishing Office, University of Michigan Library, 2006. 被引量:1
  • 6Shivakumar N, Garcia-Molina H. Finding near-replicas of documents on the web[M//The World Wide Web and Databases. Springer Berlin Heidelberg, 1999: 204- 212. 被引量:1
  • 7Broder A Z. Identifying and filtering near-duplicate documents [C]//Combinatorial pattern matching. Springer Berlin Heidelberg, 2000 : 1-10. 被引量:1
  • 8Manku G S, Jain A, Das Sarma A. Detecting near-du- plicates for web crawling[C//Proceedings of the 16th international conference on World Wide Web. ACM, 2007 = 141-150. 被引量:1
  • 9Henzinger M. Finding near-duplicate web pages., a large-scale evaluation of algorithms [C]//Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval. ACM, 2006 .- 284-291. 被引量:1
  • 10Gibson D, Kleinberg J, Raghavan P. Clustering cate- gorical data: An approach based on dynamical systems [J]. The VLDB Journal, 2000,8 (3-4) : 222-236. 被引量:1

共引文献4

同被引文献39

引证文献4

二级引证文献22

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部