Research on Integrating Hadoop with Cassandra in Processing Massive Data (Hadoop整合Cassandra处理海量数据)
Cited by: 1
Abstract: Hadoop, an open-source distributed computing framework from the Apache organization, computes over and processes massive data efficiently and can cope with tens of millions of concurrent requests from the Internet, but it does not support real-time reading, writing, or modification of data. Cassandra is a powerful column-oriented key-value distributed database system with good real-time read/write performance and scalability, but it lacks the ability to analyze and compute over massive data. Combining Hadoop with Cassandra lets each offset the other's weakness and provides an efficient, practical solution for implementing cloud computing models. This paper first explains why integrating Hadoop with Cassandra is necessary for processing massive data, then presents a concrete integration scheme and its implementation, and finally summarizes the main problems encountered during the integration.
Authors: SU Xiang-yu (苏翔宇), ZHU Ai-qun (朱爱群)
Affiliation: Shenzhen Institute of Technology (深圳技师学院), Shenzhen 518045, China
Source: Computer Knowledge and Technology (《电脑知识与技术》), 2013, No. 3, pp. 1491-1493 (3 pages)
Keywords: key-value; cloud computing; clusters