期刊文献+

基于Flume和HDFS的大数据采集系统的研究与实现 被引量:7

Research and implementation of big data acquisition system based on Flume and HDFS
下载PDF
导出
摘要 在充分研究大数据采集、大数据存储、HDFS和Flume基础上,综合分析并利用相关领域知识,给出了一种基于Flume和HDFS相结合的大数据采集系统BDAS的概念模型和体系结构.并根据BDAS的体系结构,可以明确实现一种大数据采集的具体工作,即:Flume Agent的配置.根据体系结构,给出一个实现Web Server日志采集的具体实现方法和步骤. BDAS概念模型和体系结构在大数据分析和研究领域具有重要的理论意义和实际意义,也为大数据领域的研究提供了一种通用的大数据获取手段. On the basis of fully studying big data collection,big data storage,HDFS and Flume,comprehensively analyzing and utilizing the relevant domain knowledge,a conceptual model and architecture were presented based on Flume and HDFS combined with big data acquisition system BDAS.According to the architecture of BDAS,the specific work of a big data collection can be clearly realized,namely,the configuration of the Flume Agent.According to the architecture,a specific implementation method and steps to achieve Web Server log collection were offered.The BDAS conceptual model and architecture have important theoretical and practical significance in the field of big data analysis and research,providing a universal means of big data acquisition for the research of big data.
作者 方中纯 赵江鹏 FANG Zhong-chun;ZHAO Jiang-peng(Engineering and Training Center,Inner Mongolia University of Science and Technology,Baotou 014010,China;Information Engineering School,Inner Mongolia University of Science and Technology,Baotou 014010,China)
出处 《内蒙古科技大学学报》 CAS 2018年第3期255-259,共5页 Journal of Inner Mongolia University of Science and Technology
基金 国家自然科学基金资助项目(61462069) 内蒙古自然科学基金资助项目(2017MS0604 2017MS(LH)0603) 内蒙古科技大学教改重点资助项目(JY2016003)
关键词 HDFS FLUME 大数据采集系统 WEB SERVER BDAS HDFS Flume Big Data Acquisition Web Server BDAS
  • 相关文献

参考文献4

二级参考文献28

  • 1梅立军,周强,臧路,陈祖舜.知网与同义词词林的信息融合研究[J].中文信息学报,2005,19(1):63-70. 被引量:28
  • 2王春梅.基于数据仓库的数据挖掘技术[J].西安邮电学院学报,2006,11(5):99-102. 被引量:6
  • 3董振东,董强,郝长伶.知网的理论发现[J].中文信息学报,2007,21(4):3-9. 被引量:98
  • 4Aranya A,Wright C P,Zadok E.Tracefs:A file system to trace them all[C]//Proceedings of the 3rd USENIX Conference on File and Storage Technologies,2004.San Francisco,CA:USENIX Association,2004. 被引量:1
  • 5Ramakrishnan K K,Biswas P,Karedla R-Analysis of file I/O trac-es in commercial computing environments[M].New York,USA:ACM,1992. 被引量:1
  • 6Ousterhout J K.A trace-driven analysis of the UNIX 4.2 BSD file system[C]//Proceedings of the 10th ACM Symposium on Operating Systems Principles,Orcas Island,WA,1985. 被引量:1
  • 7Zhu N,Chen J,Chiueh T.TBBT:Scalable and accurate trace re-play for file server evaluation[C]//Proceedings of the 4th USE-NIX Conference on File and Storage Technologies,Volume 4,2005.San Francisco,CA:USENIX Association,2005. 被引量:1
  • 8Mesnier M P.Trace:Parallel trace replay with approximate caus-al events[C]//Proceedings of the 5th USENIX Conference on File and Storage Technologies,2007.San Jose,CA:USENIX As-sociation,2007. 被引量:1
  • 9Engel S J, Gilmartin B J, Bongort K, et al. Prognostics, the real issues involved with predicting life remaining[C]/// Proceedings of the IEEE Aerospace. Big Sky: IEEE, 2000 : 457 - 469. 被引量:1
  • 10Mathur A. Data Mining of aviation data for advancing health management [ C]// Proceedings Component and Systems Diagnostics, Prognostics, and Health Management. Orlando: SPIE, 2002:61 -71. 被引量:1

共引文献752

同被引文献47

引证文献7

二级引证文献15

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部