期刊文献+

基于MapReduce的Web日志挖掘预处理 被引量:2

MapReduce-based Web Log Mining Preprocessing
下载PDF
导出
摘要 介绍Web日志挖掘的一般性过程,并重点对Web日志挖掘预处理的Web会话划分进行研究。介绍云计算的概念与优点,针对目前Web日志挖掘的瓶颈以及数据的存储与共享的难点,提出基于MapReduce的Web日志挖掘预处理,能较好地解决Web日志挖掘当前所面临的效率问题,更好地整合计算机资源,减少不必要的资源浪费。 This article describes the general process of Web log mining and focuses on the study of Web session division in the preprocessing of Web log mining.The article introduces the concept and advantages of cloud computing.For the bottleneck analysis of Web log mining,as well as the difficulties of data storage and sharing,we proposed MapReduce-based Web log mining preprocessing,which can better solve the efficiency issues currently faced in Web log mining and better integrate the computer resources to reduce unnecessary waste.
出处 《计算机与现代化》 2013年第9期35-37,41,共4页 Computer and Modernization
关键词 WEB日志挖掘 MAPREDUCE 会话划分 Web log mining MapReduce session division
  • 相关文献

参考文献14

二级参考文献42

  • 1王涛伟,周必水.基于DHP的频繁遍历路径挖掘算法[J].杭州电子科技大学学报(自然科学版),2005,25(5):60-63. 被引量:5
  • 2张波,巫莉莉,周敏.基于Web使用挖掘的用户行为分析[J].计算机科学,2006,33(8):213-214. 被引量:27
  • 3余慧佳,刘奕群,张敏,茹立云,马少平.基于大规模日志分析的搜索引擎用户行为分析[J].中文信息学报,2007,21(1):109-114. 被引量:117
  • 4卢云彬,曹汉强.基于Hash表的关联规则挖掘算法的改进[J].计算机技术与发展,2007,17(6):12-14. 被引量:10
  • 5Bin Tan, Fuchun Peng. Unsupervised query segmentation using generative language models and Wikipedia[C]//Proceeding of the 17th international conference on World Wide Web. Beijing, China, 2008:347-356. 被引量:1
  • 6Craig Silverstein, Monika Henzinger, Hannes Marais, et al. Analysis of a very large Web search engine query log[J]. In SIGIR Forum, fall 1998, 33(1):6-12. 被引量:1
  • 7Daqing He, Ays, e Goker. Detecting session boundaries from Web user logs[C]//Proceedings of the 22nd annual colloquium on information, 2000. 被引量:1
  • 8H. Cenk Ozmutlu , Fatih cavdur, Application of automatic topic identification on excite web search engine data logs.[J]Information Processing and Management: an International Journal, 2005, 41(5) : 1243-1262. 被引量:1
  • 9Jing Bai, Jian-Yun Nie, Guihong Cao, Hugues Bouchard. Using query contexts in information retrieval[J]. SIGIR'07, July 23-27, 2007. 被引量:1
  • 10Jinhui Yuan, Huiyi Wang, Lan Xiao, Wujie Zheng, Jianmin Li, Fuzong Lin, and Bo Zhang. A Formal Study of Shot Boundary Detection. [C]//IEEE transactions on circuits and systems for video technology, VOL. 17, NO. 2, pp. 168-186. February 2007. 被引量:1

共引文献40

同被引文献14

引证文献2

二级引证文献9

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部