摘要
介绍Web日志挖掘的一般性过程,并重点对Web日志挖掘预处理的Web会话划分进行研究。介绍云计算的概念与优点,针对目前Web日志挖掘的瓶颈以及数据的存储与共享的难点,提出基于MapReduce的Web日志挖掘预处理,能较好地解决Web日志挖掘当前所面临的效率问题,更好地整合计算机资源,减少不必要的资源浪费。
This article describes the general process of Web log mining and focuses on the study of Web session division in the preprocessing of Web log mining.The article introduces the concept and advantages of cloud computing.For the bottleneck analysis of Web log mining,as well as the difficulties of data storage and sharing,we proposed MapReduce-based Web log mining preprocessing,which can better solve the efficiency issues currently faced in Web log mining and better integrate the computer resources to reduce unnecessary waste.
出处
《计算机与现代化》
2013年第9期35-37,41,共4页
Computer and Modernization