摘要
目前Web挖掘的数据源来自于服务器端,包括:日志、内容、结构数据,由此相应有Web使用挖掘,Web内容挖掘,Web结构挖掘;基于当前Web挖掘的不足本文在数据源问题上给出了自己的看法,提出了在客户端进行数据收集的观点,并采用在页面中嵌入追踪功能的方法来实现。文中首先描述了当前基于服务器端数据的Web挖掘的现状和不足,并给出了在客户端收集浏览行为的解决方法;然后介绍了当前的几种可以用于客户端追踪技术并比较其优缺点,最后,采用嵌入脚本的方法实现客户端数据的收集和发送,最为合适当前的互联网环境。
the data of web mining is from the server-side presently, it includes: log, content, structure, and come out Web usage mining, Web content mining, Web structures mining; Facing origin of the data, personal opinion is given that collecting data in client-side based in current fault of Web mining , and use the embedded technology of in web page to tracking. First, web mining based in Server-side data is depicted in current and shortage, Second, resolution is put forward that collecting data in client-side; Third, technology of client-side tracking is introduced currently, and compared with each other in advantage and disadvantage, Finally, the most appropriate method for internet is the embedded technology of in web page to tracking.
出处
《微计算机信息》
北大核心
2007年第33期270-272,共3页
Control & Automation
基金
广西教育厅科研项目[桂教科研[2003]22号]