摘要
以从新浪微博平台海量信息中挖掘知识为目的,通过获取新浪微博开放平台授权并认证,调用新浪微博API(Application Programming Interface)相应的函数接口,获取单用户基本信息及所发微博信息,应用多用户遍历思想及迭代算法,获取大量用户基本信息及微博信息,并存储到数据库中,利用数据挖掘关联规则算法进行话题分析,并将分析结果通过可视化的方式展现,最终实现话题的关注度分析、话题间关联程度分析以及话题关注人群的特征分析。
With the purpose of mining knowledge from mass information of Sina micro-blog platform,authorization and authentication from Sina micro-blog open platform is obtained, corresponding function interfaces of Sina micro-blog API( Application Programming Interface) is ultimately realized, the basic information of individual user and the micro-blogs is got,multi user traversal thought and iterative algorithm is applied,the basic information of a large number of users and micro-blogs is got,the information to the database is stored,the topic by using the data mining algorithm of association rule is analyzed,the analysis results is applied in a visual way,analysis of the topic of attention and analysis of the correlation between topics and analysis of the characteristics of topic concerned crowds is ultimately realized.
出处
《山东交通学院学报》
CAS
2015年第4期78-86,共9页
Journal of Shandong Jiaotong University