摘要
城市化进程中,新的地点不断出现且地点类型不断更新,导致大量未知地点产生,为城市形态的理解和掌控造成障碍。本文综合多种空间分析及文本挖掘技术,创新性地融合Twitter数据中的时间记录与Tweets(用户在Twitter中发表的文本内容)用于地点分类。设计抽取精细的人群活动的时空-内容信息的方法,并通过监督学习方法,利用少量标记样本,自动识别未知地点的类型。最终识别出教育、娱乐、商店、社会服务、交通五种类型的地点,整体精度达67.6%,表明方法的可行性,为社交数据在地点分类研究中的有效利用提供了新的思路。
In the process of urbanization,new locations constantly appear and the categories of locations are usually updated,resulting in great number of unknown locations and obstacles to the comprehension and grasp of urban functional structures.In this paper,various spatial analysis and text mining techniques are combined to integrate time records and Tweets(text content published by users in Twitter)in Twitter data for location classification innovatively.A method for extracting the detailed spatiotemporal-content information from crowds’activities is designed,and supervised learning techniques are utilized to automatically identify the type of unknown locations using a small number of labeled samples.At last,five types of locations including education,entertainment,shops,social services,and transportation are identified,with an overall accuracy of 67.6%,which shows the feasibility of this method and provides a novel idea for the effective application of social media data in location classification researches.
作者
邱小宇
林杰
Qiu Xiaoyu;Lin Jie(School of Earth Sciences,Zhejiang University,Hangzhou City,Zhejiang 310027,China)
出处
《科技通报》
2020年第4期67-71,共5页
Bulletin of Science and Technology
基金
国家自然科学基金项目(41501423)