摘要
【目的】使用文本挖掘技术从西方媒体的新闻文本数据中提取中国的经济形象。【方法】基于人类的认知图式分析了形象的文字呈现方式,提出从主题、观点、倾向三个层次来提取国家形象,进而提出相应的文本挖掘方法和流程。【结果】从达沃斯论坛期间的西方媒体新闻中提取的中国经济形象可以概括为:充满活力、有巨大成就、为世界带来机遇和挑战、可能撼动世界格局的新兴发展中国家。【局限】主题模型使用人工解释,会带来个体差异。【结论】从主题、观点、倾向三个层次进行文本挖掘有利于把新闻数据和媒体形象联系起来,该方法对国家、地区、城市等媒体形象提取研究和实践也具有借鉴意义。
[Objective]This paper uses text mining techniques to extract China’s economic image from news published by western media.[Methods]First,we analyzed the representation of image by textual message based on the cognitive schema of human.Then,we extracted the image from topics,viewpoints and sentiment.Finally,we developed text mining process and methods to retrieve China’s image from Western reports.[Results]China’s economic image from news published by Western media covering Davos Forum was summarized as a developing country full of vitality,with great achievements,bringing opportunities and challenges to the world,and possibly affecting the world order.[Limitations]The human interpretation of LDA models inevitably leads to individual difference.[Conclusions]The proposed method could benefit research and practice on extracting image of a country,a region,or a city from news reports.
作者
许光
任明
宋城宇
Xu Guang;Ren Ming;Song Chengyu(School of Information Resource Management,Renmin University of China,Beijing 100872,China)
出处
《数据分析与知识发现》
CSSCI
CSCD
北大核心
2021年第5期30-40,共11页
Data Analysis and Knowledge Discovery
基金
国家自然科学基金项目(项目编号:71772177,72072177)的研究成果之一。