摘要
确定一定数量的高频词是识别研究热点的基础性工作,但是目前对于如何确定高低词频的分界点还缺乏客观的、行之有效的方法。本研究以2002~2011年收录入Web of Science SCI中934篇科学计量相关文献为语料,分析了齐普夫定律中的常数变化规律,进而基于统计分析创建了一种确定语料中高低词频分界点的新方法。通过比较分析发现,相对于已有的方法,本方法在识别高频词方面具有数量和稳定性两方面的双重优势。应用该方法识别科学计量学的研究热点,发现10年来科学计量研究领域已形成一系列成熟、稳定的研究议题,如引文分析、期刊影响因子、产出评价等。同时这一领域也处于不断发展之中,引文分析方法的成熟和h指数等新型研究议题的兴起使这一领域的研究正在走向深化。
Identifying some high-frequency words is a basic work to detect the research focuses using bibliometrics method, but how to objectively and effectively set the dividing point between the high- and the low-frequency words is a question which still puzzles researchers. In this study, a total of 934 papers about scientometircs published during the year 2002 to 2011 were retrieved as the corpus. The constant in the Zipf's law was analysed using the corpus, and then a new method to identify the boundary between high- and low-frequency words in corpus based on ZipFs law was proposed. Compared with the other methods, the new method had the advantage both on quantity and stability in confirming the number of high-frequency words. Applying this new method to the corpus, we found that after ten years of development, the scienmetrics had formed some basic research issues, for example, the impact factors, citation analysis, research performance and so on, and the scientometrics was still in developing. Some new research issues, for example the cocitation analysis, the h-index and so on were leading the scientometrics to go deeper.
出处
《情报学报》
CSSCI
北大核心
2013年第11期1196-1203,共8页
Journal of the China Society for Scientific and Technical Information
关键词
齐普夫定律
科学计量
研究热点
高频词
低频词
分界点
Zipf's law, scientometrics, research focuses, high-frequency words, low-frequency words, dividing point