Abstract
[Purpose/Significance] This paper examines whether research topics differ across three content levels of a paper: the title and abstract, the citation content, and the full text. It asks whether the topics in the title and abstract can reveal the research content of the full text, and what role citation content plays for the citing paper, in order to provide theoretical support for analyzing full-text research content from titles and abstracts. [Method/Process] An empirical study is conducted on Chinese journal papers in the COVID-19 field. Feature words are extracted from the titles and abstracts, the citation content, and the full text; the feature words are clustered with a clustering algorithm; research topics are then identified by manual interpretation and compared across the three levels. [Results/Conclusions] Research topics differ across the title and abstract, the citation content, and the full text. The full text contains more topic content than the title and abstract, but the difference between the two is small, so the topics in the title and abstract can be used to represent the research content of the full text. Citation content is topically related to the citing paper, and the two can complement each other.
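The comparison step described in the abstract can be illustrated with a minimal sketch. It is not the authors' method: the study clusters feature words and identifies topics by manual interpretation, whereas this sketch only measures set overlap (Jaccard similarity) between feature-word sets. The frequency-based extractor, the stopword list, and the toy data are all assumptions made for illustration.

```python
from collections import Counter
from itertools import combinations

# Toy stopword list; an assumption, not taken from the paper.
STOPWORDS = {"the", "of", "and", "in", "a", "to", "for"}


def feature_words(texts, k=5):
    """Return the k most frequent non-stopword tokens as a set (toy extractor)."""
    counts = Counter(
        tok
        for text in texts
        for tok in text.lower().split()
        if tok not in STOPWORDS
    )
    return {word for word, _ in counts.most_common(k)}


def jaccard(a, b):
    """Jaccard similarity of two sets; 0.0 when both are empty."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def compare_levels(levels):
    """Pairwise Jaccard similarity between the feature-word sets of each level."""
    return {
        (x, y): jaccard(levels[x], levels[y])
        for x, y in combinations(sorted(levels), 2)
    }


# Hypothetical feature-word sets for the three content levels.
levels = {
    "title_abstract": {"covid-19", "vaccine", "transmission"},
    "citation": {"covid-19", "sars", "transmission"},
    "full_text": {"covid-19", "vaccine", "transmission", "policy"},
}
scores = compare_levels(levels)
for pair, score in scores.items():
    print(pair, round(score, 2))
```

In this toy data the full text shares most of its feature words with the title and abstract, which mirrors the kind of comparison the abstract reports, though the numbers here carry no empirical meaning.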
Authors
ZHAO Lei (赵磊)
ZHANG Chengzhi (章成志)
Department of Information Management, School of Economics & Management, Nanjing University of Science & Technology, Nanjing 210094
Source
Journal of Library and Information Science in Agriculture (《农业图书情报学报》)
2021, No. 5, pp. 14-27 (14 pages)
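Funding
Key Project of the Jiangsu Provincial Social Science Fund, "Research on Intelligence-Driven Fine-Grained Scholar Profile Construction" (20TQA001).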
Keywords
COVID-19
feature word extraction
word clustering
topic analysis
topic model