摘要
[目的/意义]探究国内信息行为的发展现状以及研究的主题热点,对未来的信息行为发展趋势提出猜想和判断。[方法/过程]运用LDA主题模型,使用gensim库对语料进行预处理,通过词袋模型提取文本特征和TF-IDF计算形成向量化的词篇矩阵。通过计算主题一致性判断最优主题数目和主题强度,预测未来信息行为的研究导向。[结果/结论]模型试验得到五个信息行为研究领域的热点主题:信息传播,信息分享,信息安全,信息偶遇,信息搜寻。通过分析和判断确定了信息行为研究现阶段的发展空缺和未来的发展方向。
[Purpose/significance]The paper explores the development status and hot topics of research of information behavior in China and puts forward conjectures and judgments on the future development trend of information behavior.[Method/process]This paper uses LDA topic model and gensim database to preprocess the corpus extracts text features by the word bag model and forms a vectorized text matrix through TF-IDF calculation.By calculating topic consistency it can judge the optimal number and intensity of topics and predict the research orientation of future information behavior.[Result/conclusion]The model test obtains five hot topics in the field of information behavior research:information dissemination information sharing information security information encounter and information search.Through analysis and judgment it determines the development vacancy and future development direction of information behavior research at the current stage.
作者
孙正轩
马海群
Sun Zhengxuan;Ma Haiqun(School of Information Management Heilongjiang University Harbin,Heilongjiang 150000)
出处
《情报探索》
2023年第11期35-43,共9页
Information Research
关键词
LDA主题模型
信息行为
研究热点
LDA topic model
information behavior
research hotspot