摘要
近年社交网络用户数量不断增加,基于文本的用户情感分析技术得到普遍关注和应用。但数据稀疏性、精度较低等问题往往会降低情感识别方法的精度和速度,提出了用户情感Biterm主题模型(US-BTM),从特定场所的文本中发现用户偏好及情感倾向,有效利用Biterm进行主题建模,并使用聚合策略形成伪文档,为整个文本集创建词汇配对以解决数据稀疏性和短文本等问题。通过词汇共现算法对主题进行研究,推断文本集级别信息的主题,并通过分析特定场景下的评论文本集中的词汇配对集及其相应主题的情感,达到准确预测用户对特定场景的兴趣、偏好和情感的目的。结果证明,所提方法能准确地捕捉用户的情感倾向,正确地揭示用户偏好,可广泛应用于社交网络的内容描述、推荐及社交网络用户兴趣描述、语义分析等多个领域。
With the increasing number of social network users in recent years,text-based user sentiment analysis technology has been widely concerned and applied.However,data sparsity and low accuracy often reduce the accuracy and speed of emotion recognition methods.The user emotion Biterm topic model(US-BTM)was proposed which could find user preference and emotional tendency from the text of specific places,so as to effectively use Biterm for topic modeling.The strategy of user aggregation to form pseudo-documents was used,and word pairs were created for the whole corpus to solve the problems of data sparsity and short text.Then the topic was studied through the lexical co-occurrence model,so as to infer the topic with abundant corps-level information,and the purpose of accurately predicting the user’s interest,preference and emotion to the specific scene was achieved by analyzing the lexical matching set in the comment corpus under the specific scene and the emotion of the corresponding topic.Theexperimental results show that the method proposed can accurately capture users’emotional tendency and correctly reveal users’preference,which can be widely used in social network content description,recommendation,social network user interest description,semantic analysis and other fields.
作者
顾秋阳
吴宝
琚春华
GU Qiuyang;WU Bao;JU Chunhua(School of Management,Zhejiang University of Technology,Hangzhou 310023,China;China Institute for Small and Medium Enterprises,Zhejiang University of Technology,Hangzhou 310023,China;School of Management Science&Engineering,Zhejiang Gongshang University,Hangzhou 310018,China)
出处
《电信科学》
2020年第11期47-60,共14页
Telecommunications Science