Abstract
Based on the basic idea of the recursive divide-and-conquer strategy, this paper builds a new sentiment analysis model and explains its rationale. The paper first analyzes the strengths and weaknesses of resource-based and statistical methods. Resource-based sentiment orientation analysis benefits from an accurate sentiment lexicon but suffers from poor coverage; statistical methods show the opposite trade-off. The paper then proposes a method that combines rules and statistics to analyze the sentiment orientation of text, applies this combined method to the model, and verifies its effectiveness. Experiments show that the method reaches an accuracy of 77.68% on imbalanced data.
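The trade-off described in the abstract (an accurate but incomplete lexicon versus a broad but less precise statistical model) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the toy `LEXICON`, the training sentences, and the choice of a small Naive Bayes classifier as the statistical fallback are hypothetical and are not taken from the paper, which does not specify its rules or its statistical model in this record.

```python
import math
from collections import Counter

# Hypothetical toy sentiment lexicon (not from the paper): +1 positive, -1 negative.
LEXICON = {"good": 1, "excellent": 1, "bad": -1, "terrible": -1}


def lexicon_score(tokens):
    """Return (sum of polarities, number of lexicon hits) for a token list."""
    hits = [LEXICON[t] for t in tokens if t in LEXICON]
    return sum(hits), len(hits)


class NaiveBayes:
    """Tiny multinomial Naive Bayes with add-one smoothing (assumed fallback model)."""

    def __init__(self):
        self.word_counts = {1: Counter(), -1: Counter()}
        self.doc_counts = Counter()
        self.vocab = set()

    def fit(self, docs, labels):
        for tokens, label in zip(docs, labels):
            self.doc_counts[label] += 1
            self.word_counts[label].update(tokens)
            self.vocab.update(tokens)
        return self

    def predict(self, tokens):
        total_docs = sum(self.doc_counts.values())
        best_label, best_logp = None, -math.inf
        for label in (1, -1):
            # Smoothed class prior, so an unseen class never yields log(0).
            logp = math.log((self.doc_counts[label] + 1) / (total_docs + 2))
            denom = sum(self.word_counts[label].values()) + len(self.vocab)
            for t in tokens:
                logp += math.log((self.word_counts[label][t] + 1) / denom)
            if logp > best_logp:
                best_label, best_logp = label, logp
        return best_label


def classify(tokens, model):
    """Trust the lexicon when it gives a clear signal; otherwise fall back to statistics."""
    score, hits = lexicon_score(tokens)
    if hits > 0 and score != 0:
        return 1 if score > 0 else -1
    return model.predict(tokens)


# Hypothetical training data for the statistical fallback.
model = NaiveBayes().fit(
    [["fast", "delivery"], ["slow", "delivery"], ["fast", "service"], ["slow", "service"]],
    [1, -1, 1, -1],
)

print(classify(["excellent", "product"], model))  # decided by the lexicon
print(classify(["slow", "shipping"], model))      # no lexicon hit, decided by the model
```

Checking the lexicon first reflects the abstract's point that lexicon entries are accurate when they apply; the statistical model only handles the cases the lexicon cannot cover.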
Source
Computer Engineering & Science (《计算机工程与科学》)
CSCD
Peking University Core Journals (北大核心)
2011, No. 5, pp. 146-150 (5 pages)
Funding
National 863 Program of China (2007AA01Z198)
National Natural Science Foundation of China (60970083)
National Social Science Foundation of China (08CYY016)
Keywords
Chinese information processing
sentiment classification
collocation rules
decision list