摘要
收集在线教学平台上1306名小学生的作文、日记及评论,采用自然语言处理技术进行文本分析,并应用机器学习模型实现对羞怯特质的自动预测,构建小学生羞怯行为、认知和情绪的语言风格模型。研究发现:(1)扩充的心理词典适合分析小学生文本;(2)分别存在羞怯行为、认知和情绪问题的学生其日常用语既有共性也有特性,且与普通学生存在差异;(3)羞怯各维度在不同分类器上达到较好的预测效果,其中随机森林模型的整体表现相对最好。
The present study aimed to explore a new method of measuring shyness based on 1306 elementary school students’online writing texts.A supervised learning method was used to map students'labels(tagged by their results of scale)with their text features(extracted from online writing texts based on a psychological dictionary)to build a machine learning model.Key feature sets for different dimensions of shyness were built and a machine learning model was constructed based on the selected feature to achieve automatic prediction.The labels were obtained through“National School Children Shyness Scale”completed online by elementary students.The scale includes three dimensions of shyness:shy behavior,shy cognition and shy emotion.Students with Z-scores of each dimension over 1 were labeled as shy and others were labeled as normal.Students’online writing texts were collected from"TeachGrid"(https://www.jiaokee.com/),an online learning platform wherein students writing texts.The dictionary applied in the present study was Textmind,a widely used Chinese psychological dictionary developed based on Linguistic Inquiry and Word Count(LIWC).The dictionary was compiled mainly based on the corpus of adults.To ensure the validity of extracted features,we modified the original dictionary by expanding the categories and vocabulary with the real writing text of elementary students.The revised dictionary contained 118 categories.Features were extracted based on the revised dictionary.Chi-square algorithm was applied to identify the features that can distinguish between shy and normal groups to the greatest extent.Three sets of key features confirmed a significant lexical difference between shy and normal individuals.Among the selected features,some were shared by multiple dimensions reflecting the universal textual expression of shy individuals(e.g.,The average number of words per sentence and the frequency of social words of shy individuals were less than that of normal counterparts.),and there were certain features reflected the
作者
骆方
姜力铭
田雪涛
肖梦格
马彦珍
张生
LUO Fang;JIANG Liming;TIAN Xuetao;XIAO Mengge;MA Yanzhen;ZHANG Sheng(School of Psychology,Beijing Normal University,Beijing 100875,China;School of Computer and Information Technology,Beijing Jiaotong University,Beijing 100044,China;Collaborative Innovation Center of Assessment toward Basic Education Quality,Beijing Normal University,Beijing 100875,China)
出处
《心理学报》
CSSCI
CSCD
北大核心
2021年第2期155-169,共15页
Acta Psychologica Sinica
基金
国家自然科学基金联合基金项目(U1911201)。
关键词
羞怯
在线写作
心理词典
文本挖掘
语言风格模型
shyness
online writing
psychological dictionary
text mining
language style model