期刊文献+

基于NLP和统计方法的唐代不同流派诗歌风格特征分析

Analysis of the Stylistic Characteristics of Different Schools of the Tang Dynasty Poetry Based on NLP and Statistical Methods
下载PDF
导出
摘要 唐诗是我国的文化瑰宝,数量大,风格和主题多样。为了对不同流派唐诗的特征进行讨论,研究基于自然语言文字处理,利用k-means++聚类分析、重复测量方差分析和配对样本t检验等统计方法对山水田园诗派、边塞诗派、浪漫诗派、现实诗派和咏史诗派五大流派诗歌的风格特征和差异进行了分析。结果表明:唐诗本身具有较强的共性,如“千里”“万里”“何处”等关键词的TF-IDF值在不同流派中均比较靠前;在不同流派的诗歌特征上,浪漫诗派和现实诗派、边塞诗派与咏史诗派、浪漫诗派与咏史诗派在关键词的使用上相似度较高,对不同关键词的TF-IDF值进行配对样本t检验后的p值分别为0.973、0.383、0.052;其余流派间的差异较大。 Tang poetry is the cultural treasure of our country, with a large quantity, diverse styles and themes. In order to discuss the characteristics of Tang poems of different schools, based on natural language word processing, this study analyzed the stylistic characteristics and differences of the poems of five major schools, namely, landscape pastoral poetry, frontier poetry, romantic poetry, realistic poetry and epic poetry, using statistical methods such as k-means + + cluster analysis, repeated measurement variance analysis and paired sample t test. The results show that Tang poetry itself has strong commonality, such as “thousands of miles”, “ten thousands of miles”, “where” and other keywords TF-IDF values are relatively high in different schools. In terms of poetry characteristics of different schools, romantic poetry and realistic poetry, frontier poetry and epic poetry, romantic poetry and epic poetry have high similarities in the use of keywords, and the p -values of TF-IDF values of different keywords after paired sample t -test are 0.973, 0.383 and 0.052, respectively. There are significant differences between the other genres.
作者 李梦巧 沈凡起 马明 张朝元 LI Mengqiao;SHEN Fanqi;MA Ming;ZHANG Chaoyuan(College of Mathematics and Computer,Dali University,Dali 671003,Yunnan,China;College of Teacher Education,Dali University,Dali 671003,Yunnan,China)
出处 《昆明冶金高等专科学校学报》 CAS 2023年第6期46-54,共9页 Journal of Kunming Metallurgy College
基金 大理大学第八期教育教学改革研究项目“新工科背景下《高等数学》课程教学改革研究与实践”(2022JGY08-99) 2022年度云南省研究生导师团队建设项目“学科教学(数学)研究生导师团队”(108)。
关键词 NLP k-means++聚类分析 方差分析 唐代诗歌 特征分析 NLP k-means++clustering analysis variance analysis the Tang Dynasty poetry feature analysis.
  • 相关文献

参考文献14

二级参考文献48

共引文献147

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部