摘要
本文介绍近期完成的国家自然科学基金项目<藏缅语语料库及比较研究的计量描写>的软件系统。该系统建立了我国境内藏缅语族五大语支82个语言点16万词条的开放性词汇语音数据库。研制了语言特征统计,语言比较研究软件。设计了应用于多种语言谱系分类比较研究的语音对应关系“全方位交叉”算法。对藏语方言的音节、音位、声母、韵母、声词、词素、构词能力和语音结构等10余项特征做了分布和对比统计。对藏语15个方言点做了语音对应关系和音系对比关系的量化描述,并在此基础上做出具有历时与共时比较研究意义的R相关和Φ相关分析,得出了语言分类的相关矩阵和聚类分析图表。
This paper makes an introduction to the statistical software of the project' the Tibeto-Burman Corpus and the Quantitative Description of the Comparative Studies', Sponsored by the National Natural Sciences Foundation of China. This project establis hes an overt lexical and Phonetic corpus with 160, 000 vocabulary entries of 82 different languages or dialects bclonging to the five branches of the Tibeto--Burman Group wi thin China, develops a package for the statistics of linguistic features and comparative linguistic studies, and designs an' all--bearing cross' algorthm for Phonological correspondan ce which is applicable to multilingual genetic clasaification and comparative studies. It sets a distributional and contrastivc stahstics of more than 10 features of Tibetan dialects , e. g. their syllables, phonemes, innals, fmals, tones, morphemes, word formations and phone mic structures, etc., quantifics their phonological correspondances and contrasts. Further more, it makes a R correlation and. correlation analysis significant of diachronic and syn chronic comparative studies and compiles an analytical chart of correlativc matrix and cluster of linguistric classification.
出处
《中文信息学报》
CSCD
1996年第2期23-31,共9页
Journal of Chinese Information Processing