摘要
通过词汇在文献里共现特征分析,可以为人工确定词间关系起到指引和减轻工作量的作用。文章具体使用水利水电领域专业词汇,通过在重庆维普核心科技期刊数据库中的共现频次和共现率的统计分析,以"水电站"与其他高频词组合检索,统计词频、共现频次以及共现率,结果认为,词频、共现频次、共现率等信息对人工确定词间关系具有指导意义,并且讨论了可能存在的问题及解决办法。
The co-occurrence information of keywords in document is useful for the artificial identification of conceptual relation among terms. This paper selects 291 keywords in the field of water conservancy and hydroelectric power as a sample, which is collected from the Chinese core sci-teeh journal full-text database ( Chongqing VIP information, Ltd. ) , to analyze the frequency and ratio of key- word co - occurrence. In the specific example of keyword "hydropower station", the word frequency, the frequency and ratio of co-oc- currence with other keywords are calculated. The result indicates that these indicators are of guides for the artificial identification of conceptual relation among terms. The potential issues in this method, such as word segmentation, are discussed too.
出处
《图书情报工作》
CSSCI
北大核心
2009年第8期17-20,共4页
Library and Information Service
基金
国家自然科学基金资助项目"农业ontology的构建和转化研究"(项目编号:70573116)
中央级公益性科研院所基本科研业务费专项资金重点项目"汉语科技词系统建设与应用工程"(项目编号:2008KP01-3-2)研究成果之一
关键词
词间关系
词频
共现率
concept relations word frequency keywords co-ocurrence frequency