摘要
提出了将医学叙词表MeSH词汇加入到通用分词表中进行分词,并利用MeSH词汇结合词长、词语所在位置加权实现医学新闻网页的关键词自动提取策略。作者随机选取了10家网站100篇医学新闻进行人工关键词标引,并采用机器标引与人工标引比照的方式进行验证的结果表明,关键词抽取精度达0.34,召回率达0.30,实验证明该策略可行。
The strategies for automatic extraction of key words from medical news were put forward by adding the MeSH terms into the general classification table in combination with the length of MeSH terms and location-weigh-ted MeSH terms.The key words randomly selected from 100 papers reporting medical news on 10 Websites were in-dexed and verified by machine indexing.The extraction accuracy was 0.34 and the recall rate was 0.30, showing that the strategies can be used for automatic extraction of key words from medical news.
出处
《中华医学图书情报杂志》
CAS
2014年第4期13-17,共5页
Chinese Journal of Medical Library and Information Science
基金
中国人民解放军总后勤部"全军医学信息资源共建共享服务体系建设"(司训[2011]116号)项目成果之一