Application of Thesaurus Management System for Searching CERN's Ecosystem Data Resources (叙词表及其管理系统在CERN生态学数据资源检索中的应用)
Abstract: To make better use of the CERN data management and information sharing platform in providing CERN ecological data resource services to researchers, CERN needs to keep improving the platform, including the efficiency and reliability with which users search CERN data resources. This paper analyzes the advantages and disadvantages of three retrieval approaches (navigational browsing, topic-based search, and keyword search) and focuses on how introducing a thesaurus into keyword search can improve the recall, precision, and response speed of retrieval. It introduces the concept of a thesaurus and the method used to construct the CERN ecological thesaurus, and describes how the open-source thesaurus management system TemaTres was localized into Chinese. The localization covers four functions: browsing the terms of the CERN thesaurus, expanding search keywords according to the semantic relations between terms in the thesaurus database, auto-completing keywords as users type, and searching the CERN ecological data resource metadata with the expanded keywords. Building and operating the Chinese version of TemaTres has made the compilation of keywords in CERN ecological metadata more controlled and standardized. It has also introduced some simple semantic relations between keywords into CERN metadata retrieval, namely hierarchical relations ('Broader and Narrower'), equivalence relations ('Used For', i.e. synonyms), and associative relations ('Related'). The results show that the efficiency and reliability of searching CERN metadata have improved, and the thesaurus lays a good foundation for the next step of building an ecological ontology.
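The keyword expansion and auto-completion described in the abstract can be illustrated with a minimal sketch. It is not the TemaTres implementation and does not use the TemaTres API: the `Thesaurus` class, the `search` helper, the demo terms, and the record identifiers are all hypothetical, chosen only to show how hierarchical, equivalence, and associative relations might widen a keyword query over metadata records.

```python
from collections import defaultdict


class Thesaurus:
    """Tiny in-memory thesaurus (hypothetical; not the TemaTres data model)."""

    def __init__(self):
        self.terms = set()                  # every term known to the thesaurus
        self.narrower = defaultdict(set)    # broader term -> narrower terms
        self.related = defaultdict(set)     # term -> associated terms (symmetric)
        self.synonyms = defaultdict(set)    # preferred term -> non-preferred terms
        self.preferred = {}                 # non-preferred term -> preferred term

    def add_hierarchy(self, broader, narrower):
        self.terms.update((broader, narrower))
        self.narrower[broader].add(narrower)

    def add_synonym(self, preferred, alternative):
        self.terms.update((preferred, alternative))
        self.synonyms[preferred].add(alternative)
        self.preferred[alternative] = preferred

    def add_related(self, a, b):
        self.terms.update((a, b))
        self.related[a].add(b)
        self.related[b].add(a)

    def expand(self, keyword):
        """Return the keyword's preferred term plus its directly linked terms."""
        term = self.preferred.get(keyword, keyword)
        expanded = {term}
        for relation in (self.synonyms, self.narrower, self.related):
            expanded |= relation.get(term, set())
        return expanded

    def complete(self, prefix):
        """Suggest known terms that start with what the user has typed."""
        return sorted(t for t in self.terms if t.startswith(prefix))


def search(records, thesaurus, keyword):
    """Return ids of records whose keywords overlap the expanded keyword set."""
    wanted = thesaurus.expand(keyword)
    return [r["id"] for r in records if wanted & set(r["keywords"])]


def build_demo_thesaurus():
    # Invented example terms, not taken from the CERN ecological thesaurus.
    th = Thesaurus()
    th.add_synonym("soil moisture", "soil water content")
    th.add_hierarchy("soil moisture", "volumetric soil moisture")
    th.add_related("soil moisture", "precipitation")
    return th


# Invented metadata records for illustration only.
DEMO_RECORDS = [
    {"id": "ds-1", "keywords": ["volumetric soil moisture"]},
    {"id": "ds-2", "keywords": ["precipitation"]},
    {"id": "ds-3", "keywords": ["leaf area index"]},
]
```

With this demo data, `search(DEMO_RECORDS, build_demo_thesaurus(), "soil water content")` matches records tagged with a narrower or related term even though none of them contains the typed keyword. Including related terms in the expansion tends to raise recall at some cost in precision, so a real system might expose that relation as an option.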
Source: E-science Technology & Application (《科研信息化技术与应用》), 2013, No. 2, pp. 59-66 (8 pages)
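Funding: Ministry of Environmental Protection of China, project "National Remote Sensing Survey and Assessment of Ten-Year (2000-2010) Changes in the Ecological Environment" (STSN-02-04)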
Keywords: thesaurus; data resource retrieval; thesaurus management system; 'Broader and Narrower' (hierarchical) relation; 'Used For' (equivalence) relation; 'Related' (associative) relation