摘要
[目的 /意义]构建一个基于多语言本体的跨语言信息检索模型,有助于用户通过该模型使用自己熟悉的语言来获取不同语种的信息资源。[方法 /过程]通过本体设计及检索模型功能模块设计建立一个基于数字出版领域本体的中英跨语言信息检索模型,并利用Java语言及Lucene搜索引擎架构对该模型进行编程实现。[结果/结论]多语言领域本体具有明确、形式化、共享、概念化、结构清晰等特征,可以作为语义层应用于跨语言信息检索系统之中,实现信息资源的语义表达。经测试,本文构建的模型能够较好地实现分词、查询扩展和语义关联等功能,促进跨语言信息检索向语义层次发展。
[ Purpose/significance] Constructing a cross-language information retrieval model based on muhilingual ontology is beneficial for users to acquire information resources in different languages by using their familiar language. [ Method/process ] This paper proposes a cross-language information retrieval model based on digital publishing domain ontology by the design of ontology and function module, and finally implements the retrieval model by Java and Lueene. [ Result/conclusionl Multilingual ontology is explicit, formalized, sharable, conceptualized and has clear structure, which can be used as a semantic layer in cross-language information retrieval system and bring about the semantic expres- sion of information resources. After testing, the model constructed in this paper can achieve some functions like word seg- mentation, query expansion and semantic association, promoting cross-language information retrieval to the semantic level.
出处
《图书情报工作》
CSSCI
北大核心
2017年第1期100-108,共9页
Library and Information Service
基金
教育部人文社会科学重点研究基地重大项目“基于内容的多语言信息组织与检索研究”(项目编号:14JJD870001)研究成果之一
关键词
跨语言信息检索
多语言本体
检索模型
cross-language information retrieval muhilingual ontology retrieval model