摘要
现有的检索系统仅仅是对关键字进行简单的匹配,其检索结果往往不如人意,究其原因是在查询过程中丢失了关键词大量的知识和信息。在总结了Lucene检索系统的优缺点后,发现检索系统应该利用本体的特性对关键词进行扩展,结合用户关注程度等人为信息,在原有的排序计算方法上加以改进,从而得出了一种新的排序方法。试验证明,这种排序方法对查全率、查准率都有一定程度的提升。
In existing retrieval systems, only the simple keyword matching is used, but a lot of knowledge and information of the keywords is lost in the query processing. So the search results are often unsatisfactory. In this paper,the advantages and disadvantages of the Lucene retrieval system are summarized, presenting a new retrieval sort algorithm based on ontology-based keyword expansion and users concemed degree. Experimental results show that the retrieval sort algorithm has good improvement in recall and precision.
出处
《世界科技研究与发展》
CSCD
2013年第2期214-215,219,共3页
World Sci-Tech R&D
关键词
本体
信息_检索
查全率
查准率
ontology
information retrieval
recall ratio
precision ratio