摘要
越来越多的信息隐藏在W eb查询接口之后,在此情况下如何寻找与用户查询最相关的数据源接口就变得越来越重要。文中提出了一种Deep W eb查询接口选择算法,该算法是完全依赖于查询接口特征的。给定大量异构的Deep W eb数据源,目标是选择与用户查询最相关的查询接口集。通过对实际查询接口特征的观察,发现了查询接口上谓词间的相关性。基于此发现,设计了一种基于共同出现谓词相关度模型的数据源选择算法,用于选择与用户查询最相关的查询接口集。
As Web develops, more and more data has become available under Web query interface. Therefore, how to find the data-sources that are most relevant to the user's requirements has become more and more important. This paper presented a Deep Web query interface selection arithmetic, which completely depended on the characteristics of query interface. Given numerous heterogeneous Deep Web data sources, we aimed at selecting sources most relevant to the user's requirements. By allowing the users to input an imprecise initial query, our system found appropriate sources for them. We observed the characteristics of query interface and found out the relationships between predicates. Based on this discovery, an algorithm based on co-occurrence predicate model for capturing the relevance of attributes was designed. select the sources most relevant to the user's requirments.
出处
《计算机应用》
CSCD
北大核心
2006年第9期2024-2027,共4页
journal of Computer Applications
基金
教育部高校博士学科点科研基金资助项目(20040285016)
江苏省高技术研究计划资助项目(BG2005019)
教育部科研重点资助项目(205059)
关键词
谓词模型
接口对象
动态选择
predicate model
interface object
dynamic selecting