摘要
用户在查询XML文档的时候经常有模糊的或者不精确的查询要求.为了解决用户的模糊查询意图,提出了一种基于XML内容和结构的模糊查询方法.以模糊集理论为基础,提出了利用模糊谓词实现XPath查询表达式的模糊扩展,采用模糊查询松弛方法,它可以产生更多满足用户查询要求的结果.在排序这些查询结果的时候,提出的打分方法使用一个扩展的向量空间模型,考虑了内容和结构的相关性,按照内容和结构的匹配情况打分,得分大于阈值的节点就是答案节点.最后,通过实验验证了所提方法的有效性.
Users often have fuzzy or imprecise requests when querying XML documents.A new approach based on XML content and structure was proposed to reflect users' fuzzy query intention.Based on the fuzzy set theory,a fuzzy extension of XPath query expression was proposed,which can be expressed exploiting fuzzy predicates.And then fuzzy query relaxations was provided to get more querying results which satisfy users' query requests.The proposed scoring method uses an extended vector space model,which considers the relevance of both content and structure when ranking these query results.According to the matching of the structure and content,the nodes whose scores are greater than the threshold are query results.Finally,the efficiency of the approach is demonstrated by experimental results.
出处
《东北大学学报(自然科学版)》
EI
CAS
CSCD
北大核心
2011年第7期931-934,共4页
Journal of Northeastern University(Natural Science)
基金
国家自然科学基金资助项目(60873010
61073139)
中央高校基本科研业务费专项资金资助项目(N090504005
N100604017
N090604012)
教育部新世纪优秀人才支持计划项目(NCET-05-0288)
关键词
模糊集
XML
模糊查询
查询松弛
排序
fuzzy set
XML(extensible markup language)
fuzzy query
query relaxation
ranking