摘要
针对半结构化流式XML数据的访问特征和查询处理要求,基于SAX事件驱动解析XML,提出了采用链接栈处理XPath查询的方法 QXSStack。首先定义了XML流式数据模型,然后在整体处理框架的基础上,给出了针对XPath查询片段XP^({/,//,[],*})的具体算法实现,最后通过实验对比分析了QXSStack方法与QStream++方法的运行效率,验证了提出方法的有效性。
Focused on the characteristics of processing semi-structure XML streaming data and concerned the requirement of processing for large scale data, QXSStack method is proposed for massive data processing based on SAX parsing XML. Data model and algorithm integrated framework are defined firstly, the integrated method of how to process predicate and wildcard is then discussed in detail. The experimental results show the effectiveness of this approach which outperforms the state-of-the-art algorithms such as QStream + + especially for large processed data.
出处
《北华航天工业学院学报》
CAS
2016年第3期11-13,共3页
Journal of North China Institute of Aerospace Engineering
基金
北华航天工业学院项目(KY-2015-11)
北华航天工业学院科技创新团队项目(XJTD20140)
廊坊市科技支撑计划项目(2015011066)