Abstract
With the rapid growth of the Internet, the massive data produced by network monitoring poses a challenge to query processing. Because such data divides clearly into a large volume of event data and a small, stable body of configuration data, this paper proposes a parallel query processing approach built on single-machine DBMSs. From the perspective of relational algebra, any query is decomposed into sub-queries over horizontal data partitions and a post-processing query that aggregates the intermediate results. Using the database links provided by the DBMS, the query processor can be constructed easily without modifying the DBMS. Tests on a real workload show that near-linear scalability is achieved when the intermediate result set is not very large.
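The decomposition described in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation: in-memory SQLite databases stand in for the single-machine DBMS instances, a plain Python loop stands in for the database links, and the `events` table, its columns, and the helper names `make_partition` and `run_partitioned` are hypothetical. The sub-query computes partial aggregates on each horizontal partition, and the post-processing query combines them.

```python
import sqlite3

def make_partition(rows):
    """Create one in-memory database holding a horizontal slice of the event data."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE events (device TEXT, bytes INTEGER)")
    conn.executemany("INSERT INTO events VALUES (?, ?)", rows)
    return conn

# Sub-query: runs unchanged on every partition and returns partial aggregates.
SUB_QUERY = (
    "SELECT device, COUNT(*) AS cnt, SUM(bytes) AS total "
    "FROM events GROUP BY device"
)

# Post-processing query: runs once over the merged intermediate results.
POST_QUERY = (
    "SELECT device, SUM(cnt), SUM(total) "
    "FROM partials GROUP BY device ORDER BY device"
)

def run_partitioned(partitions):
    """Collect partial results from each partition, then apply the post-query."""
    merger = sqlite3.connect(":memory:")
    merger.execute("CREATE TABLE partials (device TEXT, cnt INTEGER, total INTEGER)")
    for conn in partitions:
        # Assumption: a direct fetch replaces the DBMS database link here.
        merger.executemany(
            "INSERT INTO partials VALUES (?, ?, ?)",
            conn.execute(SUB_QUERY).fetchall(),
        )
    return merger.execute(POST_QUERY).fetchall()

# Two hypothetical partitions of the same logical event table.
partitions = [
    make_partition([("r1", 100), ("r2", 50)]),
    make_partition([("r1", 25), ("r3", 10)]),
]
result = run_partitioned(partitions)
print(result)
```

In this sketch only one row per device leaves each partition, which matches the abstract's condition that scaling stays near-linear while the intermediate result set remains small.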
Source
《计算机工程与应用》
CSCD
PKU Core (Peking University Core Journals list)
2007, No. 30, pp. 5-10 (6 pages in total)
Computer Engineering and Applications
Funding
CNGI fund project of the National Development and Reform Commission (No. CNGI-04-7-1D)
Keywords
network data management
horizontal partitioning
query processing
scalability