期刊文献+

Co-OLAP:CPU&GPU混合平台上面向星形模型基准的协同OLAP(英文)

Co-OLAP:Research on cooperated OLAP with star schema benchmark on hybrid CPU&GPU platform
下载PDF
导出
摘要 当前GPU(图形处理器),即使是中端服务器配置的中端GPU也拥有强大的并行计算能力.不同于近期的研究成果,中端服务器可能配置有几块高端CPU和一块中端GPU,GPU能够提供额外的计算能力而不是提供比CPU更加强大的计算能力.本文以中端工作站上的CoOLAP(协同OLAP)为中心,描述如何使中端GPU与强大的CPU协同以及如何在计算均衡的异构平台上分布数据和计算以使Co-OLAP模型简单而高效.根据实际的配置,基于内存容量,GPU显存容量,数据集模式和订制的AIR(数组地址引用)算法提出了最大高性能数据分布模型.CoOLAP模型将数据划分为驻留于内存和GPU显存的数据集,OLAP计算也划分为CPU和GPU端的自适应计算负载来最小化CPU和GPU内存之间的数据传输代价.实验结果显示,在SF=20的SSB(星形模型基准)测试中,两块至强六核处理器的性能略优于一块NVIDA Quadra 5 000GPU(352个cuda核心)的处理性能,Co-OLAP模型可以将负载均衡分布在异构计算平台并使每个平台简单而高效. Nowadays GPUs have powerful parallel computing capability even for moderate GPUson moderate servers. Opposite to the recent research efforts, a moderate server may be equippedwith several high level CPUs and a moderate GPU, which can provide additional computing powerinstead of more powerful CPU computing. In this paper, we focus on Co-OLAP (CooperatedOLAP) processing on a moderate workstation to illustrate how to make a moderate GPU cooperatewith powerful CPUs and how to distribute data and computation between the balanced computingplatforms to create a simple and efficient Co-OLAP model. According to real world configuration,we propose a maximal high performance data distribution model based on RAM size, GPU devicememory size, dataset schema and special designed AIR(array index referencing) algorithm. TheCo-OLAP model distributes dataset into host and device memory resident datasets, the OLAP isalso divided into CPU and GPU adaptive computing to minimize data movement between CPU andGPU memories. The experimental results show that two Xeon six-core CPUs slightly outperformone NVIDA Quadra 5 000 GPU with 352 cuda cores with SF = 20 SSB dataset, the Co-OLAPmodel can assign balanced workload and make each platform simple and efficient.
出处 《华东师范大学学报(自然科学版)》 CAS CSCD 北大核心 2014年第5期240-251,共12页 Journal of East China Normal University(Natural Science)
基金 中央高校基本科研业务费专项资金(12XNQ072,13XNLF01)
  • 相关文献

参考文献17

  • 1GOVINDARAJU N K,LLOYD B,WANG W,et al.Fast computation of database operations using graphics processors[C]//SIGMOD Conference.2004. 被引量:1
  • 2HE B,LIU M,YANG K,et al.Relational query coprocessing on graphics processors[J].ACM Transactions on Database Systems,2009,34(4). 被引量:1
  • 3HE B,YANG K,FANG R,ey al.Relational joins on graphics processors[C]//SIGMOD,2008:511-524. 被引量:1
  • 4PIRK H,MANEGOLD S,KERSTEN M.Accelerating foreign-key joins using asymmetric memory channels[C]//ADMS,2011. 被引量:1
  • 5ALCANTARA D A,SHARF A,ABBASINEJAD F,et al.Real-time parallel hashing on the gpu[J].ACM Trans Graph,2009,28(5). 被引量:1
  • 6AO N,ZHANG F,WU D,et al.Efficient parallel lists intersection and index compression algorithms using graphics processing units[J].PVLDB,2011. 被引量:1
  • 7FANG W,HE B,LUO Q.Database compression on graphics processors[C]//VLDB,2010. 被引量:1
  • 8GOVINDARAJU N,GRAY J,KUMAR R,et al.Gputerasort:high performance graphics co-processor sorting for large database management[C]//SIGMOD,2006. 被引量:1
  • 9SATISH N,KIM C,CHHUGANI J,et al.Fast sort on cpus and gpus:a case for bandwidth oblivious simd sort[C]//SIGMOD,2010. 被引量:1
  • 10SITARIDI E,ROSS K.Ameliorating memory contention of olap operators on gpu processors[C]//DaMoN,2012:39-47. 被引量:1

共引文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部