期刊文献+

异构高性能计算系统Linpack效率受限因素分析 被引量:1

Analysis of the factors limiting Linpack efficiency of heterogeneous high-performance computing systems
下载PDF
导出
摘要 能耗是目前高性能计算系统性能提升的一大挑战。主处理器连接加速器的异构计算技术可以有效提升系统能效,因而被广泛应用于当前高性能计算系统的设计。同等系统规模下,异构计算系统的Linpack效率普遍低于同构系统。针对这一问题,从结构设计的角度,基于真实计算系统的设计参数和性能数据,分析了大规模异构高性能计算系统Linpack效率受限的主要因素及其对结构设计的需求,并构建了针对异构计算系统的Linpack性能模型对分析结论进行了验证。研究成果对异构计算系统Linpack的性能优化以及未来高效异构架构的设计具有一定的指导意义。 Energy consumption is a major challenge for current high-performance computing(HPC)system design.The technique of heterogeneous computing with accelerators connected to host processor can improve the system energy efficiency,and has been widely applied in HPC.However,heterogeneous systems have lower Linpack efficiency in comparison to homogeneous systems with the equivalent scale.Aiming at this problem,from the perspective of structural design,based on the design parameters and performance data of real computing systems,this paper analyzes the main factors limiting the Linpack efficiency of heterogeneous HPC systems and their demands of structural design.Besides,the analysis is verified with a Linpack performance model tailored for heterogeneous systems.This study results are of guiding significance to Linpack performance optimization and efficient architecting on future heterogeneous systems.
出处 《计算机工程与科学》 CSCD 北大核心 2018年第2期224-230,共7页 Computer Engineering & Science
基金 国家自然科学基金(91430214)
关键词 异构 高性能 LINPACK 效率 heterogeneous high performance Linpack efficiency
  • 相关文献

参考文献2

二级参考文献9

  • 1张文力,陈明宇,樊建平.HPL测试性能仿真与预测[J].计算机研究与发展,2006,43(3):557-562. 被引量:13
  • 2Petitet A, Whaley R C, Dongarra J, et al. HPL-aportable Implementation of the High-performance Linpack Benchmark for Distributed-memory Computers[EB/OL]. (2004-01-20). http:// www.netlib.org/benehmark/hpl. 被引量:1
  • 3Meuer H W, Strohmaier E, Dongarra J, et al. Top500 Super computer Sites[EB/OL]. (2007-08-14). http://www.netlib.org/ benchmark/top500.html. 被引量:1
  • 4Demmel J. Applications of Parallel Computers Dense Linear Algebra[EB/OL]. (2008-07-16), http://www.cs.berkeley.edu/- demmel/cs267 _171002.ppt. 被引量:1
  • 5Goto K. High Performance BLAS GOTO[EB/OL]. (2007-04-16). http://www.tacc.utexas.edu/-kgoto. 被引量:1
  • 6Innovative Computing Laboratory. HPL Algorithm[EB/OL]. (2009-08-16). http://www.netlib.org/benehmark/algorithm.html. 被引量:1
  • 7Dongarra J, Luszczek P, Petitet A. The Linpaek Benchmark: Past, Present, and Future[J]. Concurrency and Computation: Practice and Experience, 2003, 15(9): 803~820. 被引量:1
  • 8Cao Zhennan. How to do Linpack Test and Performance Optimi- zation[EB/OL]. (2009-10-11). http://www.samss.org.cn/lmpack- doc/howtodo.pdf. 被引量:1
  • 9YANG XueJun,TANG Tao,WANG GuiBin,JIA Jia,XU XinHai.MPtostream:an OpenMP compiler for CPU-GPU heterogeneous parallel systems[J].Science China(Information Sciences),2012,55(9):1961-1971. 被引量:3

共引文献38

同被引文献4

引证文献1

二级引证文献4

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部