摘要
性能骨架分析技术通过刻画并行应用程序的程序结构,为并行应用程序性能建模提供输入,是大规模并行应用程序性能分析、性能优化的基础。文中针对数值模拟领域中的一类构件化并行应用程序,在面向通用程序二进制文件的动静态结构分析技术的基础上,提出并实现了一种基于"构件-循环-调用"关系树(Component-Loop-Call-Tree, CLCT)的程序结构自动化生成方法,在此基础上,研制了一种面向构件化并行应用程序的性能骨架分析工具(CLCT SkeleTon Analysis Toolkit, CLCT-STAT)。该方法可以自动识别构件化应用程序中构件类成员函数符号,生成以构件为最小单位的并行应用程序性能骨架。在多个构件化并行应用程序上的测试表明,相比分析建模手动生成性能骨架的方法,所提方法不仅能提供更丰富的程序结构信息,还可以节约人工分析的时间成本。
Performance skeleton analysis technology(PSTAT)provides input parameters for performance modeling of parallel applications by describing the program structure of parallel applications.PSTAT is the basis of performance analysis and performance optimization for large-scale parallel applications.Aiming at a kind of component-based parallel applications in the field of numerical simulation,based on the dynamic and static application structure analysis technology oriented to general program binary file,this paper proposes and implements an automatic performance skeleton generation method based on“component-loop-call”tree.On this foundation,a performance skeleton analysis toolkit CLCT-STAT(Component-Loop-Call-Tree SkeleTon Analysis Toolkit)is developed.This method can automatically identify the function symbols of component class members in component-based applications,and generate the performance skeleton of parallel application with component as the smallest unit.Compared with the method of manual generation of performance skeleton by analytical modeling,the proposed method can provide more program structure information and save the cost of manual analysis.
作者
傅天豪
田鸿运
金煜阳
杨章
翟季冬
武林平
徐小文
FU Tian-hao;TIAN Hong-yun;JIN Yu-yang;YANG Zhang;ZHAI Ji-dong;WU Lin-ping;XU Xiao-wen(Institute of Applied Physics and Computational Mathematics,Beijing 100094,China;Department of Computer Science and Technology,Tsinghua University,Beijing 100084,China;Graduate School of China Academy of Engineering Physics(CAEP),Beijing 100088,China)
出处
《计算机科学》
CSCD
北大核心
2021年第6期1-9,共9页
Computer Science
基金
科技部重点研发计划高性能计算重点专项课题(2017YFB0202103)
国防基础科研核科学挑战专题项目(TZ2019002)。