基于规范划分集的并行循环计算划分被引量：1

A Computation Partition Based on Uniform Partitioning Schemes for Parallel Loops

下载PDF

导出

摘要计算划分问题是并行编译中最为重要的问题之一.针对并行循环,在数据分布确定的情况下,提出了基于规范集的计算划分算法,具体讨论了规范集的获取方法及综合通信与负载均衡的最优方案选取算法.实验表明,在并行循环处理方面,这一算法与以前几种算法相比更加简单、有效;采用这一算法的p_HPF编译器对数据并行应用问题可以获得良好的加速比和效率.该编译器已在石油领域得到应用. Computation partition is one of the most important problems in parallel compilation and optimization. For dealing with parallel loops with determinated data distribution, a computation partition algorithm based on the subset of uniform schemes is proposed. The method of getting the subset of uniform schemes is given, as well as the algorithm of selecting the most optimized scheme under the consideration of communication and load balance. The experimental results prove that this algorithm is simpler and more effective than several previous algorithms in dealing with parallel loops, and the p_HPF compiler adopted by this algorithm can obtain good speedups and efficiencies. The compiler has been applied in the field of petroleum.

作者黄其军杨建武余华山许卓群

机构地区北京大学计算机科学技术系北京大学计算机科学技术研究所文字信息处理国家重点实验室

出处《软件学报》 EI CSCD 北大核心 2003年第3期362-368,共7页 Journal of Software

基金 Supported by the National Natural Science Foundation of China under Grant No.60173004 (国家自然科学基金)

关键词规范划分集并行循环计算划分并行编译编译程序 parallel loop parallel compilation computation partition parallel computation node program

分类号 TP314 [自动化与计算机技术—计算机软件与理论]

引文网络
相关文献

参考文献10

1[1]Banerjee U. Unimodular transformations if double loops. In: Proceedings of the 3rd Workshop on Languages and Compilers for Parallel Computing. 1990. 192～219. 被引量：1
2[2]Banerjee U. Loop Transformations for Restructuring Compilers. Norwell: Kluwer Academic Publishers, 1993. 被引量：1
3[3]Anderson JM, Lam MS. Global optimizations for parallelism and locality on scalable parallel machines. In: Proceedings of the ACM SIGPLAN'93 Conference on Programming Language Design and Implementation. 1993. 112～125. 被引量：1
4[4]Lim AW, Cheong GI, Lam MS. An affine partitioning algorithm to maximize parallelism and minimize communication. In: Proceedings of the 13th ACM SIGARCH International Conference on Supercomputing. 1999. 228～237. 被引量：1
5[5]Lim AW, Liao S-W, Lam MS. Blocking and array contraction across arbitrarily nested loops using affine partitioning. ACM SIGPLAN Notices, 2001,36(7):103～112. 被引量：1
6[6]http://www.cs.rice.edu/～dsystem/dhpf/dhpf-overview-96/index.html. 1995. 被引量：1
7[7]Hu CY, Jin GH, Johnsson SL, Kehagias D, Shalaby N. HPFBench: a hign performance Fortran Benchmark suite. ACM Transactions on Mathematical Software, 2000,26(1):99～149. 被引量：1
8[8]CRPC/HPFF/benchmarks/index.cfm. 1995. 被引量：1
9[9]Thirumalai A. Code generation and optimization for high performance Fortran . Department of Electrical and Computer Engineering, Louisiana State University, 1995. 被引量：1
10[10]Huang QJ. Parallel loop and its compiling and optimizing techniques . Beijing: Peking University, 2002 (in Chinese with English Abstract). 被引量：1

同被引文献5

1严胜祥,吴绍春,吴耿锋,金沈杰.一种基于纵向划分数据集的并行决策树分类算法[J].计算机工程与科学,2004,26(7):67-70. 被引量：2
2张宇亮,张立臣,李代平.并行算法的任务粒度与映射方法的分析[J].计算机工程与应用,2005,41(20):44-47. 被引量：3
3刘键,谢卫,朱晓梅,谷秋艳.一种关于DO－loop并行划分的新观点与新方法[J].计算机学报,1996,19(7):520-529. 被引量：1
4LinC,SynderL.并行程序设计原理[M].陆鑫达,林新华,译.北京:机械工业出版社,2009:101-195. 被引量：5
5江文毅,庞丽萍,高兰,韩宗芬.串行程序的并行划分算法研究[J].华中理工大学学报,2000,28(12):30-32. 被引量：4

引证文献1

1卢成浪.串行程序的并行划分算法研究[J].温州大学学报（自然科学版）,2010,31(4):21-26.

1王振宇,王义和,郭福顺.并行循环的识别[J].哈尔滨工业大学学报,1992,24(1):40-46.
2王振宇,郭福顺.循环并行的优化技术[J].深圳大学学报（理工版）,1994,11(3):25-30. 被引量：1
3胡吉明,周建强.并行循环的乐观自调度模式[J].计算机学报,1995,18(1):46-55.
4侯智玮.光纤传感器在石油行业中的应用[J].石化技术,2016,23(6):265-265.
5余德汝,王石刚.六轮车扭矩加载系统的应用程序框架研究[J].测控技术,2015,34(4):112-115.
6沈志宇.多机系统的并行循环调度[J].计算机工程与科学,1989,11(2):1-12.
7黄品丰,赵荣彩,姚远,赵捷.面向异构多核处理器的并行代价模型[J].计算机应用,2013,33(6):1544-1547. 被引量：3
8唐北平,肖建华.快速模板匹配算法在图像循环处理中的实现[J].信息技术,2006,30(8):34-36.
9王桂彬,杨学军,徐新海,林一松,李鑫.异构系统功耗感知的并行循环调度方法[J].软件学报,2011,22(9):2222-2234. 被引量：7
10黄品丰,赵荣彩,韩林,刘晓娴.OpenMP数据分布子句自动生成算法[J].计算机工程,2013,39(3):295-299.

软件学报

2003年第3期

浏览历史

内容加载中请稍等...

基于规范划分集的并行循环计算划分被引量：1

参考文献10

同被引文献5

引证文献1

相关作者

相关机构

相关主题

浏览历史

基于规范划分集的并行循环计算划分 被引量：1

参考文献10

同被引文献5

引证文献1

相关作者

相关机构

相关主题

浏览历史

基于规范划分集的并行循环计算划分被引量：1