期刊文献+

基于依赖分析的SPMD程序隐式同步检测及处理算法 被引量:1

Detecting and Treatment Algorithm of Implicit Synchronization Based on Dependence Analysis in SPMD Program
下载PDF
导出
摘要 SPMD翻译是指将一种特定类型的SPMD程序编译到多种设备上,当前的细粒度SPMD翻译研究建立在线程之间相互独立的假定上,线程之间只通过显式同步进行通信.但线程之间还隐含存在着各种数据依赖,如隐式同步,这导致了SPMD翻译在处理隐式同步时的正确性缺陷.为了对隐式同步进行处理,对细粒度SPMD模型CUDA中的隐式同步进行了系统的分析,指出了当前翻译CUDA程序到多核平台的相关研究在处理隐式同步上的不足,提出了基于依赖分析的隐式同步检测方法.在检测出隐式同步的基础上,设计了循环重排序的优化处理算法,对显式同步和隐式同步进行了统一处理.实验结果表明,与现有的SPMD翻译方法相比,该检测及处理算法能够正确而快速地检测并翻译CUDA中的各种隐式同步,代价较小,有助于编译器产生正确而有效的翻译结果. SPMD translation compiles programs of one SPMD-threaded programming model to multi devices.The current researches base on the supposition that different threads are independent except in communication with explicit synchronizations.However,the data dependence relation between threads such as implicit synchronizations results in the correctness pitfalls in SPMD translation.In order to deal with implicit synchronizations,the implicit synchronizations in fine-grained SPMD programming model CUDA are analyzed systematically.The correctness pitfalls in existing SPMD translation from CUDA to Multi-core are revealed in which this paper proposes a method of detecting implicit synchronizations based on dependence analysis.On the basis of implicit synchronizations detecting,an optimized treatment algorithm is designed to treat explicit and implicit synchronizations synthetically by the loop reorder.The experimental results show that compared with existing SPMD translation,the detecting and optimized algorithm could treat kinds of implicit synchronizations in fine grained SPMD translation correctly and quickly by small expense,which helps compiler produces correct and efficient result.
出处 《软件学报》 EI CSCD 北大核心 2013年第8期1775-1785,共11页 Journal of Software
基金 国家高技术研究发展计划(863)(2009AA012201) 国家科技重大专项(核高基)(2009ZX01036-001-001) 河南省重大科技攻关专项(092101210501)
关键词 SPMD翻译 显式同步 隐式同步 依赖分析 线程循环 循环重排序 SPMD translation explicit synchronization implicit synchronization dependence analysis thread loop loop reorder
  • 相关文献

参考文献14

  • 1Stratton J, Grover V, Marathe J, Aarts B, Murphy M, Hu Z, Hwu WMW. Efficient compilation of fine-grained spmd-threaded programs for multicore CPUs. In: Proe. of the 2010 Int'l Symp. on Code Generation and Optimization. ACM Press, 2010. 111-119 [doi: 10.1145/1772954.1772971]. 被引量:1
  • 2Kirk DB, Hwu WMW. Programming Massively Parallel Processors. Elsevier Inc., 2010. 39-42. 被引量:1
  • 3Wu B, Zhang EZ, Shen XP. Enhancing data locality for dynamic simulations through asynchronous data transformations and adaptive control. In: Proc. of the 20th Int'l Conf. on Parallel Architecture and Compilation Techniques. IEEE Computer Society, 2011. 243-252. [doi: 10.1109/PACT.2011.56]. 被引量:1
  • 4Wang PY, Chen YJ, Shen HH, Chen TS, Zhang H. Memory consistency verification of chip multi-processor. Ruan Jian Xue Bao/ Journal of Software, 2010,21(4):863-874 (in Chinese with English abstract), http://www.jos.org.cn/1000-9825/3705.htm [doi: 10. 3724/SPJ.1001.2010.03705]. 被引量:1
  • 5NVIDIA Corporation. Getting Started With CUDA SDK Samples. NVIDIA Corporation, 2012.2-5. 被引量:1
  • 6Guo ZY, Shen XP. Fine-Grained treatment to synchronizations in GPU-to-CPU translation. Technical Report, WM-CS-2011-02, 2011. 被引量:1
  • 7Aiken A, Gay D. Barrier inference. In: Proc. of the 25th ACM Symp. on Principles of Programming Languages. IEEE Press, 1998. 342-354. [doi: 10.1145/268946.268974]. 被引量:1
  • 8NVIDIA Corporation. NVIDIA CUDA Programming Guide. Version 2.3, NVIDIA Corporation, 2009. 71-75. 被引量:1
  • 9Allen R, Kennedy K. Optimizing Compilers for Modem Architectures: A Dependence-Based Approach. Elsevier Science, 2001. 34-39. 被引量:1
  • 10Guo ZY, Zhang EZ, Shen XP. Correctly treating synchronizations in compiling fine-grained spmd-threaded programs for CPU. In: Proc. of the 20th Int'l Conf. on Parallel Architectures and Compilation Techniques. IEEE Computer Society, 2011. 310-319. [doi: 10.I 109/PACT.2011.62]. 被引量:1

二级参考文献20

  • 1Chatterjee P,Sivaraj H,Gopalakrishnan G.Shared memory consistency protocol verification against weak memory models:Refinement via model-checking.In:Proc.of the 14th Int'l Conf.on Computer Aided Verification (CAV 2002).2002.http://www.cs.utah.edu/formal_verification/papers/cav02paper.pdf. 被引量:1
  • 2Yang Y,Gopalakrishnan G,Lindstrom G,Slind K.Nemos:A framework for axiomatic and executable specifications of memory consistency models.In:Proc.of the 18th Int'l Parallel and Distributed Processing Symp.(IPDPS 2004).2004.http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=1302944. 被引量:1
  • 3Gibbons P,Korach E.On testing cache-coherent shared memories.In:Proc.of the 6th ACM Symp.on Parallel Algorithms and Architectures (SPAA'94).1994.http://delivery.acm.org/10.1145/190000/181328/p177-gibbons.pdf?key1=181328&key2=318237 8621&coll=GUIDE&dl=GUIDE&CFID=82133830&CFTOKEN=76647768. 被引量:1
  • 4Gibbons P,Korach E.Testing shared memories.SIAM Journal on Computing,1997,26(4):1208-1244. 被引量:1
  • 5Meixner A,Sorin D.Dynamic verification of sequential consistency.In:Proc.of the 32nd Int'l Symp.on Computer Architecture (ISCA 2005).2005.http://people.ee.duke.edu/~sorin/papers/isca05_dvsc.pdf. 被引量:1
  • 6Meixner A,Sorin D.Dynamic verification of memory consistency in cache-coherent multithreaded computer architectures.In:Proc.of the Int'l Conf.on Dependable Systems and Networks (DSN 2006).2006.http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=& arnumber=1633497&userType=inst. 被引量:1
  • 7Hangal S,Vahia D,Manovit C,Lu J,Narayanan S.Tsotool:A program for verifying memory systems using the memory consistency model.In:Proc.of the 31st Int'l Symp.on Computer Architecture (ISCA 2004).2004.http://ieeexplore.ieee.org/ iel5/9170/29103/01310768.pdf?arnumber=1310768. 被引量:1
  • 8Manovit C,Hangal S.Efficient algorithms for verifying memory consistency.In:Proc.of the 17th ACM Symp.on Parallelism in Algorithms and Architecture (SPAA 2005).2005.http://delivery.acm.org/10.1145/1080000/1074011/p245-manovit.pdf?key1= 1074011&key2=6873378621&coll=GUIDE&dl=GUIDE&CFID=82135957&CFTOKEN=66149051. 被引量:1
  • 9Roy A,Zeisset S,Fleckenstein C,Huang J.Fast and generalized polynomial time memory consistency verification.In:Proc.of the 18th Int'l Conf.on Computer Aided Verification (CAV 2006).Berlin,Heidelberg:Springer-Verlag,2006.503-516. 被引量:1
  • 10Manovit C,Hangal S.Completely verifying memory consistency of test program executions.In:Proc.of the 12th Int'l Symp.on High-Performance Computer Architecture (HPCA 2006).2006.http://ieeexplore.ieee.org/iel5/10647/33614/01598123.pdf? arnumber=1598123. 被引量:1

共引文献12

同被引文献2

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部