期刊文献+

Revisiting the Parallel Strategy for DOACROSS Loops 被引量:1

原文传递
导出
摘要 DOACROSS loops are significant parts in many important scientific and engineering applications,which are generally exploited pipeline/wave-front parallelism by loop transformations.However,previous work almost statically performs iterations in parallel threads,thus causing a waste of computing resources in thread synchronization.This paper proposes a brand-new parallel strategy for DOACROSS loops that provides a dynamic task assignment with reduced dependences to achieve wave-front parallelism through loop tiling.The proposed strategy uses a master-slave parallel mode and some customized structures to realize dynamic and flexible parallelization,which effectively avoids threads from waiting in communication.An efficient tile size selection(TSS)approach is also proposed to preserve data reuse in cache for tiled codes.The experimental results show that the proposed parallel strategy obtains good and stable speedups over six typical benchmarks with different problem sizes and different numbers of threads on an Intel■Xeon■32-core serve?.And it outperforms two static strategies,a barrier-based strategy and a post/wait-based strategy,by 32% and 20% in average performance,respectively.This strategy also yields a better performance than a mutex-based dynamic strategy.Besides,it has been demonstrated that the proposed TSS approach can achieve a near-optimal performance and is comparable with a state-of-the-art TSS approach.
出处 《Journal of Computer Science & Technology》 SCIE EI CSCD 2019年第2期456-475,共20页 计算机科学技术学报(英文版)
基金 the National Key Research and Development Program of China under Grant No.2016YFB0201800 the National Natural Science Foundation of China under Grant Nos.91630206 and 91330117.
  • 相关文献

同被引文献5

引证文献1

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部