Abstract
High-performance numerical simulation of seismic wave propagation is an important component of seismic research. This paper studies the performance of seismic wave propagation simulation on the graphics processing unit (GPU) architecture by exploiting the parallelism in the elastodynamic equations and in their finite-difference discretization. It presents an optimized GPU simulation algorithm that comprises a GPU-specific domain decomposition method and two on-chip memory access schemes that maximize memory coalescing on the subdomain meshes. Implementation details of the GPU kernels for the domain decomposition method are discussed. Experimental results show that the optimized GPU implementation is more than 42 times faster than a dual-core CPU implementation optimized with Intel Threading Building Blocks (TBB).
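The idea of decomposing a finite-difference grid into subdomains that each stage their data, plus a halo, in fast local storage can be illustrated with a minimal sketch. It is not the paper's implementation: it runs on the CPU in plain Python, uses a simplified 1D velocity-stress scheme instead of the full elastodynamic equations, and all names (`TILE`, `step_reference`, `step_tiled`, `simulate`) and parameter values are assumptions made for illustration. The local lists only stand in for the on-chip memory a GPU thread block would use.

```python
import math

TILE = 8  # hypothetical tile width, standing in for a GPU thread-block size


def step_reference(v, s, cv, cs):
    """One leapfrog step of a 1D velocity-stress scheme on the whole grid.

    v[i] lives on node i, s[i] on the half node between i and i + 1.
    cv = dt / (rho * dx), cs = mu * dt / dx. The end nodes of v stay fixed.
    """
    n = len(v)
    v_new = list(v)
    for i in range(1, n - 1):
        v_new[i] = v[i] + cv * (s[i] - s[i - 1])
    s_new = [s[i] + cs * (v_new[i + 1] - v_new[i]) for i in range(n - 1)]
    return v_new, s_new


def step_tiled(v, s, cv, cs, tile=TILE):
    """Same update, but each tile first copies its cells plus a one-cell halo
    into a local buffer, mimicking a block that stages data in on-chip memory."""
    n = len(v)
    v_new = list(v)
    for start in range(1, n - 1, tile):
        end = min(start + tile, n - 1)
        local_s = s[start - 1:end]  # halo: one extra stress cell on the left
        for j in range(end - start):
            v_new[start + j] = v[start + j] + cv * (local_s[j + 1] - local_s[j])
    s_new = list(s)
    for start in range(0, n - 1, tile):
        end = min(start + tile, n - 1)
        local_v = v_new[start:end + 1]  # halo: one extra velocity node on the right
        for j in range(end - start):
            s_new[start + j] = s[start + j] + cs * (local_v[j + 1] - local_v[j])
    return v_new, s_new


def simulate(step, n=50, steps=20, cv=0.4, cs=0.4):
    """Propagate a Gaussian velocity pulse for a few steps (assumed parameters)."""
    v = [math.exp(-0.05 * (i - n // 2) ** 2) for i in range(n)]
    s = [0.0] * (n - 1)
    for _ in range(steps):
        v, s = step(v, s, cv, cs)
    return v, s
```

Because each tile reads exactly the same neighbor values as the whole-grid update, `simulate(step_tiled)` reproduces `simulate(step_reference)`. On a GPU the tiles would run concurrently, and contiguous loads into the local buffer are what allow memory accesses to coalesce.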
Source
《系统仿真学报》
CAS
CSCD
Peking University Core Journals (北大核心)
2009, Issue S1, pp. 170-174 (5 pages)
Journal of System Simulation