
A high-performance FPGA-GPU-CPU heterogeneous programming architecture based on PCIe (基于PCIe的高性能FPGA-GPU-CPU异构编程架构)

Cited by: 5
Abstract: Heterogeneous computing is a form of parallel computing that matches each task to the computing resource best suited to it, which gives it a clear advantage in improving server computing performance, energy efficiency, and real-time behavior. Current heterogeneous computing environments, however, are complex to program and offer no guarantee of trustworthiness. To address these problems, this paper proposes a programming framework based on the state transition matrix (STM) that integrates GPU and FPGA resources. The application programming interfaces (APIs) of CUDA and Vivado are integrated through the STM, and the standard C code required for heterogeneous computing is generated automatically. The GPU and FPGA devices are connected over the PCIe bus, so data can be transferred between these heterogeneous computing units without passing through system CPU memory. In addition, GPUDirect RDMA is used to implement PCIe communication with the FPGA as the master, which overcomes the weak read performance of PCIe communication with the GPU as the master. Experiments show that, compared with communication through shared memory, PCIe communication with the FPGA as the master improves communication efficiency by a factor of 1.4, and the achieved data rate is close to the theoretical maximum bandwidth.
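As a rough illustration of the idea behind a state transition matrix, the sketch below models a table whose cells map a (state, event) pair to an action and a next state. This is not the authors' framework or its generated C code. The state names, event names, and action functions are all hypothetical, and the actions only record a message where a real system would call CUDA or FPGA driver APIs.

```python
from enum import Enum, auto


class State(Enum):
    IDLE = auto()
    TRANSFERRING = auto()
    COMPUTING = auto()
    DONE = auto()


class Event(Enum):
    START = auto()
    TRANSFER_COMPLETE = auto()
    KERNEL_COMPLETE = auto()


# Hypothetical actions: placeholders for device API calls, which are
# not specified in the abstract. Each one only appends to a log.
def start_dma(log):
    log.append("start FPGA-to-GPU transfer")


def launch_kernel(log):
    log.append("launch GPU kernel")


def collect_result(log):
    log.append("collect result")


# The matrix: each filled cell is (action, next_state).
# Cells that are absent mean "event ignored in this state".
STM = {
    (State.IDLE, Event.START): (start_dma, State.TRANSFERRING),
    (State.TRANSFERRING, Event.TRANSFER_COMPLETE): (launch_kernel, State.COMPUTING),
    (State.COMPUTING, Event.KERNEL_COMPLETE): (collect_result, State.DONE),
}


def step(state, event, log):
    """Look up one cell, run its action, and return the next state."""
    cell = STM.get((state, event))
    if cell is None:
        return state  # empty cell: stay in the current state
    action, next_state = cell
    action(log)
    return next_state


def run(events):
    """Drive the matrix from IDLE through a sequence of events."""
    state, log = State.IDLE, []
    for event in events:
        state = step(state, event, log)
    return state, log
```

Calling `run([Event.START, Event.TRANSFER_COMPLETE, Event.KERNEL_COMPLETE])` walks the matrix from `IDLE` to `DONE`. Keeping the control flow in a table rather than in nested conditionals is what makes this kind of design amenable to automatic code generation and checking.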
Authors: SUN Zhao-peng (孙兆鹏); ZHOU Kuan-jiu (周宽久) (College of Software, Dalian University of Technology, Dalian 116620, China)
Source: Computer Engineering & Science (《计算机工程与科学》), CSCD, Peking University Core Journal, 2021, No. 4, pp. 641-651 (11 pages)
Funding: Fundamental Research Funds for the Central Universities (DUT19ZD104).
Keywords: state transition matrix; heterogeneous computing; FPGA; GPU; PCIe
