Abstract
Memory access utilization has become one of the key bottlenecks limiting graphics processing unit (GPU) performance. In processor design, prefetching is one of the main methods for improving memory access utilization. Because graphics processors are memory-intensive, improving prefetch performance while minimizing the impact on the normal efficiency of the graphics pipeline has become an active research direction. Building on a lossless compression architecture for graphics processors, this paper proposes a prefetch architecture design for graphics processors. The design effectively improves memory access utilization in memory-intensive graphics pipelines without affecting the efficiency of the current graphics pipeline. Experimental results show that, on the Godson GPU (GSGPU) platform, the cache hit rate for memory-intensive test programs improves by more than 15% compared with a traditional prefetch architecture. For test programs in which memory access is idle, the design has no negative impact on the pipeline.
Authors
ZHAO Shipeng (赵士彭)
ZHANG Lizhi (张立志)
ZHANG Longbing (章隆兵)
(State Key Laboratory of Computer Architecture, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190; Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190; University of Chinese Academy of Sciences, Beijing 100049)
Source
Chinese High Technology Letters (《高技术通讯》)
CAS
2022, No. 4, pp. 351-357 (7 pages)
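Funding
Supported by the National Natural Science Foundation of China (61521092, 61432016)
and the Key Deployment Project of the Chinese Academy of Sciences (ZDRW-XH-2017-1).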
Keywords
graphics processing unit (GPU)
memory access subsystem
prefetch architecture
compression architecture