期刊文献+

一种适用于嵌入式图形处理器的多端口纹理Cache的设计 被引量:2

The Design of Multiported Texture Cache for Embedded Graphics Processing Unit
下载PDF
导出
摘要 为了提高嵌入式图形处理器的纹理单元效率,提出了一种多端口纹理高速缓存(Texture Cache)结构。该结构采用了基于块的光栅化和块交错的纹理内存组织,能够充分发掘数据间相关性,提高了Cache命中率;此外该结构采用Cache预取技术,有效隐藏了访存延迟;为了进一步提高数据吞吐率,设计了4个读端口,可支持并行读取4个纹素。仿真结果表明,设计的Cache可达到92%左右命中率,访存性能可达到零延迟内存系统的90%,数据吞吐率是单端口Cache的3~4倍。 In order to improve GPU's texture unit efficiency ,a multiported texture cache architecture is proposed . The architecture employs a tiling rasterization order and a block interleaving memory organization ,which can fully exploit the data locality and improve cache hit rate . In addition , the architecture employs cache prefetching technology ,which can hide memory latency . In order to further improve data throughput , four read ports supporting 4 parallel reading are designed .Simulation results show that the hit rate of the proposed cache is about 92% and can attain 90% of the performance of a zero latency memory system .The data throughput is about 3~4 times of the single ported cache .
出处 《微电子学与计算机》 CSCD 北大核心 2013年第11期27-30,34,共5页 Microelectronics & Computer
基金 国家自然科学基金(61234003)
关键词 图形处理器 光栅化 纹理高速缓存 分层细化 预取 多端口 Graphics Processing Unit rasterization texture cache mipmap prefetching multiported
  • 相关文献

参考文献9

  • 1Khronos Group. Open GL ES common profile specifi cation [S]. USA.- Khronos Group, 2008. 被引量:1
  • 2Williams L. Pyramidal parametrics [J]. Computer Graph ics, 1983,17(3) : 1-11. 被引量:1
  • 3程龙,郭立,史鸿声.一种纹理映射算法的FPGA实现[J].小型微型计算机系统,2009,30(9):1855-1859. 被引量:6
  • 4Hakura Z, Gupta A. The design and analysis of a cache architecture for texture mapping[C]//Proceedings of the 24'h International Symposium on Computer Architecture Denver, Colorado, 1997:108-119. 被引量:1
  • 5Ben Juurlink, Iosif Antochi, INn Crisu, et al. GRAAL: a framework for low-power 3D graphics accelerators [J]. IEEE computer graphics and applications, 2008,28(4) : 63- 73. 被引量:1
  • 6Homan Igehy, Matthew Eldridge, Kekoa Proudfoot. Prefetching in a texture cache architecture[C]//SIG- GRAPH/Eurographics Workshop on Graphics Hard- ware. Orlando, 1998 : 133-142. 被引量:1
  • 7JohnLHennessy,DavidAPatterson.计算机系统结构一量化研究方法[M].4版.白跃彬,译.北京:电子工业出版社,2007. 被引量:1
  • 8Michael Cox, Narendra Bhandari, Michael Shanta Multi- level texture caching for 3D graphics Hardware[C]//Pm- ceedings of the 25 International Symposium on Computer Architecture. Barcelona, 1998: 86-97. 被引量:1
  • 9奚杰,陈杰,朱玥.利用SystemC实现多核系统的快速建模[J].微电子学与计算机,2010,27(7):214-217. 被引量:11

二级参考文献16

  • 1陈绍贺,赵明,王京.基于SystemC的片上系统设计[J].微电子学与计算机,2005,22(4):51-52. 被引量:11
  • 2张奇,曹阳,李栋娜,马秦生.基于SystemC的SoC行为级软硬件协同设计[J].计算机工程,2005,31(19):217-219. 被引量:9
  • 3Lance Williams. Pyramidal parametrics [ J ]. Computer Graphics, July, 1983, 17(3): 1-11. 被引量:1
  • 4PowerVR. PowerVR white paper: 3D graphical processing[ EB/ OL]. http ://www. powervr, com/pdf/TBR3D, pdf, 2000. 被引量:1
  • 5S3TC DirectX 6.0 standard texture compression ~ EB/OLJ. $3 Inc, http://www, s3. com/savage3d/s3tc, htra, 1998. 被引量:1
  • 6Strom J Akenine-Moller T. Packman: texture compression for mobile phones[ C]. In Sketches program at SIGGRAPH,2004. 被引量:1
  • 7Jon P Ewins, Marcus D Waller, Martin White, et al. Mip-map level selection for texture mapping [ J]. IEEE Transactions on Visualization and Computer Graphics, Oct. 1998,4(4) :317-329. 被引量:1
  • 8McCormack J, McNarnara R. Tiled polygon traversal using halfplane edge functions [ J]. In Proceedings of the 2000 SIGGRAPH/ EUROGRAPHICS Workshop on Graphics Hardware, 2000,15-21. 被引量:1
  • 9Michael Cox, Narendra Bhandari, Michael Shantz. Multi-level texture caching for 3D graphics hardware[ C]. Proceeding of the 25th Annual International Symposium on Computer Architecture, Apr. 1998. 被引量:1
  • 10Plb bus structure: processor local bus ( PLB ) v3.4 [ EB/OL ]. http ://www. xilinx, com/support/documentation/ip_documentation/ plb_v34, pdf, July,2003. 被引量:1

共引文献15

同被引文献12

引证文献2

二级引证文献5

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部