期刊文献+

同时多线程处理器上的Cache性能分析与优化 被引量:2

Performance Evaluation and Optimization of Cache Architecture for Simultaneous Multithreading Processor
下载PDF
导出
摘要 同时多线程(SMT)是一种延迟容忍的体系结构,它在每个周期内可以执行多个线程的多条指令.在SMT处理器上,对于片上共享存储这个复杂的结构资源,至今还没有很好的共享和冲突解决方案.本文着重研究了在多个并发执行的线程间划分共享Cache所存在的问题,指出基于LRU策略的传统Cache会根据需要隐式地划分共享Cache,这在某些情况下会导致全局性能的下降.针对这一问题并且考虑到SMT处理器上对Cache访问带宽的需求,本文提出采用一种多模块多体的Cache结构设计方案.并且在一个修改过的SMT模拟器上对该设计方案进行了性能评价.实验结果显示,相比于基于LRU策略的传统Cache,这一结构可以将一个4路SMT处理器的IPC提高9%. Simultaneous multithreading(SMT)is a latency-tolerant architecture that executes multiple instructions from multiple threads each cycle. In the SMT processor, for on-chip shared storage which is a complicated architecture resource,there aren't good solutions of share and conflict up to now. This paper investigates the problem of partitioning a shared cache between multiple concurrently executing threads, and shows that the commonly used LRU policy implicitly partitions a shared cache on a demand basis, and it will reduce the overall performance sometimes. According to the foregoing problem and taking into account the high-bandwidth Cache access in SMT processor, this paper puts forward adopting a multi-module and multi-banking Cache architecture. The design has been evaluated using a modified SMT simulator. The results show that this architecture improves IPC of a four-way SMT system by up to 9% over the traditional cache based on standard LRU replacement policy.
出处 《小型微型计算机系统》 CSCD 北大核心 2009年第1期159-163,共5页 Journal of Chinese Computer Systems
基金 国家自然科学基金重点项目"当代并行机的并行算法应用基础研究"(60533020)资助 国家"八六三"项目"红色神经元超高扩展高密度计算技术"(2005AA104031)资助
关键词 同时多线程 高速缓存 仿真 simultaneous multithreading (SMT) cache simulation
  • 相关文献

参考文献13

  • 1Susan J Eggers, Joel S Emer, Henry M Levy, et al. Simultaneous multithreading:a platform for next-generation processors [J]. IEEE Micro,September/October 1997,12-18. 被引量:1
  • 2Dean M Tullsen, Susan J Eggers,Henry M Levy. Simultaneous multithreading : maximizing on-chip parallelism[C]. Proceedings of the 22nd Annual International Symposium on Computer Architecture. Santa Margherita Ligure, Italy, June, 1995. 被引量:1
  • 3Marr D T,Binns F,Hill D L,et al. Hyper-threading technology architecture and microarchitecture[J]. Intel Technology Journal, 2002,6(1):4-15. 被引量:1
  • 4Clabes J,et al. Design and implementation of the POWER5 microprocessor [R]. In: ISSCC Digest of Technical Papers, 2004, 56-57. 被引量:1
  • 5Diefendorff K. Compaq chooses SMT for alpha[J]. Micropro-cessor Report, 1999,13(16). 被引量:1
  • 6Emer J S. Simultaneous multithreading: multiplying alpha's performance [C]. In: Proc. of the Microprocessor Forum (San Jose, CA), 1999. 被引量:1
  • 7Sohi G S,Franklin M. High-bandwidth data memory systems for superscalar processors [C]. Proceedings of ASPLOS-IV, April 1991,53-62. 被引量:1
  • 8Microprocessor Report [R]. IBM Regains Performance Lead with Power2, October 1993,7,13. 被引量:1
  • 9Digital equipment corporation maynard, MA[Z]. Alpha Architecture Handbook, 1996. 被引量:1
  • 10Digital equipment corporation maynard MA[Z]. Alpha Architecture Handbook, 1994. 被引量:1

同被引文献2

引证文献2

二级引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部