
CTUNING: A REUSE DISTANCE BASED CACHE PERFORMANCE TUNING TOOL

Abstract: Cache performance tuning tools help developers write programs with good locality and make full use of the cache, reducing the impact of the speed gap between processor and memory. This paper introduces the design and implementation of a cache performance tuning tool named CTuning, which uses source-level instrumentation to gather information on a program's data accesses and a limited reuse distance model to analyze cache behavior. Experiments on 183.equake improve average performance by more than 6% and show that CTuning is effective not only at locating cache performance bottlenecks to guide manual code transformation, but also at analyzing the cache behavior relationships among variables to direct manual data reorganization.
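The abstract does not spell out CTuning's limited reuse distance model, so the following is only a minimal sketch of the general idea. It assumes that "reuse distance" means the number of distinct addresses touched since the previous access to the same address, and that "limited" means the distance is tracked only up to a bound. The names `reuse_distances`, `miss_count`, and `limit` are hypothetical and do not come from the paper.

```python
from collections import OrderedDict

INF = float("inf")  # marks a first access or a distance beyond the limit


def reuse_distances(trace, limit=None):
    """Return the reuse distance of every access in `trace`.

    Assumption (not from the paper): when `limit` is set, only the `limit`
    most recently used addresses are remembered, so any distance >= limit
    is reported as INF.
    """
    stack = OrderedDict()  # LRU stack, most recently used address last
    distances = []
    for addr in trace:
        if addr in stack:
            keys = list(stack)
            # Number of distinct addresses accessed after the last use of addr.
            dist = len(keys) - 1 - keys.index(addr)
            del stack[addr]
        else:
            dist = INF
        stack[addr] = True
        if limit is not None and len(stack) > limit:
            stack.popitem(last=False)  # forget the least recently used address
        distances.append(dist)
    return distances


def miss_count(trace, capacity):
    """Misses in a fully associative LRU cache holding `capacity` addresses.

    An access hits exactly when its reuse distance is below the capacity.
    """
    return sum(1 for d in reuse_distances(trace, limit=capacity) if d >= capacity)


if __name__ == "__main__":
    trace = list("abcabb")
    print(reuse_distances(trace))
    print(miss_count(trace, capacity=2), miss_count(trace, capacity=3))
```

In this sketch the bound keeps the bookkeeping proportional to the cache size rather than to the program's whole data footprint, which is one plausible reason to limit the distance. A real tool would feed the function with addresses recorded by the instrumented program rather than a toy trace.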
Affiliation: College of Computer
Source: Journal of Electronics (China) (电子科学学刊, English edition), 2009, Issue 4, pp. 517-524 (8 pages)
Funding: Sponsored by the National Natural Science Foundation of China (No. 60573141, 60773041), the National 863 High Technology Research Program of China (No. 2007AA01Z404, 2007AA01Z478), the High Technology Research Programme of Jiangsu Province (No. BG2006001), the Key Laboratory of Information Technology Processing of Jiangsu Province (kjs06006), and a Project of NJUPT (NY207135)
Keywords: Cache behavior; Source level instrumentation; Reuse distance; Code transformation; Data reorganization; performance optimization tool; cache performance; distance; reuse; manual code transformation; cache; information tool; execution behavior

