期刊文献+

基于NUMA架构的解释器访存优化设计与实现

Design and implementation of memory optimization for NUMA based Interpreters
下载PDF
导出
摘要 为了提高非一致内存访问(NUMA)架构虚拟机解释器的访存性能,研究了解释器在NUMA架构下的访存优化技术,提出了一种NUMA架构下的解释器访存优化方案,而且设计并实现了解释器的静态指令分派优化方法和动态指令分派优化方法。根据这一方案虚拟机启动时首先获取NUMA节点信息,并在每个NUMA节点中自动生成解释器所需的全部数据结构;解释器在运行时,通过静态或动态的指令分派技术来实现其执行线程在NUMA节点上访存的局部化。试验结果表明,上述方法能够显著提升解释器在NUMA系统中的性能。在DaCapo测试集上的总体性能提升了8%,最高性能提升幅度高达23%,而且算法实现代价低,适用于绝大多数的NUMA服务器系统。 In order to improve the performance of a virtual machine' s interpreter under the non-uniform memory access (NUMA) architecture, a study of memory access optimization was conducted. Then, a scheme for memory optimi- zation of the interpreters under the NUMA architecture was proposed, and based on it, two novel approaches for an interpreter's memory optimization, namely the one using static instruction dispatching and another using dynamic instruction dispatching, were designed and implemented. According to the scheme, the virtual machine collects the information of the NUMA nodes when it starts up, and automatically generates all the data structures the interpreter needs in each NUMA node; when the interpreter is running on a NUMA node, it uses the two mentioned dispatc- hing optimization approaches to realize its local access to the NUMA nodes. The experimental results demonstrated that the proposed scheme can significantly improve the performance of interpreters in NUMA systems. The results of the experiment using the DaCapo showed that the overall performance of the interpreter was improved by 8 %, and the highest increment was up to 23%. The proposed memory optimization algorithm is very easyto implement and can be applied to most NUMA systems.
出处 《高技术通讯》 CAS CSCD 北大核心 2015年第7期661-668,共8页 Chinese High Technology Letters
基金 国家“核高基”科技重大专项课题(2009ZX01028-002-003,2009ZX01029-001-003,2010ZX01036-001-002,2012ZX01029-001-002-002,2014ZX01020201) 国家自然科学基金(61221062,61133004,61173001,61232009,61222204,61432016) 863计划(2012AA010901,2012AA011002,2013AA014301)资助项目
关键词 非一致内存访问(NUMA) 虚拟机 解释器 响应速度 启动性能 访存优化 non-uniform memory access (NUMA), virtual machine, interpreter, response speed, startupperformance, memory optimization
  • 相关文献

参考文献16

  • 1程学旗,靳小龙,王元卓,郭嘉丰,张铁赢,李国杰.大数据系统和分析技术综述[J].软件学报,2014,25(9):1889-1908. 被引量:745
  • 2Lindholm T, Yellin F, Bracha G, et al. The Java? Vir-tual Machine Specification. http://docs. oracle, com:Oracle Corporation, 2013. 被引量:1
  • 3苏文,王焕东,台运方,王靖.面向云计算的多核处理器存储和网络子系统优化设计[J].高技术通讯,2013,23(4):360-367. 被引量:6
  • 4Hennessy J L, Peterson D A. Computer ArchitectureA Quantitative Approach. 4th Edition. Amsterdam/Boston/Heidelberg/London/New York/Oxford/Paris / San Diego / San Francisco / Singapore / Sydney / Tokyo : Morgan Kaufmann Publishers, 2007. 195 -200. 被引量:1
  • 5Lameter C. NUMA ( Non-Uniform Memory Access) : An Overview. Queuey 2013,11(7) :40-51. 被引量:1
  • 6Arnold M, Fink S J,Grove D, et al. A Survey of adaptive optimization in virtual machines. Proceedings of the IEEE, 2005, 93(2) :449-466. 被引量:1
  • 7Oracle. Java SE Development Kit 8. http://www. oracle.com/ : Oracle Corporation, 2014. 被引量:1
  • 8Liu M,Li T. Optimizing virtual machine consolidation performance on NUMA server architecture for cloud workloads. In: Proceedings of the 41st Annual International Symposium on Computer Architecture,Minnesota,USA, 2014. 325-336. 被引量:1
  • 9Oracle. Java HotSpot? Virtual Machine Performance Enhancements. http:// docs, oracle. com/javase/7/docs/ technotes/guides/vm/peiformance-enhancements-7. html : Oracle Corporation, 2014. 被引量:1
  • 10Kotzmann T, Wimmer C, Mossenbock H, et al. Design ofthe Java HotSpot? client compiler for Java 6. ACM Transactions on Architecture and Code Optimization, 2008,5(1) :l-32. 被引量:1

二级参考文献38

  • 1梅立军,周强,臧路,陈祖舜.知网与同义词词林的信息融合研究[J].中文信息学报,2005,19(1):63-70. 被引量:28
  • 2董振东,董强,郝长伶.知网的理论发现[J].中文信息学报,2007,21(4):3-9. 被引量:99
  • 3Amazon ec2. http :Haws. amazon, com/ec2/. 被引量:1
  • 4Barroso L, Dean J, HSlzle U. Web search for a planet: The Google cluster architecture. IEEE Micro, 2003, 23 (2) : 22-28. 被引量:1
  • 5Armbrust M, Fox A. Above the Clouds: A berkeley view of cloud computing, technical Report No. UCB/EECS- 2009-28, University of California at Berkley, USA, Feb. 10, 2009. 被引量:1
  • 6Menon A, Cox A, Zwaenepoel W. Optimizing network virtualization in Xen. In: Proceedings of the ACM/USE- NIX Annual Technical Conference, Boston, USA, 2006. 被引量:1
  • 7Menon A, Santos J, Turner Y, et al. Diagnosing perform- ance overheads in the Xen virtual machine environment. In : Proceedings of the ACM/USENIX Conference on Vir- tual Execution Environments, Chicago, USA, 2005. 被引量:1
  • 8Regnier G, Makineni S, Illikkal R, et al. TCP onloading for data center servers. IEEE Transactions On Computer, 2004, 37(11):48-58. 被引量:1
  • 9Zhao L, Bhuyan L, Iyer R, et al. Hardware support for accelerating data movement in server platform. IEEE Transactions On Computer, 2007, 56(6) :740-753. 被引量:1
  • 10Conway P, Kalyanasundharam N, Donley G, et al. Cache hierarchy and memory subsystem of the AMD Opteron processor. IEEE Micro, 2010, 30(2) : 16-29. 被引量:1

共引文献749

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部