Design and Implementation of Memory Optimization for NUMA-Based Interpreters


Abstract: To improve the memory access performance of virtual machine interpreters on non-uniform memory access (NUMA) architectures, this paper studies memory access optimization techniques for interpreters under NUMA and proposes an optimization scheme, for which a static instruction dispatch method and a dynamic instruction dispatch method are designed and implemented. Under this scheme, the virtual machine first collects NUMA node information at startup and automatically generates all of the data structures the interpreter needs in each NUMA node. At run time, the interpreter uses static or dynamic instruction dispatch to keep the memory accesses of its executing threads local to the NUMA node they run on. Experimental results show that the method significantly improves interpreter performance on NUMA systems: overall performance on the DaCapo benchmark suite improved by 8%, with a maximum improvement of 23%. The algorithm is inexpensive to implement and applies to the vast majority of NUMA server systems.
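The abstract does not show the implementation, so the following is only a toy model of the idea under stated assumptions. It assumes a tiny stack bytecode (`push`, `add`, `mul`), represents each NUMA node's copy of the interpreter data structures as a `NodeReplica` holding its own dispatch table, and reads "static" dispatch as choosing the replica once per thread and "dynamic" dispatch as re-checking the node before every instruction. All names here (`NodeReplica`, `build_replicas`, `run_static`, `run_dynamic`, `current_node`) are hypothetical, and plain Python dictionaries do not control physical memory placement; a real virtual machine would allocate each replica in node-local memory through the operating system.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple

# Hypothetical toy bytecode: each handler works on an operand stack.
Handler = Callable[[List[int], int], None]
Program = List[Tuple[str, int]]


def op_push(stack: List[int], arg: int) -> None:
    stack.append(arg)


def op_add(stack: List[int], arg: int) -> None:
    b, a = stack.pop(), stack.pop()
    stack.append(a + b)


def op_mul(stack: List[int], arg: int) -> None:
    b, a = stack.pop(), stack.pop()
    stack.append(a * b)


HANDLERS: Dict[str, Handler] = {"push": op_push, "add": op_add, "mul": op_mul}


@dataclass
class NodeReplica:
    """Stand-in for the interpreter data structures held by one NUMA node."""
    node_id: int
    table: Dict[str, Handler]
    dispatch_count: int = 0  # how many instructions were dispatched via this copy


def build_replicas(num_nodes: int) -> List[NodeReplica]:
    """Startup step: create one private dispatch-table copy per node."""
    return [NodeReplica(n, dict(HANDLERS)) for n in range(num_nodes)]


def run_static(program: Program, replicas: List[NodeReplica], node_id: int) -> int:
    """Static dispatch (assumed meaning): pick the replica once, then never re-check."""
    replica = replicas[node_id]
    stack: List[int] = []
    for op, arg in program:
        replica.table[op](stack, arg)
        replica.dispatch_count += 1
    return stack[-1]


def run_dynamic(program: Program, replicas: List[NodeReplica],
                current_node: Callable[[int], int]) -> int:
    """Dynamic dispatch (assumed meaning): re-query the node before each instruction,
    so a thread that has migrated switches to the replica of its new node."""
    stack: List[int] = []
    for pc, (op, arg) in enumerate(program):
        replica = replicas[current_node(pc)]
        replica.table[op](stack, arg)
        replica.dispatch_count += 1
    return stack[-1]
```

In this model, `run_static` is cheaper per instruction but keeps using the original replica if the thread is moved to another node, while `run_dynamic` pays a lookup per instruction in exchange for following the thread. The `current_node` callback takes the program counter only so that a migration can be simulated deterministically; on a real system the node would come from the operating system rather than from the interpreter.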

Authors: Ren Tong (任彤), Fu Jie (傅杰), Jin Guojie (靳国杰)

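Affiliations: University of Chinese Academy of Sciences; Institute of Computing Technology, Chinese Academy of Sciences; Research Institute of China Electronic Equipment System Engineering Company; State Key Laboratory of Computer Architecture; Loongson Technology Corporation Limited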

Source: Chinese High Technology Letters (高技术通讯), indexed in CAS, CSCD, and the Peking University Core Journals list; 2015, No. 7, pp. 661-668 (8 pages)

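Funding: Supported by the National Science and Technology Major Project on Core Electronic Devices, High-End Generic Chips and Basic Software (2009ZX01028-002-003, 2009ZX01029-001-003, 2010ZX01036-001-002, 2012ZX01029-001-002-002, 2014ZX01020201); the National Natural Science Foundation of China (61221062, 61133004, 61173001, 61232009, 61222204, 61432016); and the National High Technology Research and Development Program of China (863 Program) (2012AA010901, 2012AA011002, 2013AA014301)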

Keywords: non-uniform memory access (NUMA), virtual machine, interpreter, response speed, startup performance, memory access optimization

Classification: TP314 [Automation and Computer Technology: Computer Software and Theory]


