处理器片上渗透缓存蕴含的时间与空间及时局部性

The Time and Spatial Just-in-Time Locality of on-Chip Percolation Cache

下载PDF

导出

摘要处理器片上寄存器的分布形态与数量规模对处理器的整体计算性能有直接影响,这种影响表面上看是波及处理器片上缓存结构的改进和优化,本质上是时间要素与空间要素交织在一起的综合反映.因此,从时间和空间上确保处理器内核对片上缓存的局部化访问必将进一步提高处理器的整体计算性能.为了认识处理器片上缓存中存在的时间与空间及时局部性,以由传统缓存耦合而成的渗透缓存为工具来分析处理器内核访问片上缓存的时间与空间局部性,仿真实验表明渗透缓存因具备容纳时间与空间局部性的结构提高了处理器访问片上缓存的命中率,客观上缩短访存延迟,从而为提高处理器性能创造了有利条件. It is believed that the arrangements and amounts of the registers in processor chip have much heavy impact on the operation speed of the processor,which has induced the improvement of the structure of the on-chip cache,whose central task is to realize the fast access to the data in registers in the term of time and space.This kind of fast access to the register can be investigated vise the access process in which the data and structures in on-chip cache are accessed.By introducing the a new on-chip cache percolation cache,we prove that the existent of the time and spatial just-in-time locality which the percolation cache equips has contributed much to shorten the memory access delay by raising the hit rates when processor core accesses the percolation cache.

作者胡九川程建聪万良易吴楠士叶笑春严龙 HU Jiu-chuan;CHENG Jian-cong;WAN Liang-yi;WU Nan-shi;YE Xiao-chun;YAN Long(School of Computer and Information Technology,Beijing Jiaotong University,Beijing 100044,China;Center for Information of Education Management,Ministry of Education,Beijing 100816,China;The 32nd Research Institute of China Electronics Technology Group Corporation,Shanghai 201800,China;State Key Laboratory of Computer Architecture,Institute of Computing Technology,Chinese Academy of Sciences,Beijing 100190,China)

机构地区北京交通大学计算机科学与技术学院教育部教育管理信息中心中国电子科技集团公司第中国科学院计算技术研究所计算机体系结构国家重点实验室

出处《电子学报》 EI CAS CSCD 北大核心 2024年第10期3589-3599,共11页 Acta Electronica Sinica

基金国家自然科学基金(No.62202451)。

关键词渗透缓存时间局部性空间局部性片上数据流转 percolation cache temporal locality spatial locality data flow on-chip

分类号 TP302.1 [自动化与计算机技术—计算机系统结构]

引文网络
相关文献

参考文献3

1胡伟武等编著..计算机体系结构[M].北京:清华大学出版社,2011:260.
2范东睿,袁楠,张军超,周永彬,林伟,宋风龙,叶笑春,黄河,余磊,龙国平,张浩,刘磊.Godson-T:An Efficient Many-Core Architecture for Parallel Program Executions[J].Journal of Computer Science & Technology,2009,24(6):1061-1073. 被引量：12
3胡九川,范东睿,李丹萍,严龙,叶笑春.一种支持数据渗透迁移的片上缓存模型研究[J].北京交通大学学报,2017,41(5):1-9. 被引量：5

二级参考文献41

1Asanovic K et al. The landscape of parallel computing research: A view from Berkeley. Technical Report No.UCB/EECS-2006-183, University of California, Berkeley, December 18, 2006. 被引量：1
2Lee E A. The problem with threads. Computer, 2006, 39(5): 33-42. 被引量：1
3Cantrill B, Bonwick J. Real-world concurrency. ACM Queue, 2008, 6(5): 16-25. 被引量：1
4Adve S V, Adve V Set al. Parallel computing research at Illinois: The UPCRC agenda. Technical Report, University of Illinois at Urbana-Chmnpaign, November 2008. 被引量：1
5Yuan N, Yu L, Fan D. An efficient and flexible task management for many-core architectures. In Proc. Workshop on Software and Hardware Challenges of Manycore Platforms, in Conjunction with the 35th International Symposium on Computer Architecture (ISCA-35), Beijing, China, June 22- 26, 2008, pp.1-17. 被引量：1
6Blumofe R D, Leiserson C E. Scheduling multithreaded computations by work stealing. Journal of the ACM, 1999, 46(5): 720-748. 被引量：1
7Palatin P, Lhuillier Y, Temam O. CAPSULE: Hardwareassisted parallel execution of component-based programs. In Proe. the 39th Annual IEEE/A CM International Symposium on Micro-Architecture, Washington, DC, USA: IEEE Computer Society, Dec. 9-13, 2006, pp.247-258. 被引量：1
8Villa O, Palermo G, Silvano C. Efficiency and scalability of barrier synchronization on NoC based many-core architecture. In Proc. CASES2008, Atlanta, USA, Oct. 19-24, 2008, pp.81-90. 被引量：1
9Carlson W W, Draper J Met al. Introduction to UPC and language specification. Technical Report No. CCS-TR-99- 157, University of California, Berkeley, 1999. 被引量：1
10Numrich R W, Reid J. Co-array Fortran for parallel programming. SIGPLAN Fortran Forum, 1998, 17(2): 1-31. 被引量：1

共引文献13

1张轮凯,宋风龙,王达.一种针对片上众核结构共享末级缓存的改进的LFU替换算法[J].计算机应用与软件,2013,30(1):1-6. 被引量：5
2吴志敏,吕慧伟,陈明宇.一个针对并行模拟引擎的性能评测实例[J].计算机科学,2013,40(3):41-45.
3吕慧伟,程元,白露,陈明宇,范东睿,孙凝晖.众核处理器和众核集群的并行模拟[J].计算机研究与发展,2013,50(5):1110-1117. 被引量：4
4吕方,崔慧敏,霍玮,冯晓兵.面向并发性能下降的调度策略的综述[J].计算机研究与发展,2014,51(1):17-30. 被引量：4
5吴筱,郭培源,何多多.DES和SM4算法的可重构研究与实现[J].计算机应用研究,2014,31(3):853-856. 被引量：10
6闫乔,覃志东,王绍宇,闫红曼.同构多核/众核处理器任务分配自适应模拟退火算法[J].计算机科学,2014,41(6):18-21. 被引量：6
7石嵩,李宏亮,朱巍.阵列众核处理器上的高效归并排序算法[J].计算机研究与发展,2016,53(2):362-373. 被引量：6
8石嵩,宁永波,李宏亮,郑方.阵列众核结构上的一种多层分区Hash连接算法[J].计算机科学,2016,43(3):18-22.
9李灵枝,胡九川,叶笑春,范东睿,严龙.渗透缓存命中率诱导的缓存区域动态分配机制研究[J].软件导刊,2020,19(4):1-8.
10王娜娜.混合云存储中网络稀疏大数据渗透迁移算法[J].计算机工程与设计,2021,42(3):719-725. 被引量：6

1赵宏,李珈瑞,刘静.基于局部时空的多峰优化算法及其在PID控制中的应用[J].计算机学报,2024,47(6):1323-1340.
2何鹏.计算机视觉技术在自动化中的应用[J].计算机产品与流通,2024(8):122-124.
3罗俊,张春亢,罗启雄.顾及邻域局部特征的车载点云城市道路提取[J].测绘通报,2024(9):62-66.
4张帆,卢琪.女性视角下的商业步行街空间视觉感知偏好影响因素研究[J].华中建筑,2024,42(11):41-45.
5古丽斯坦·库尔班尼牙孜,吴晓晓.基于MGWR模型的中国地级市城乡收入差距的空间异质性分析[J].统计与管理,2024,39(10):23-31.
6侯昌明,张朔,陈晓,杨智峰,张永兴.应变硬化水泥基复合材料增强钢筋混凝土构件受弯性能试验研究[J].混凝土与水泥制品,2024(11):65-69.
7《守制与应变:清代台湾城市规划研究》[J].台湾历史研究,2024(4):12-12.
8曹宇超,张楠,师晓龙.当代观演场景构成视角下的古典园林空间模式[J].西部人居环境学刊,2024,39(5):103-109.
9李燕,杜宏武.高层建筑空中花园空间感知及需求研究[J].中国园林,2024,40(9):57-63.
10丰叶.刑事大数据证据的证据化路径研究[J].大连理工大学学报（社会科学版）,2024,45(5):93-101.

电子学报

2024年第10期

浏览历史

内容加载中请稍等...

处理器片上渗透缓存蕴含的时间与空间及时局部性

参考文献3

二级参考文献41

共引文献13

相关作者

相关机构

相关主题

浏览历史