降低指令存储器功耗的一种有效方法:循环缓冲被引量：2

Loop Buffering:An Effective Method to Reduce the Power Consumption of Instruction Memory

下载PDF

导出

摘要在超长指令字结构的数字信号处理器中,其指令存储器的功耗所占比重较大。但是,根据数字信号应用的特点,可以采用循环缓冲来减小指令存储器的功耗。本文提出了一种编译器控制的循环缓冲技术,由编译器选择合适的循环代码将其放入循环缓冲,从而减小了取指过程中指令存储器的功耗;给出了循环缓冲的体系结构设计、功耗分析以及有效利用循环缓冲的编译方法;最后用功能级功耗模型验证了该方法的有效性。 For digital processing processors with the Very Long Instruction Word （VLIW） architecture, a significant amount of power is consumed in instruction memories. According to the characteristics of digital processing applications, loop buffering can be used to reduce the power consumption of instruction memories while fetching instructions. This paper presents a low power compilation method based on the compiler-controlled loop buffer where the compiler is responsible for selecting appropriate loops and putting them into the buffer. The paper gives an analysis of the power dissipation and architectural design of the loop buffer, and a compilation method to use the loop buffer effectively. Finally, the effectiveness of the proposed method is validated by a function-level power analysis model.

作者胡定磊陈书明

机构地区国防科技大学计算机学院

出处《计算机工程与科学》 CSCD 2007年第6期93-96,112,共5页 Computer Engineering & Science

基金国家863计划资助项目(2004AA1Z1040) 国家自然科学基金资助项目(60473079)

关键词编译器循环缓冲低功耗 compiler loop buffering low power

分类号 TP314 [自动化与计算机技术—计算机软件与理论]

引文网络
相关文献

参考文献13

1胡定磊,陈书明,刘春林.分簇结构超长指令字DSP编译器的设计与实现[J].小型微型计算机系统,2006,27(2):348-353. 被引量：7
2Dspstone Benchmark Suite[EB/OL].http://www.ert.rwth-aachen.de/projekte/tools/dspstone/dspstone.html,2005-10. 被引量：1
3Senn E,Laurent J,Julien N,et al.SoftExplorer:Estimation of the Power and Energy Consumption for DSP Applications[A].Conférence IEEE EDERS 04 Birmingham Royaume Uni Novembre[C].2004. 被引量：1
4Villarreal J,Lysecky R,Cotterell S,et al.A Study on the Loop Behavior of Embedded Programs[R].Technical Report UCR-CSE-01-03,University of California,2002. 被引量：1
5Kin J,Gupta M,Mangione-Smith W H.The Filter Cache:An Energy Efficient Memory Structure[A].Proc of the 30th Annual ACM/IEEE Int'l Symp on Microarchitecture[C].1997.184-193. 被引量：1
6Lee L H,Moyer B,Arends J.Instruction Fetch Energy Reduction Using Loop Caches for Embedded Applications with Small Tight Loops[A].Proc of the 1999 Int'l Symp on Low Power Electronics and Design[C].1999.267-269. 被引量：1
7Gordon-Ross A,Cotterell S,Vahid F.Exploiting Fixed Programs in Embedded Systems:A Loop Cache Example[J].IEEE Computer Architecture Letters,2002,1(1). 被引量：1
8胡定磊,陈书明.低功耗编译技术综述[J].电子学报,2005,33(4):676-682. 被引量：11
9Laurent J,Julien N,Senn E,et al.Functional Level Power Analysis:An Efficient Approach for Modeling the Power Consumption of Complex Processors[A].Proc of the Conf on Design,Automation and Test in Europe[C].2004.666-667. 被引量：1
10Cotterell S,Vahid F.Tuning of Loop Cache Architectures to Programs in Embedded System Design[A].Proc of the 15th Int'l Symp on System Synthesis[C].2002.8-13. 被引量：1

二级参考文献49

1Fisher J.Very long instruction word architectures and the ELI-512[C].Proceedings of the Tenth Annual International Symposium on Computer Architecture,Stockholm,Sweden,1983,140-150. 被引量：1
2Faraboschi P,Fisher J,Young C.Instruction scheduling for instruction level parallel processors[C].Proceedings of the IEEE,2001,89(11):1638-1659. 被引量：1
3Kim J et al.Experience with a retargetable compiler for a commercial network processor[C].Proceedings of the 2002 International Conference on Compilers,Architecture,and Synthesis for Embedded Systems,Grenoble,France,2002,178-187. 被引量：1
4S Rajagopalan et al.A retargetable VLIW compiler framework for DSPs with instruction level parallelism[J].IEEE Trans.on Computer-Aided Design,2001,20(11):1319-1328. 被引量：1
5Shannon C J.The IMPACT SC140 code generator[D].MS Thesis,Department of Electrical and Computer Engineering,University of Illinois,Urbana IL,2002. 被引量：1
6Chakrapani L N et al.Triceps:enhancing the trimaran compiler infrastructure for strongARM code generation[R].CREST Technical Report:CREST-TR-01-01. 被引量：1
7Leupers R.Instruction scheduling for clustered VLIW DSPs[C].IEEE PACT 2000,291-300. 被引量：1
8Lapinskii V S et al.Cluster assignment for high-performance embedded VLIW processors[J].ACM Transactions on Design Automation of Electronic Systems,2002,7(3):430-454. 被引量：1
9Jang S et al.A code generation framework for VLIW architectures with partitioned register banks[C].In:Proceedings of the Third International Conference on Massively Parallel Computing Systems (MPCS),1998,61-69. 被引量：1
10Chang P P et al.IMPACT:an architectural framework for multiple-instruction-issue processors[C].ISCA 1991,266-275. 被引量：1

共引文献16

1蒋湘涛,胡志刚,贺建飚.基于调用链分析的低功耗编译优化[J].吉林大学学报（工学版）,2009,39(1):143-147. 被引量：6
2陈书明,李振涛,万江华,胡定磊,郭阳,汪东,扈啸,孙书为.“银河飞腾”高性能数字信号处理器研究进展[J].计算机研究与发展,2006,43(6):993-1000. 被引量：29
3胡定磊,陈书明,刘春林.奇异数据类型的编译支持[J].计算机工程,2007,33(3):29-31. 被引量：1
4胡定磊,陈书明,刘春林.基于超块的统一分簇与模调度[J].计算机研究与发展,2007,44(8):1429-1438.
5孙含欣,佟冬,袁鹏,程旭.基于簇的寄存器堆功耗管理方法[J].电子学报,2008,36(2):278-284. 被引量：2
6郭松柳,汪东升,汤志忠.同时多线程微处理器结构的性能功耗研究[J].计算机工程与应用,2008,44(28):4-8. 被引量：2
7余锋林,耿锐,戴福泉.一种支持VLIW DSP条件跳转指令的技术研究[J].工业控制计算机,2009,22(2):35-37. 被引量：1
8张海军,郑艳,李中升.基于编译的低功耗技术研究[J].计算机工程与科学,2009,31(A01):112-114.
9王昌龙.嵌入式软件系统的节能策略研究[J].现代电子技术,2009,32(22):39-41. 被引量：2
10郑启龙,汪胜,夏霏.DSP编译器中一种基于子图的分簇算法[J].微电子学与计算机,2010,27(8):49-52. 被引量：1

同被引文献3

1陈志坚,孟建熠,严晓浪,沙子岩.基于神经网络的重构指令预取机制及其可扩展架构[J].电子学报,2012,40(7):1476-1480. 被引量：2
2王琪,高瑛珂,华斯亮,张铁军,王东辉,侯朝焕.可复用微处理器片上调试功能的设计与实现[J].计算机辅助设计与图形学学报,2012,24(10):1369-1374. 被引量：7
3郑方,张昆,邬贵明,高红光,唐勇,吕晖,过锋,李宏亮,谢向辉,陈左宁.面向高性能计算的众核处理器结构级高能效技术[J].计算机学报,2014,37(10):2176-2186. 被引量：17

引证文献2

1王琪,鲍丽丹,张铁军,王东辉,侯朝焕.软硬件协同循环优化方法的设计与实现[J].计算机辅助设计与图形学学报,2013,25(10):1574-1581. 被引量：1
2刘骁,高红光,陈芳园,丁亚军.一种高能效的结构不对称指令缓存[J].计算机工程与科学,2017,39(3):443-450.

二级引证文献1

1余学锋,刘辉,翟巧丽.虚拟仪器软件执行时间测试与评估方法[J].电子测量技术,2019,42(18):123-126.

1戚玉松.Windows与DOS下的串口通讯及其在数控系统中的应用[J].机械设计与制造工程,2001,30(6):40-41.
2张怡,张丛.基于面向对象的机房电力监控系统的设计与实现[J].测控与通信,2008,32(2):56-61.
3张怡,张丛,黄健.基于面向对象的机房电力监控系统的设计与实现[J].航空计算技术,2009,39(6):81-84. 被引量：1
4陈纪孝,李勇.软件流水循环缓冲的设计与实现[J].计算机科学,2013,40(4):35-37. 被引量：4
5戚玉华,吴学智,顿新平.高速网络数据流分类系统[J].电子测量技术,2006,29(5):148-150. 被引量：2
6姚炜,马中.一种高速数据采集系统的设计[J].计算机与数字工程,1997,25(5):27-30.
7韩仲志,黄汉明,叶洪涛,匡贵娟.多通道同步信号采集与盲源分离研究[J].广西物理,2007,28(4):14-16.
8李泉泉,张铁军,王东辉,侯朝焕.基于分支执行历史的循环缓冲低功耗方法[J].微电子学与计算机,2014,31(9):7-10.
9路军杰,周军,赵斌.基于循环缓冲区的xPC目标串口通信研究[J].测控技术,2011,30(7):53-57. 被引量：3
10杨惠,陈书明,万江华.一种基于VLIW DSP架构的高性能取指流水线[J].国防科技大学学报,2011,33(4):102-106. 被引量：1

计算机工程与科学

2007年第6期

浏览历史

内容加载中请稍等...

降低指令存储器功耗的一种有效方法:循环缓冲被引量：2

参考文献13

二级参考文献49

共引文献16

同被引文献3

引证文献2

二级引证文献1

相关作者

相关机构

相关主题

浏览历史

降低指令存储器功耗的一种有效方法:循环缓冲 被引量：2

参考文献13

二级参考文献49

共引文献16

同被引文献3

引证文献2

二级引证文献1

相关作者

相关机构

相关主题

浏览历史

降低指令存储器功耗的一种有效方法:循环缓冲被引量：2