期刊文献+

降低指令存储器功耗的一种有效方法:循环缓冲 被引量:2

Loop Buffering:An Effective Method to Reduce the Power Consumption of Instruction Memory
下载PDF
导出
摘要 在超长指令字结构的数字信号处理器中,其指令存储器的功耗所占比重较大。但是,根据数字信号应用的特点,可以采用循环缓冲来减小指令存储器的功耗。本文提出了一种编译器控制的循环缓冲技术,由编译器选择合适的循环代码将其放入循环缓冲,从而减小了取指过程中指令存储器的功耗;给出了循环缓冲的体系结构设计、功耗分析以及有效利用循环缓冲的编译方法;最后用功能级功耗模型验证了该方法的有效性。 For digital processing processors with the Very Long Instruction Word (VLIW) architecture, a significant amount of power is consumed in instruction memories. According to the characteristics of digital processing applications, loop buffering can be used to reduce the power consumption of instruction memories while fetching instructions. This paper presents a low power compilation method based on the compiler-controlled loop buffer where the compiler is responsible for selecting appropriate loops and putting them into the buffer. The paper gives an analysis of the power dissipation and architectural design of the loop buffer, and a compilation method to use the loop buffer effectively. Finally, the effectiveness of the proposed method is validated by a function-level power analysis model.
出处 《计算机工程与科学》 CSCD 2007年第6期93-96,112,共5页 Computer Engineering & Science
基金 国家863计划资助项目(2004AA1Z1040) 国家自然科学基金资助项目(60473079)
关键词 编译器 循环缓冲 低功耗 compiler loop buffering low power
  • 相关文献

参考文献13

  • 1胡定磊,陈书明,刘春林.分簇结构超长指令字DSP编译器的设计与实现[J].小型微型计算机系统,2006,27(2):348-353. 被引量:7
  • 2Dspstone Benchmark Suite[EB/OL].http://www.ert.rwth-aachen.de/projekte/tools/dspstone/dspstone.html,2005-10. 被引量:1
  • 3Senn E,Laurent J,Julien N,et al.SoftExplorer:Estimation of the Power and Energy Consumption for DSP Applications[A].Conférence IEEE EDERS 04 Birmingham Royaume Uni Novembre[C].2004. 被引量:1
  • 4Villarreal J,Lysecky R,Cotterell S,et al.A Study on the Loop Behavior of Embedded Programs[R].Technical Report UCR-CSE-01-03,University of California,2002. 被引量:1
  • 5Kin J,Gupta M,Mangione-Smith W H.The Filter Cache:An Energy Efficient Memory Structure[A].Proc of the 30th Annual ACM/IEEE Int'l Symp on Microarchitecture[C].1997.184-193. 被引量:1
  • 6Lee L H,Moyer B,Arends J.Instruction Fetch Energy Reduction Using Loop Caches for Embedded Applications with Small Tight Loops[A].Proc of the 1999 Int'l Symp on Low Power Electronics and Design[C].1999.267-269. 被引量:1
  • 7Gordon-Ross A,Cotterell S,Vahid F.Exploiting Fixed Programs in Embedded Systems:A Loop Cache Example[J].IEEE Computer Architecture Letters,2002,1(1). 被引量:1
  • 8胡定磊,陈书明.低功耗编译技术综述[J].电子学报,2005,33(4):676-682. 被引量:11
  • 9Laurent J,Julien N,Senn E,et al.Functional Level Power Analysis:An Efficient Approach for Modeling the Power Consumption of Complex Processors[A].Proc of the Conf on Design,Automation and Test in Europe[C].2004.666-667. 被引量:1
  • 10Cotterell S,Vahid F.Tuning of Loop Cache Architectures to Programs in Embedded System Design[A].Proc of the 15th Int'l Symp on System Synthesis[C].2002.8-13. 被引量:1

二级参考文献49

  • 1Fisher J.Very long instruction word architectures and the ELI-512[C].Proceedings of the Tenth Annual International Symposium on Computer Architecture,Stockholm,Sweden,1983,140-150. 被引量:1
  • 2Faraboschi P,Fisher J,Young C.Instruction scheduling for instruction level parallel processors[C].Proceedings of the IEEE,2001,89(11):1638-1659. 被引量:1
  • 3Kim J et al.Experience with a retargetable compiler for a commercial network processor[C].Proceedings of the 2002 International Conference on Compilers,Architecture,and Synthesis for Embedded Systems,Grenoble,France,2002,178-187. 被引量:1
  • 4S Rajagopalan et al.A retargetable VLIW compiler framework for DSPs with instruction level parallelism[J].IEEE Trans.on Computer-Aided Design,2001,20(11):1319-1328. 被引量:1
  • 5Shannon C J.The IMPACT SC140 code generator[D].MS Thesis,Department of Electrical and Computer Engineering,University of Illinois,Urbana IL,2002. 被引量:1
  • 6Chakrapani L N et al.Triceps:enhancing the trimaran compiler infrastructure for strongARM code generation[R].CREST Technical Report:CREST-TR-01-01. 被引量:1
  • 7Leupers R.Instruction scheduling for clustered VLIW DSPs[C].IEEE PACT 2000,291-300. 被引量:1
  • 8Lapinskii V S et al.Cluster assignment for high-performance embedded VLIW processors[J].ACM Transactions on Design Automation of Electronic Systems,2002,7(3):430-454. 被引量:1
  • 9Jang S et al.A code generation framework for VLIW architectures with partitioned register banks[C].In:Proceedings of the Third International Conference on Massively Parallel Computing Systems (MPCS),1998,61-69. 被引量:1
  • 10Chang P P et al.IMPACT:an architectural framework for multiple-instruction-issue processors[C].ISCA 1991,266-275. 被引量:1

共引文献16

同被引文献3

引证文献2

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部