期刊文献+

双簇结构DSP的数据Cache优化

Optimization Design of the Data Cache in Dual Cluster DSPs
下载PDF
导出
摘要 数字信号处理常常包含大量数据运算,这使得数据Cache成为影响其性能的关键因素。特别是对于我们研制的双簇VLIW结构YHFT DSP系列处理器,Cache的失效会导致整个内核八条流水线同时停顿。所以,减小Cache失效延迟能给处理器性能带来显著的提升。本文研究的主要问题是如何针对一级数据Cache的读失效操作进行优化,从四个方面进行,分别为提前发读请求、请求字优先、合并并行失效读和后台处理Snooping。模拟结果表明,采用这些优化措施后,处理器的性能提高了8.36%。 In digital signal processors, the data cache is a key element. It is important for the designer to consider how to reduce the overhead caused by cache miss. Especially in the dual cluster VLIW DSP that we have developed, the cache miss will stall all 8 pipelines in the kernel. So an optimization design of the data cache will get a great performance improvement. In this paper, we present four techniques for optimizing the data cache: early read request, request word first, merging two parallel read misses, and background processing snooping. The simulation results show that these methods can improve the processor 's performance by 8. 36 %.
出处 《计算机工程与科学》 CSCD 2008年第9期119-121,125,共4页 Computer Engineering & Science
基金 国家自然科学基金资助项目(60473079) 教育部高等学校博士学科点专项科研基金资助项目(20059998026)
关键词 数字信号处理器 高速缓存 失效 超长指令字 双簇 流水线 digital signal processor cache miss VLIW dual cluster pipeline
  • 相关文献

参考文献7

  • 1Hennessy J L, Patterson D A. Computer Architecture: A Quantitative Approach [M].3rd Ed. Morgan Kaufmann, 2003:435-438. 被引量:1
  • 2Delano E, Mulla D. Data Cache Design Considerations for the Itanium 2 Processor[C]//Proc of IEEE 2002 Conf: VLSI in Computers and Processors, 2002:356 -363. 被引量:1
  • 3陈书明,李振涛,万江华,胡定磊,郭阳,汪东,扈啸,孙书为.“银河飞腾”高性能数字信号处理器研究进展[J].计算机研究与发展,2006,43(6):993-1000. 被引量:29
  • 4马鹏勇 陈书明 等.支持两条并行存取指令的Cache控制器的设计[J].高技术通讯,2002,12:408-410. 被引量:2
  • 5张丹俞.陈书明,马鹏勇.双簇VLIW结构下数据Cache失效流水机制的设计与实现[C]//第13届全国信息存储技术学术会议,2004:451-456. 被引量:1
  • 6张春元,罗莉等.计算机系统结构[M].第一版.长沙:国防科技大学出版社.2002. 被引量:1
  • 7Ma Pengyong, Chen Shurning. MID: A Novel Coherence Protocol in CMP[C]//Proc of CIT'06,2006 : 50-57. 被引量:1

二级参考文献27

  • 1胡定磊,陈书明.低功耗编译技术综述[J].电子学报,2005,33(4):676-682. 被引量:11
  • 2Jennifer Eyre, Jeff Bier. The evolution of DSP processors, http://www. BDTI. com, 2000 被引量:1
  • 3DSP adapt to new challenges, http://www. BDTI.com, 2003 被引量:1
  • 4A BDTI analysis of Texas Instrument TMS320C64x. http://www. BDTI. com, 2003 被引量:1
  • 5ADSP-BF531/ADSP-BF532/ADSP-BF533 Datasheet. Analog Devices, Inc. http://www. analog.com, 2004 被引量:1
  • 6MSC8126 Reference Guide. Freescale Semiconductor, http://www. freescale.com, 2004 被引量:1
  • 7Piia Simonen, Ilkka Saastamoinen, Mika Kuulusa, et al.Advanced instruction set architectures for reducing program usage in a DSP processor. The first IEEE Int'l Workshop on Electronic Design, Test and Applications, Christchurch, New Zealand, 2002 被引量:1
  • 8Paul M Heysters, Gerard J M Smit. Mapping of DSP algorithms on the MONTIUM architecture. In: Proc. 17th Int'l Parallel and Distributed Processing Symposium. Los Alamitos, CA: IEEE Computer Society Press, 2003 被引量:1
  • 9Christopher Pretty, J Geoffrey Chase. Reconfigurable DSP's for efficient MPEG-4 video and audio decoding. The First IEEE International Workshop on Electronic Design, Test and Applications, Christchurch, New Zealand, 2002 被引量:1
  • 10Paolo Gai, Luca Abeni, Giorgio Buttazzo. Multiprocessor DSP scheduling in system on-a-chip architectures. The 14th Euromicro Conf. Real-Time Systems, Vienna, Austria, 2002 被引量:1

共引文献29

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部