期刊文献+

一种基于代价子图的子字并行指令选择算法

An Instruction Selection Algorithm for Subword Parallelism Based on the Cost Subgraph
下载PDF
导出
摘要 子字并行能够充分利用多媒体算法的数据精度小、内部循环处理形式规则的特点,是加速多媒体处理的有效方式。然而,如何充分挖掘多媒体应用中的子字并行仍然是一个难题。本文说明传统的并行技术可以有效地开发循环中的子字并行性,同时提出一种基于代价子图的子字并行指令自动识别的方法。与其他方法相比,该方法利用代价模型对子字并行指令选择进行定量评估。本文在TTA体系结构框架下实现了这一方法。实验结果表明,该方法可以充分地提取循环中的子字并行性。 Subword parallelism can fully utilize the characteristics of multimedia algorithms, and it is an effective way to accelerate multimedia processing. However it is hard to mine the subwords in multimedia applications. This paper shows that the traditional parallelization techniques can be used to exploit subword parallelism, and also proposes a novel method to extract subword parallelism based on the cost subgraph. We evaluate the effectiveness of our methods for a number of benchmarks on the TTA framework. The results reveal that this method can significantly obtain the available subword parallelism in the loop.
作者 王淼 王志英
出处 《计算机工程与科学》 CSCD 2008年第9期141-144,150,共5页 Computer Engineering & Science
基金 国家自然科学基金资助项目(60173040)
关键词 子字并行 指令选择 代价子图 subword parallelism instruction selection cost subgraph
  • 相关文献

参考文献10

  • 1Lee R B,Murat Fiskiran A. PLX: A Fully Subword-Parallel Instruction Set Architecture for Fast Sealable Multimedia[C] //Proc of the 3rd Int'l Conf on Multimedia and Expo, 2002: 117-120. 被引量:1
  • 2Intel Corporation. Intel Architecture MMX Technology, Programmer's Reference Manual [M].Intel Corporation, 1996:11-120. 被引量:1
  • 3Raman S, Pentkovski V, Keshava J. Implementing Streaming SIMD Extensions on the Pentium Ⅲ Processor[J]. IEEE Micro, 2000,20(4) : 47-57. 被引量:1
  • 4Sun Microeleetronics. Visual Instruction Set (VIS) User's Guide[M]. Sun Microelectronics, 1996. 被引量:1
  • 5Lee R B. Subword Parallelism with MAX-2[J]. IEEE Micro, 1996,16(4) :51-59. 被引量:1
  • 6Larsen S,Amarasinghe S. Exploiting Supervcord Level Parallelism with Multimedia Instruction Sets [C] // Proc of the SIGPLAN ' 00 Conf on Programming Language Design and Implementation, 2000:145-156. 被引量:1
  • 7Leupers R. Code Selection for Media Processors with SIMD Instructions[C]//Proc of DATE' 00,2000. 被引量:1
  • 8Corporaal H. Transport Triggered Architectures: Design and Evaluation: [Ph D Thesis][D]. Delft University of Technology, 1995. 被引量:1
  • 9Guthaus M R, Ringenberg J S, Ernst D, et al. MiBeneh: A Free, Commercially Representative Embedded Benchmark Suite[C]//Proc of the 4th Annual IEEE Int'l Workshop on Workload Characterization, 2001 : 3-14. 被引量:1
  • 10McMahon F. The Livermore Fortran Kemels: A Computer Test of the Numerical Performance Range[R]. Technical Report UCRL-53745, Lawrence Livermore National Laboratory, 1986. 被引量:1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部