期刊文献+

FFT的数据并行计算方法研究 被引量:1

Research on Data Parallel Computation Method of FFT
下载PDF
导出
摘要 为满足G(Gigabytes)级像素帧的实时性处理需求,针对信号处理系统中处理计算量大、实时性要求高的特点,剖析了解算过程内在的数据并行特性,深入研究了基于计算阵列的谱图解算数据并行算法。提出了一种基于MPP(Massively Parallel Processor)计算机SIMD PE阵列的FFT的数据并行计算实现方法。首先根据FFT架构中的数据交互一致性,给出了数据并行计算的表达式。提出一种基于PE标识,进行条件操作的SIMD PE阵列数据并行实现方法。该方法不但省去了并行处理中的数据寻址时间开销,而且使得数据并行操作更为规则、简洁,满足了阵列操作规则性强的处理要求,大幅度地提高了MPP计算机并行计算处理速度。该方案是一种简洁有效的PE自治问题解决方案,以更合理的方法和更高的效率实现了常规经典算法,在数据并行计算领域中,无疑具有重要的理论意义和应用价值,将在嵌入式信号处理中发挥愈来愈重要的作用。 In order to satisfy the real-time processing requirement of G-level pixel frame,considering the intensive and real time compu- ting requirement of signal processing in embedded signal system,the inner data parallelism of the calculation process is analyzed,and the data parallel algorithms of spectrogram calculation based on computing array is also researched. A data parallel computation method im- plemented on SIMD PE array of MPP ( Massively Parallel Processor ) computer for FFT transform is presented. Based on the data compu- ting consistency of FFT,the expression of data parallel computing is given firstly. Then a method of data parallel computing based on SIMD PE array to execute conditional operations by using of PE identifier is proposed, which not only omits the time cost of addressing, but makes the data parallel operation more regular and compact ( only in computation statements and move statements). It meets the fea- tures of high regularity required by SIMD and greatly improves MPP computer processing speed,which is also a simple and effective PE autonomy solution,realizing conventional classic algorithms with more rational method and higher efficiency. It has important theoretical significance and application value in the area of data parallel undoubtedly, which will play a more and more important role in embedded signal processing.
出处 《计算机技术与发展》 2017年第10期91-95,共5页 Computer Technology and Development
基金 陕西省科技统筹创新工程计划(2015KTTSGY04-05)
关键词 快速傅里叶变换 SIMD PE阵列 映射语言 MPP计算机 FFT SIMD PE army mapping language MPP computer
  • 相关文献

参考文献5

二级参考文献49

  • 1沈绪榜,张发存,冯国臣,车得亮,王光.计算机体系结构的分类模型[J].计算机学报,2005,28(11):1759-1766. 被引量:10
  • 2沈绪榜,刘泽响,王茹.计算机体系结构的统一模型[J].计算机学报,2007,30(5):729-736. 被引量:17
  • 3HSIASC,LIU B D,et al.VLSI implementation of parallel coefficient-by-coefficient two-dimensional IDCT processor[J].IEEE Transaction,1995,5(5):396-406. 被引量:1
  • 4CHIANG J S,CHUI Y F,et al.A high throughput 2-dimensional DCT/IDCT architecture for real-time and video system[A].Proceedings of the 8th IEEE International Conference on Electronics,Circuits and Systems (ICECS)[C].IEEE Press,2001,Ⅵ.2,867-870. 被引量:1
  • 5CHANG Y T,WANG C L.New systolic array implementation of the 2D discrete cosine transform[J].IEEE Transaction,1995,5(2):150-157. 被引量:1
  • 6Hyesook Lim,Changhoon Yim,et al.Finite wordlength effects of an unified systolic array for 2-d dct/idct[A].Proceedings of the IEEE International Conference on Application-Specific Systems,Architectures,and Processors (ASAP)[C].Washington,DC,USA:IEEE Computer Society,1996,19(21):35-44. 被引量:1
  • 7A Rosenfeld.Parallel image processing using cellular-array computers[J].Computer,1983,1 (1):177 -191. 被引量:1
  • 8K Preston.Cellular logic computers for pattern recognition[J].Gomputer,1983,6(4):36-37. 被引量:1
  • 9Mohamed F Mansour.On the Odd-DFT and its applications to DCT/IDCT computation[J].IEEE transactions,2006,54(7):2819-2822. 被引量:1
  • 10M Narashima,A Peterson.On the computation of the discrete cosine transform[J].IEEE transactions,1978,26 (6):934-936. 被引量:1

共引文献26

同被引文献4

引证文献1

二级引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部