期刊文献+

二维FFT在GPU上的并行实现 被引量:1

Parallel Implementation of 2D FFT on a Graphics Processing Unit
下载PDF
导出
摘要 FFT算法是高度并行的分治算法,因此适合在GPU(Graphics Processing Unit,图形处理器)的CUDA(Compute Unified Device Architecture,计算统一设备体系结构)构架上实现.阐述了GPU用于通用计算的原理和方法,并在Geforce8800 GT平台上完成了二维卷积FFT的运算实验.实验结果表明,随着图像尺寸的增加,CPU和GPU上的运算量和运算时间大幅度增加,GPU上运算的速度提高倍数也随之增加,平均提升20倍左右. The fact that FFT is a highly paralleled divide-and-conquer algorithm determines that it can be applied to compute unified dovice architecture (CUDA) of graphics processing unit (GPU). This paper deals with the principle and method of applying GPU to general purpose computation. And the algorithm of 2D FFT was simulated on the platform of Geforce8800 GT. The results indicate that with the increase of image size, the calculation and calculating time of CPU and GPU increase significantly and the calculating speed increases by 20 times on the average.
作者 陈瑞 童莹
出处 《南京工程学院学报(自然科学版)》 2009年第2期41-45,共5页 Journal of Nanjing Institute of Technology(Natural Science Edition)
基金 南京工程学院科研基金项目(KXJ07014)
关键词 图形处理器 计算统一设备体系结构 通用计算 二维FFT graphics processing unit computer unified device architecture general purpose computation 2D FFT
  • 相关文献

参考文献8

  • 1MACEDONIA M.The GPU enters computing's mainstream[J].IEEE Computer,2003,36(10):106-108. 被引量:1
  • 2CUDA Programming Guide Version 2.0[M].NVIDIA Corporation,2008:30-35. 被引量:1
  • 3KRUGER J,WESTERMANN R.Linear algebra operators for GPU implementation of numerical algorithms[J].ACM Trans on Graphics,2003,22 (3):908-916. 被引量:1
  • 4HALL J D,CARR N A,HART J C.Cache and bandwidth aware matrix multiplication on the GPU[R].UIUCDCS-R -2003 -2328,Champaign:University of Illinois at Urbana-Champaign,2003. 被引量:1
  • 5MORELAND K,ANGEL E.The FFT on a GPU[C].Proc SIG-GRAPH/EG Conference on Graphics Hardware,2003:78 -81. 被引量:1
  • 6SUMANAWEERA T,DONALD L.Medical image reconstruction with the FFT[C]// RANDIMA F.GPU Gems 2:Programming Techniques for High-performance Graphics and General-purpose Computation.Addison Wesley,2005. 被引量:1
  • 7吴恩华,柳有权.基于图形处理器(GPU)的通用计算[J].计算机辅助设计与图形学学报,2004,16(5):601-612. 被引量:227
  • 8孙世新等编著..并行算法及其应用[M].北京:机械工业出版社,2005:196.

二级参考文献57

  • 1Clark James H.The geometry engine:A VLSI geometry system for graphics[A].In:Computer Graphics Proceedings,Annual Conference Series,ACM SIGGRAPH,Boston,1982.127~133 被引量:1
  • 2Fuchs Herry,Poulton John.Pixel-planes:A VLSI-Oriented design for a raster graphics engine[J].VLSI Design,1981,2(3):20~28 被引量:1
  • 3Eyles John,Austin John,Fuchs Henry,et al.Pixel-plane 4:A summary,advances in computer graphics hardware II[A].Eurographic Seminars Tutorials and Perspectives in Computer Graphics,New York:Springer-Verlag,1988.183~208 被引量:1
  • 4Fuchs Herry,Israel Laura,Poulton John,et al.Pixel-planes 5:A heterogeneous multiprocessor graphics system using processor-enhanced memories[A].In:Computer Graphics Proceedings,Annual Conference Series,ACM SIGGRAPH,Boston,1989.79~88 被引量:1
  • 5http://www.nvidia.com/object/gpu.html[OL] 被引量:1
  • 6http://developer.nvidia.com/[OL] 被引量:1
  • 7http://www.ati.com/developer/[OL] 被引量:1
  • 8http://www.gpgpu.org[OL] 被引量:1
  • 9Joo Luiz Dihl Comba,Dietrich Carlos A,Pagot Christian A,et al.Computation on GPUs:From a programmable pipeline to an efficient stream processor[J].Revista de Informática Teóricae Aplicada,2003,X(2):41~70 被引量:1
  • 10Krüger Jens,Westermann Rüdiger.Linear algebra operators for GPU implementation of numerical algorithms[J].ACM Transactions on Graphics,2003,22(3):908~916 被引量:1

共引文献226

同被引文献7

引证文献1

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部