期刊文献+

节点内无冗余图像合成方法 被引量:1

Composition without Inactive Pixels in Cluster Node
下载PDF
导出
摘要 为了提高图像合成性能,提出一种基于GPU加速的节点内多GPU图像高效合成方法.首先通过判断帧缓存是否更新生成有效像素表,然后采用GPGPU计算生成有效像素合成索引列表,最后用其指导CPU端图像合成操作,从而完全避免了CPU端的冗余合成计算.理论分析表明,在理想负载平衡条件下,该方法的加速比为图像有效像素百分比与节点内GPU数量的比值.实验结果表明,在节点内配置4个GPU时,针对有效像素比为12%~76%的高分辨率图像,该方法比全图像合成方法的性能提高了3~5倍. A novel GPU acceleration method was proposed in this paper to improve performance of image composition in multi-GPU cluster node. The proposed method constructed an active pixel list by judging whether frame buffer pixels were updated. An index list was computed from the active pixel list using the GPGPU technology. Then the index list was used to conduct pixels composition on CPU. Given these operations, inactive pixel compositions could be totally avoided. Theoretical analysis show that the speedup of full image composition is determined by how many GPUs are configured in node and the proportion of active pixels to the full image. In the node with 4 GPUs, the speedup was about 3-5 times when composited different images whose active pixels was about 12% to 76%.
出处 《计算机辅助设计与图形学学报》 EI CSCD 北大核心 2013年第5期646-652,共7页 Journal of Computer-Aided Design & Computer Graphics
基金 国家"九七三"重点基础研究发展计划项目(2009CB723803) 国家自然科学基金(61170157) xxx项目(2011AA7123022)
关键词 图像合成 多GPU GPGPU 并行绘制 sort—last composition multi-GPU GPGPU parallel rendering sort-last
  • 相关文献

参考文献20

  • 1Bhaniramka P, Robert P C D, Eilemann S. OpenGL multipipe SDK: a toolkit for scalable parallel rendering [C] // Proceedings of IEEE Visualization. Los Alamitos: IEEE Computer Society Press, 2005:119-126. 被引量:1
  • 2Eilemann S, Makhinya M, Pajarola R. Equalizer; a scalable parallel rendering framework [J]. IEEE Transactions on Visualization and Computer Graphics, 2009, 15(3) : 436-452. 被引量:1
  • 3Zhou K, Hou Q M, Ren Z, et al. RenderAnts: interactive Reyes rendering on GPUs [J]. ACM Transactions on Graphics, 2009, 28(5): Article No. 155. 被引量:1
  • 4刘华海,王攀,蔡勋,曾亮,王文珂,李思昆.面向多核CPU多GPU的节点内并行混合绘制模型[J].系统仿真学报,2012,24(1):94-98. 被引量:3
  • 5Moll L, Heirich A, Shand M. Sepia: scalable 3D eompositing using PCI pamette [C] //Proceedings of IEEE Symposium on Field -Programmable Custom Computing Machines. Los Alamitos: IEEE Computer Society Press, 1999:146-155. 被引量:1
  • 6Lombeyda S, Moll L, Shand M, et al. Scalable interactive volume rendering using Off-The-Shelf components [C] // Proceedings of the IEEE Symposium on Parallel and Large-data Visualization and Graphics. Los Alamitos: IEEE Computer Society Press, 2001:115-121. 被引量:1
  • 7Stoll G, Eldridge M, Patterson D, et al. Lighming-2; a high-performance display subsystem for PC clusters [C] // Proceedings of ACM SIGGRAPH. New York: ACM Press, 2001:141-148. 被引量:1
  • 8Zhang X Y, Ba]a] C, Blanke W. Scalable isosur{ace visualization of massive datasets on COTS clusters [C] // Proceedings of IEEE Symposium on Parallel and Large Data Visualization and Graphics. Los Alamitos: IEEE Computer Society Press, 2001 : 51-58. 被引量:1
  • 9Muraki S, Ogata M, Ma K L, et al. Next-generation visual supereomputing using PC clusters with volume graphics hardware deviees [C]//Proceedings of ACM/IEEE Conference on Supereomputing. New York: ACM Press, 2001:51-95. 被引量:1
  • 10Molnar S, Eyles J, Pouhon J. PixelFlow: highspeed rendering using image composition [C] /]Proceedings of ACM SIGGRAPH. New York: ACM Press, 1992:231-240. 被引量:1

二级参考文献10

  • 1T Fogal, H Childs, S Shankar, J Krueger, R D Bergeron, P Hatcher Large Data Visualization on Distributed Memory Multi-GPU Clusters [C]// Proc. of High Performance Graphics 2010. Switzerland: Eurographics Association, 2010: 57-66. 被引量:1
  • 2Kitware Incorporated. ParaView. [EB/OL]. (2009) [2011 ]. http://www.paraview.org. 被引量:1
  • 3NVIDIA Incorporated. NVIDIA Hybrid_SLI. [EB/OL]. (2010) [2011]. http ://www.nvidia.com/obj ectJhybrid_sli.html. 被引量:1
  • 4AMD Incorporated. AMD CrossFire. [EB/OL]. (2010) [2011]. http://www.amd.com/us/PRODUCTS/ WORKSTATION/ GRAPHICS/ CROSSFIRE-PRO/Pages/crossfire-pro.aspx. 被引量:1
  • 5G Humphreys, M Eldridge, I Buck, G Stoll, M Everett, P Hanrahan. WireGL: A Scalable Graphics System for Clusters [C]// Proc. of SIGGRAPH 2001. USA: ACM, 2001: 129-140. 被引量:1
  • 6G Humphreys, H Mike, T James. Chromium: A Stream- Processing Framework for Interactive Rendering on Clusters [C]// Proc. of SIGGRAPH 2002. USA: ACM, 2002: 693-702. 被引量:1
  • 7Moerschell J Owens. Distributed Texture Memory in a Multi- GPU Environment [J]. Computer Graphics Forum (S0167-7055), 2008, (27): 130-151. 被引量:1
  • 8Lefohn S Sengupta, J Kniss, R Strzodka, J Owens Glift. Generic, Efficient, Random-Access GPU Data Structures [J]. ACM Transactions on Graphics (S0730-0301), 2006, (25): 60-99. 被引量:1
  • 9P Bhaniramka, P C D Robert, S Eilemann. OpenGL Multipipe SDK: A Toolkit for Scalable Parallel Rendering [C]// Proc. of IEEE Visualization 2005. USA: IEEE, 2005: 119-126. 被引量:1
  • 10S Eilemarm, M Maldainya, R Pajarola. Equalizer: A Scalable Parallel Rendering Framework Visualization and Computer Graphics [J]. IEEE Transactions on Graphics (S0730-0301), 2009, (15): 436-452. 被引量:1

共引文献2

同被引文献7

引证文献1

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部