期刊文献+

核外计算中的几种I/O优化方法 被引量:4

Research on I?O Optimizations in Out-of-Core Computation
下载PDF
导出
摘要 大数据量应用问题引入核外计算模式,由于访问磁盘数据的速度比较慢,I/O成为核外计算性能重要的限制因素·提出了一种使用运行库进行I/O优化的方法,给出了3种有效的优化策略:规则区域筛选、数据预取和边缘重用·编程人员可针对不同的应用问题使用相应的优化API来缩短程序执行时间·实验结果表明,通过减少I/O操作次数和内外存交换的数据量以及隐藏部分I/O操作延迟,有效提高了核外计算的性能· Applications with large amounts of data bring the mode of out-of-core computation in which I/O becomes the important limiting factor because of the low speed of accessing data on disks. A method of using runtime library is presented for I/O optimizations. Three optimization strategies including data sieving on regular section, data prefetching and data reuse on the edge are described. Programmers may adopt corresponding APIs for different applications to reduce the execution time. The experiment results show that the performance of the out-of-core computation is efficiently improved by reducing the number of I/O operations and the amount of exchanged data between the main memory and disks as well as hiding part of the I/O operation latency.
出处 《计算机研究与发展》 EI CSCD 北大核心 2005年第10期1820-1825,共6页 Journal of Computer Research and Development
基金 国家自然科学基金项目(90412001)
关键词 核外计算 规则区域筛选 预取 边缘重用 out-of-core computation regular section sieving prefetching data reuse on the edge
  • 相关文献

参考文献9

  • 1J. Ramanujam, M. Kandemir, A. Choudhary, et al.Compilation techniques for out-of-core parallel computations.Parallel Computing, 1998, 23(3-4): 597~628. 被引量:1
  • 2H. Simitci, D. Reed. A. comparison of logical and physical parallel I/O patterns. The Int'l Journal of High Performance Computing Applications, 1998, 12(3): 364~380. 被引量:1
  • 3M. Kandemir, A. Choudhary, J. Ramanujam. Compiler optimizations for I/O-intensive computations. In: Proc. 1999Int'l Conf. Parallel Processing. Wakamatsu, Japan: IEEE Computer Press, 1999. 164 ~171. 被引量:1
  • 4D. Callahan, K. Kennedy, A. Porterfield. Software prefetching.In: Proc. 4th Int'l Conf. Architectural Support for Programming Languages and Operation Systems. New York: ACM Press,1991. 40~52. 被引量:1
  • 5W.Y. Chen. Data preload for superscalar and VLIW processors:[Ph. D. dissertation]. Illinois: University of Illinois, 1993. 被引量:1
  • 6连瑞琦,张兆庆,乔如良.指令级并行编译器的数据预取及优化方法[J].计算机学报,2000,23(6):576-584. 被引量:8
  • 7A.D. Brown, T. C. Mowry, O. Krieger. Compiler-based I/O prefetching for out-of-core applications. ACM Trans. Computer Systems, 2001, 19(2): 111~170. 被引量:1
  • 8S. Carr, K. S. McKinley, C. W. Tseng. Compiler optimizations for improving data locality. In: Proc. 6th Int'l Conf.Architectural Support for Programming Languages and Operating Systems. New York: ACM Press, 1994. 252~262. 被引量:1
  • 9M.S. Lam, M. E. Wolf. A. data locality optimizing algorithm.ACM SIGPLAN Notices, 2004, 39(4): 442~459. 被引量:1

二级参考文献2

  • 1Chen W Y W,博士学位论文,1993年 被引量:1
  • 2Chen Tienfu,Proceedings of the 5th International Conference on Architectural Support for Pro,1992年,51页 被引量:1

共引文献7

同被引文献21

  • 1王威,胡铭曾.核外计算中I/O优化策略的研究[J].哈尔滨商业大学学报(自然科学版),2005,21(5):600-603. 被引量:3
  • 2Huijuan Zhang, Timothy S Newman. Efficient Parallel Out-of-Core Isosurface Extraction [R]// IEEE Symposium on Parallel and Large-Data Visualization and Graphics. USA: IEEE, 2003: 9-16. 被引量:1
  • 3Y-J Chiang, C T Silva. External Memory Techniques for Isosurface Extraction in Scientific Visualization [C]// DIMACS Series. Boston MA USA: American Mathematical Society, 2005. 被引量:1
  • 4Qin Wang, Joseph Ja Ja, Amitabh Varshney. An efficient and scalable parallel algorithm for out-of-core isosurface, extraction and rendering [J]. Journal of Parallel and Distributed Computing, Parallel Distrib. Comput. (S0743-7315), 2007, 67(5): 592-603. 被引量:1
  • 5J Wilhelms, A Van Gelder. Octrees for faster isosurface generation [J]. Computer Graphics (S0097-8493), 2002, 24(9): 57-62. 被引量:1
  • 6Qinming Shi, Joseph Ja Ja. Isosurface Extraction and Spatial Filtering Using Persistent Octree (POT) [J]. IEEE Transactions on Visualization and Computer Graphics (SI077-2626), 2006, 12(5): 1283-1290. 被引量:1
  • 7H Edelsbrunner. A new approach to rectangle intersections [J]. Comput. Math. (S0020-7160), 2003, 13(7): 209-219. 被引量:1
  • 8Y-J Chiang, C T Silva, W J Schroeder. Interactive Out-Of-Core Isosurface Extraction [C]// IEEE Visualization' 01. USA: IEEE, 2001: 16-174. 被引量:1
  • 9Distributed SBP Cholesky Factorization Algorithms with Near- Optimal Scheduling[ J ], FRED G. GUSTAVSON, LARS KARLS-SON, BO KAGSTRO M, ACM Transactions on Mathematical Software,2009, 36 (2) :1 -25. 被引量:1
  • 10A Fully Portable High Performance Minimal Storage Hybrid Format Cholesky Algorithm[ J ], BJARNE S. ANDERSEN, JOHN A. GUNNELS, FRED G. GUSTAVSON, JOHN K. REID, JERZY WASNIEWSKI, ACM Transactions on Mathematical Software,2005, 31(2) :201 -227. 被引量:1

引证文献4

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部