期刊文献+

实用的机群监控系统 被引量:2

Practical cluster monitoring system
下载PDF
导出
摘要 在研制遥感图像并行处理系统的过程中,发现当前的机群监控不能为高性能计算的调试和运行提供足够支持。通过将MPI库编写的并行程序的运行状态反映在机群监控界面上,能为图像处理模块的调试和运行监测提供了方便。基本原理是将被监控项目记录在一个单独的动态汇聚表中,通过动态增删监测条目来记录需要交换的数据包括对某个特定的MPI进程的监控数据,同时,与Ganglia中Gmon实现的gmond相比较,由于所有的数据传输都是由上层节点的订阅发起的,在没有用户使用监控界面的时候,可以减少数据传输量和网络占用。 During the design and programming of the parallel image processing system, investigation showed that the existing cluster monitoring systems can not provide enough information for debugging and running of high performance computing programs. Practice proved that the debugging and status monitoring of the image processing module are facilitated if the status of process programs are embedded in the GUI interface of the cluster monitoring system. The basic principle is to record all monitored metrics in a separate aggregation table, which stores all monitoring data including data about a specified MPI process. In addition, all data transferred are initialized by subscription of the upper nodes. That assures only necessary data are transferred on the net, which decreases net occupation in comparison with the Gmon system of Ganglia when monitoring GUI is not used by any user.
出处 《计算机工程与设计》 CSCD 北大核心 2008年第1期190-192,212,共4页 Computer Engineering and Design
关键词 机群 监控系统 高性能科学计算 订阅 动态汇聚表 cluster monitoring system high performance computing subscribe dynamic aggregation table
  • 相关文献

参考文献9

二级参考文献23

  • 1陈熠,孟丹,詹剑锋,甄宁.基于联邦的数据公告的设计与实现[J].计算机工程与应用,2004,40(25):107-110. 被引量:4
  • 2刘礼农 等.波动方程三维叠前深度偏移并行计算流程探索[A]..油储地球物理论文集[C].,1999.. 被引量:1
  • 3..http://www.linuxdoc.com.,. 被引量:1
  • 4..http://java.sun.com.,. 被引量:1
  • 5..http://www.fping.com.,. 被引量:1
  • 6David M Geary.Java2图形设计[M].北京:机械工业出版社,2000.. 被引量:1
  • 7Massie M L,Chun B N,Culler D E.The Ganglia Distributed Monitoring System: Design, Implementation, and Experience.submitted for publication, 2003-02. 被引量:1
  • 8Rajkumar Buyya. PARMON:a portable and scalable monitoring system for clusters[J].Software Practice and Experience,2000;30(7):723~739. 被引量:1
  • 9Federico D Sacerdoti,Mason J katz,Matthew L Massie et al. Wide Area Cluster Monitoring with Ganglia. 被引量:1
  • 10Matt Sottile, Ron Minnich. Supermon: A highspeedcluster monitoring system[C].In:Proceedings of Cluster 2002,2002-09. 被引量:1

共引文献15

同被引文献24

引证文献2

二级引证文献13

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部