server_monitoring
Differences
This shows you the differences between two versions of the page.
Both sides previous revisionPrevious revisionNext revision | Previous revisionNext revisionBoth sides next revision | ||
server_monitoring [2009/11/27 12:10] – alan | server_monitoring [2010/06/16 09:33] – 172.26.0.166 | ||
---|---|---|---|
Line 2: | Line 2: | ||
* [[# | * [[# | ||
* [[# | * [[# | ||
- | * [[# | + | * [[# |
+ | * [[# | ||
===== Ganglia ===== | ===== Ganglia ===== | ||
Line 13: | Line 15: | ||
Interesting documentation: | Interesting documentation: | ||
- | |||
==== Troubleshooting ==== | ==== Troubleshooting ==== | ||
From time to time there are problems with Ganglia' | From time to time there are problems with Ganglia' | ||
- Stop data collection daemon on HPC: '' | - Stop data collection daemon on HPC: '' | ||
- | - Stop monitoring daemon on HPC: '' | ||
- Stop monitoring daemon on compute nodes: '' | - Stop monitoring daemon on compute nodes: '' | ||
- Start data collection daemon on HPC: '' | - Start data collection daemon on HPC: '' | ||
- | - Star monitoring daemon on HPC: '' | + | - Wait a minute or two |
- Start monitoring daemon on compute nodes: '' | - Start monitoring daemon on compute nodes: '' | ||
Line 87: | Line 87: | ||
with username = " | with username = " | ||
+ | ==== Zabbix ==== | ||
+ | ---- | ||