server_monitoring
Differences
This shows you the differences between two versions of the page.
server_monitoring [2010/06/16 09:38] – 172.26.0.166 | server_monitoring [2010/06/21 22:42] – 172.26.14.218 | ||
---|---|---|---|
Line 1: | Line 1: | ||
===== Server Monitoring ===== | ===== Server Monitoring ===== | ||
- | * [[#ganglia|Ganglia]] - Monitors cluster CPU, disk, network usage | + | * [[server_monitoring: |
- | * [[#monit|Monit]] - Monitors specific services | + | * [[server_monitoring: |
- | * [[#nagios|Nagios]] - Monitors servers, | + | * [[server_monitoring: |
- | * [[#Zabbix|Zabbix]] - Monitors servers, | + | * [[server_monitoring: |
- | + | ||
- | + | ||
- | ===== Ganglia ===== | + | |
- | + | ||
- | [[http:// | + | |
- | + | ||
- | You can see the ganglia installation here: http:// | + | |
- | + | ||
- | {{: | + | |
- | + | ||
- | Interesting documentation: | + | |
- | ==== Troubleshooting ==== | + | |
- | From time to time there are problems with Ganglia' | + | |
- | + | ||
- | - Stop data collection daemon on HPC: '' | + | |
- | - Stop monitoring daemon on compute nodes: '' | + | |
- | - Start data collection daemon on HPC: '' | + | |
- | - Wait a minute or two | + | |
- | - Start monitoring daemon on compute nodes: '' | + | |
- | + | ||
- | Now go check the Ganglia web interface and see if the nodes have returned. | + | |
- | + | ||
- | ===== Monit ===== | + | |
- | + | ||
- | Monit is a free open source utility for managing and monitoring processes, files, directories, and filesystems on a UNIX system. Monit conducts automatic maintenance and repair and can execute meaningful causal actions in error situations. | + | |
- | Monit can start a process if it is not running, restart a process if it does not respond, and stop a process if it uses too many resources. It logs to syslog or to its own log file and notifies you about error conditions and recovery status via customizable alerts. | + | |
- | Monit provides a built-in HTTP(S) interface and you can use a browser to access the Monit server. | + | |
- | + | ||
- | M/Monit expand upon Monit' | + | |
- | + | ||
- | Get the latest version at: http:// | + | |
- | + | ||
- | < | + | |
- | $ tar xfz monit-5.0.3.tar.gz | + | |
- | $ cd monit-5.0.3 | + | |
- | $ ./configure && make && make install</ | + | |
- | Accessing monit: | + | |
- | http:// | + | |
===== Nagios ===== | ===== Nagios ===== | ||
Line 48: | Line 10: | ||
=== Installation === | === Installation === | ||
- | |||
- | ---- | ||
Download the latest version of Nagios from http:// | Download the latest version of Nagios from http:// | ||
Line 86: | Line 46: | ||
with username = " | with username = " | ||
- | |||
==== Zabbix ==== | ==== Zabbix ==== | ||
---- | ---- | ||
- | Installation: | + | Installation: |
- | RHEL-compatible Linux: | + | RHEL-compatible Linux: |
< | < | ||
name=Andrew Farley RPM Repository | name=Andrew Farley RPM Repository | ||
Line 110: | Line 69: | ||
sudo yum clean metadata | sudo yum clean metadata | ||
+ | |||
+ | |||
+ | Debian-based Linux: |
+ | ---- | ||
+ | < | ||
+ | root@simple: | ||
+ | zabbix-agent - network monitoring solution - agent | ||
+ | zabbix-frontend-php - network monitoring solution - PHP front-end | ||
+ | zabbix-proxy-mysql - network monitoring solution - proxy (using MySQL) | ||
+ | zabbix-proxy-pgsql - network monitoring solution - proxy (using PostgreSQL) | ||
+ | zabbix-server-mysql - network monitoring solution - server (using MySQL) | ||
+ | zabbix-server-pgsql - network monitoring solution - server (using PostgreSQL) | ||
+ | root@simple: | ||
+ | </ | ||
+ | |||
+ | === Accessing Zabbix === | ||
+ | |||
+ | http:// | ||
+ | username: Admin | ||
+ | password: zabbix | ||
+ | |||
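
The Debian package listing above uses a ''name - description'' layout, with one package per line. As a rough sketch only, the Python below shows one way to turn such a listing into a lookup table and pick out the packages for a given database backend. The function names ''parse_listing'' and ''packages_with_suffix'' are hypothetical, and the sample text is copied from the listing rather than fetched from a live package manager.

```python
# Sample lines copied from the Debian package listing above.
LISTING = """\
zabbix-agent - network monitoring solution - agent
zabbix-frontend-php - network monitoring solution - PHP front-end
zabbix-proxy-mysql - network monitoring solution - proxy (using MySQL)
zabbix-proxy-pgsql - network monitoring solution - proxy (using PostgreSQL)
zabbix-server-mysql - network monitoring solution - server (using MySQL)
zabbix-server-pgsql - network monitoring solution - server (using PostgreSQL)
"""


def parse_listing(text):
    """Return a dict mapping each package name to its description.

    Hypothetical helper: assumes every useful line looks like
    "name - description".
    """
    packages = {}
    for line in text.splitlines():
        # Split on the first " - " only, because the descriptions
        # themselves contain " - " as well.
        name, sep, description = line.partition(" - ")
        if sep:
            packages[name.strip()] = description.strip()
    return packages


def packages_with_suffix(packages, suffix):
    """Return the sorted package names that end with the given suffix.

    Hypothetical helper, e.g. suffix "-mysql" for the MySQL variants.
    """
    return sorted(name for name in packages if name.endswith(suffix))


if __name__ == "__main__":
    pkgs = parse_listing(LISTING)
    for name in packages_with_suffix(pkgs, "-mysql"):
        print(name, "->", pkgs[name])
```

Splitting on the first separator only keeps the full description intact, which matters here because every description contains a second '' - ''.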