User Tools

Site Tools


server_monitoring

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revisionPrevious revision
Next revision
Previous revision
Next revisionBoth sides next revision
server_monitoring [2010/01/11 11:26] 172.26.0.166server_monitoring [2010/06/16 09:51] 172.26.15.75
Line 2: Line 2:
   * [[#ganglia|Ganglia]] - Monitors cluster CPU, disk, network usage   * [[#ganglia|Ganglia]] - Monitors cluster CPU, disk, network usage
   * [[#monit|Monit]] - Monitors specific services   * [[#monit|Monit]] - Monitors specific services
-  * [[#nagios|Nagios]] - Monitors services services+  * [[#nagios|Nagios]] - Monitors servers,hosts and services 
 +  * [[#Zabbix|Zabbix]] -Monitor servers,host and services 
  
 ===== Ganglia ===== ===== Ganglia =====
Line 13: Line 15:
  
 Interesting documentation: http://www.ibm.com/developerworks/wikis/display/WikiPtype/ganglia Interesting documentation: http://www.ibm.com/developerworks/wikis/display/WikiPtype/ganglia
- 
 ==== Troubleshooting ==== ==== Troubleshooting ====
 From time to time there are problems with Ganglia's web interface.  You can restart the needed services following this basic procedure: From time to time there are problems with Ganglia's web interface.  You can restart the needed services following this basic procedure:
Line 20: Line 21:
   - Stop monitoring daemon on compute nodes: ''rocks run host compute %%'%%service gmond stop%%'%%''   - Stop monitoring daemon on compute nodes: ''rocks run host compute %%'%%service gmond stop%%'%%''
   - Start data collection daemon on HPC: ''service gmetad start''   - Start data collection daemon on HPC: ''service gmetad start''
 +  - Wait a minute or two
   - Start monitoring daemon on compute nodes: ''rocks run host compute %%'%%service gmond start%%'%%''   - Start monitoring daemon on compute nodes: ''rocks run host compute %%'%%service gmond start%%'%%''
  
Line 84: Line 86:
  
 with  username = "nagiosadmin" and password = "nagios" with  username = "nagiosadmin" and password = "nagios"
 +==== Zabbix ====
 +----
 +Installation:
 +
 +RHEL-compatible Linux:
 +<code>sudo echo '[andrewfarley]
 +name=Andrew Farley RPM Repository
 +baseurl=http://repo.andrewfarley.com/centos/$releasever/$basearch/
 +enabled=1
 +gpgcheck=0' > /etc/yum.repos.d/andrewfarley.com.repo</code>
 +
 +
 +
 +And then you can install zabbix agent, zabbix server, zabbix get, or zabbix proxy with…
 +<file>
 +    sudo yum install zabbix-agent
 +    sudo yum install zabbix-server
 +    sudo yum install zabbix-get
 +    sudo yum install zabbix-proxy </file>
 +
 +If it fails to install, you might need to clean the metadata with the following command and try again…
 +
 +    sudo yum clean metadata
 +
 +
 +Debian-Based Linux:
 +----
 +<code>
 +root@simple:~# apt-cache search zabbix 
 +zabbix-agent - network monitoring solution - agent
 +zabbix-frontend-php - network monitoring solution - PHP front-end
 +zabbix-proxy-mysql - network monitoring solution - proxy (using MySQL)
 +zabbix-proxy-pgsql - network monitoring solution - proxy (using PostgreSQL)
 +zabbix-server-mysql - network monitoring solution - server (using MySQL)
 +zabbix-server-pgsql - network monitoring solution - server (using PostgreSQL)
 +root@simple:~# 
 +</code>
 +
 +=== Accessing Zabbix ===
 +
 +http://172.26.12.29/zabbix
 +username: Admin
 +password: zabbix