server_monitoring
Differences
This shows you the differences between two versions of the page.
| Both sides previous revisionPrevious revisionNext revision | Previous revision | ||
| server_monitoring [2010/06/16 09:58] – 172.26.0.166 | server_monitoring [2024/01/16 09:21] (current) – removed aorth | ||
|---|---|---|---|
| Line 1: | Line 1: | ||
| - | ===== Server Monitoring ===== | ||
| - | * [[# | ||
| - | * [[# | ||
| - | * [[# | ||
| - | * [[# | ||
| - | |||
| - | |||
| - | ===== Ganglia ===== | ||
| - | |||
| - | [[http:// | ||
| - | |||
| - | You can see the ganglia installation here: http:// | ||
| - | |||
| - | {{: | ||
| - | |||
| - | Interesting documentation: | ||
| - | ==== Troubleshooting ==== | ||
| - | From time to time there are problems with Ganglia' | ||
| - | |||
| - | - Stop data collection daemon on HPC: '' | ||
| - | - Stop monitoring daemon on compute nodes: '' | ||
| - | - Start data collection daemon on HPC: '' | ||
| - | - Wait a minute or two | ||
| - | - Start monitoring daemon on compute nodes: '' | ||
| - | |||
| - | Now go check the Ganglia web interface and see if the nodes have returned. | ||
| - | |||
| - | ===== Monit ===== | ||
| - | |||
| - | Monit is a free open source utility for managing and monitoring, processes, files, directories and filesystems on a UNIX system. Monit conducts automatic maintenance and repair and can execute meaningful causal actions in error situations. | ||
| - | Monit can start a process if it does not run, restart a process if it does not respond and stop a process if it uses too much resources. it logs to syslog or to its own log file and notifies you about error conditions and recovery status via customizable alert. | ||
| - | Monit provides a built-in HTTP(S) interface and you can use a browser to access the Monit server. | ||
| - | |||
| - | M/Monit expand upon Monit' | ||
| - | |||
| - | Get the latest version at: http:// | ||
| - | |||
| - | < | ||
| - | $ tar xfz monit-5.0.3.tar.gz | ||
| - | $ cd monit-5.0.3 | ||
| - | $ ./configure && make && make install</ | ||
| - | Accessing monit: | ||
| - | http:// | ||
| - | |||
| - | ===== Nagios ===== | ||
| - | |||
| - | Nagios is a powerful monitoring system that enables organizations to identify and resolve IT infrastructure problems before they affect critical business processes. http:// | ||
| - | |||
| - | === Installation === | ||
| - | |||
| - | ---- | ||
| - | |||
| - | Download the latest version of nagios while hot, from http:// | ||
| - | < | ||
| - | $ cd nagios-3.2.0 | ||
| - | $ ./configure | ||
| - | $ make all | ||
| - | $ useradd nagios | ||
| - | $ make install | ||
| - | $ make install-init | ||
| - | $ make install-commandmode | ||
| - | $ make install-config | ||
| - | $ make install-webconf | ||
| - | </ | ||
| - | === Configuration === | ||
| - | |||
| - | ---- | ||
| - | Running the following command will create a new file called htpasswd.users in the / | ||
| - | < | ||
| - | |||
| - | Download and install plugins | ||
| - | < | ||
| - | $ wget http:// | ||
| - | $ tar xfz nagios-plugins-1.4.14.tar.gz | ||
| - | $ cd nagios-plugins-1.4.14 | ||
| - | $ ./configure && make && make install | ||
| - | </ | ||
| - | Edit the configuration files to add host and services to be monitored: | ||
| - | < | ||
| - | |||
| - | Check remote services http:// | ||
| - | === Accessing Nagios === | ||
| - | |||
| - | ---- | ||
| - | http:// | ||
| - | |||
| - | with username = " | ||
| - | ==== Zabbix ==== | ||
| - | ---- | ||
| - | Installation: | ||
| - | |||
| - | RHEL-compatible Linux: Ref: http:// | ||
| - | < | ||
| - | name=Andrew Farley RPM Repository | ||
| - | baseurl=http:// | ||
| - | enabled=1 | ||
| - | gpgcheck=0' | ||
| - | |||
| - | |||
| - | |||
| - | And then you can install zabbix agent, zabbix server, zabbix get, or zabbix proxy with… | ||
| - | < | ||
| - | sudo yum install zabbix-agent | ||
| - | sudo yum install zabbix-server | ||
| - | sudo yum install zabbix-get | ||
| - | sudo yum install zabbix-proxy </ | ||
| - | |||
| - | If it fails to install, you might need to clean the metadata with the following command and try again… | ||
| - | |||
| - | sudo yum clean metadata | ||
| - | |||
| - | |||
| - | Debian-Based Linux: | ||
| - | ---- | ||
| - | < | ||
| - | root@simple: | ||
| - | zabbix-agent - network monitoring solution - agent | ||
| - | zabbix-frontend-php - network monitoring solution - PHP front-end | ||
| - | zabbix-proxy-mysql - network monitoring solution - proxy (using MySQL) | ||
| - | zabbix-proxy-pgsql - network monitoring solution - proxy (using PostgreSQL) | ||
| - | zabbix-server-mysql - network monitoring solution - server (using MySQL) | ||
| - | zabbix-server-pgsql - network monitoring solution - server (using PostgreSQL) | ||
| - | root@simple: | ||
| - | </ | ||
| - | |||
| - | === Accessing Zabbix === | ||
| - | |||
| - | http:// | ||
| - | username: Admin | ||
| - | password: zabbix | ||
| - | |||
server_monitoring.1276682304.txt.gz · Last modified: by 172.26.0.166
