check_hadoop-dfs.sh
sh-compliant script to check the number of available datanodes of a Hadoop cluster, as well as its total and used size.
= Description =
This is very much a work in progress: I'm currently busy bringing up a large Hadoop cluster and the checks are written along the way. Please note that a small shell snippet needs to be run via sudo by the Nagios user to get the actual DFS statistics. This is because I didn't want to give Hadoop-related permissions to the Nagios user.
Put the small snippet attached below in a directory of your choice (configurable via -s/--path-sh), but name it get-dfsreport.sh and make it readable, writable and executable by root only. Then allow the Nagios user to run the script via /etc/sudoers (preferably edited with visudo).
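The setup described above could look roughly like the following sketch. The function names, the `nagios` user name and the exact sudoers rule are assumptions made for illustration; the content of get-dfsreport.sh itself is the attached file and is not reproduced here.

```shell
#!/bin/sh
# Sketch only: function names, the "nagios" user and the sudoers rule are assumptions.

# Copy the snippet into place under the required name with mode 0700
# (read, write and execute for the owner only).
# Run this as root so that root ends up owning the file.
install_snippet() {
    src=$1
    dest_dir=${2:-/usr/local/sbin}
    mkdir -p "$dest_dir" &&
    cp "$src" "$dest_dir/get-dfsreport.sh" &&
    chmod 700 "$dest_dir/get-dfsreport.sh"
}

# Print a sudoers rule that lets the Nagios user run only this one script.
# Add the printed line with visudo rather than editing /etc/sudoers by hand.
sudoers_rule() {
    dest_dir=${1:-/usr/local/sbin}
    printf '%s ALL=(root) NOPASSWD: %s/get-dfsreport.sh\n' \
        "${NAGIOS_USER:-nagios}" "$dest_dir"
}
```

Calling `sudoers_rule /usr/local/sbin` prints a single rule for the default path. Restricting the rule to one absolute path keeps the Nagios user from running anything else as root.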
= Version =
Version 1.0, 2009, Mike Adolphs ([http://www.matejunkie.com/])
= -h/--help Output =
check_hadoop-dfs.sh is a Nagios plugin to check the status of HDFS, Hadoop's
underlying, redundant, distributed file system.
check_hadoop-dfs.sh -s /usr/local/sbin [-w 10] [-c 5]
Options:
-s|--path-sh)
Path to the shell script that is mentioned in the
documentation. Default is: /usr/local/sbin
-w|--warning)
Defines the warning level for available datanodes. Default
is: off
-c|--critical)
Defines the critical level for available datanodes. Default
is: off
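As an illustration of how the three documented options might be handled in plain sh, here is a minimal sketch. It is not taken from the plugin: `parse_args`, the variable names and the use of return code 3 (UNKNOWN in the Nagios plugin convention) for bad arguments are assumptions, and help handling is left out.

```shell
#!/bin/sh
# Sketch only: parse_args and the variable names are assumptions.

parse_args() {
    PATH_SH=/usr/local/sbin   # documented default for -s/--path-sh
    WARN=                     # empty stands for the documented default "off"
    CRIT=
    while [ $# -gt 0 ]; do
        case $1 in
            -s|--path-sh|-w|--warning|-c|--critical)
                # Every documented option takes a value.
                if [ $# -lt 2 ]; then
                    echo "Option $1 needs a value" >&2
                    return 3
                fi
                ;;
            *)
                echo "Unknown option: $1" >&2
                return 3
                ;;
        esac
        case $1 in
            -s|--path-sh)  PATH_SH=$2 ;;
            -w|--warning)  WARN=$2 ;;
            -c|--critical) CRIT=$2 ;;
        esac
        shift 2
    done
}
```

A script would call it as `parse_args "$@"` and then read `PATH_SH`, `WARN` and `CRIT`.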
= Output Example =
user@host: ~ $ ./check_hadoop-dfs.sh -s /var/nagios/home/bin
OK - Datanodes up and running: 50, DFS total: 20147365 MB, DFS used: 0 MB (0%) | 'datanodes_available'=50 'dfs_total'=20147365 'dfs_used'=0
user@host: ~ $ ./check_hadoop-dfs.sh -s /var/nagios/home/bin -w 40 -c 30
OK - Datanodes up and running: 50, DFS total: 20147365 MB, DFS used: 0 MB (0%) | 'datanodes_available'=50 'dfs_total'=20147365 'dfs_used'=0
user@host: ~ $ ./check_hadoop-dfs.sh -s /var/nagios/home/bin -w 60 -c 40
WARNING - Datanodes up and running: 50, DFS total: 20147365 MB, DFS used: 0 MB (0%) | 'datanodes_available'=50 'dfs_total'=20147365 'dfs_used'=0
user@host: ~ $ ./check_hadoop-dfs.sh -s /var/nagios/home/bin -w 70 -c 60
CRITICAL - Datanodes up and running: 50, DFS total: 20147365 MB, DFS used: 0 MB (0%) | 'datanodes_available'=50 'dfs_total'=20147365 'dfs_used'=0
user@host: ~ $ ./check_hadoop-dfs.sh -s /var/nagios/home/bin -w 20 -c 40
Please adjust your warning/critical thresholds. The warning must be higher than the critical level!
user@host: ~ $ ./check_hadoop-dfs.sh -s /var/nagios/home/bin -w 30
Please also set a critical value when you want to use warning/critical thresholds!
user@host: ~ $ ./check_hadoop-dfs.sh -s /var/nagios/home/bin -c 30
Please also set a warning value when you want to use warning/critical thresholds!
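The behaviour shown in these examples can be reproduced with a short sketch like the one below. It is an approximation under assumptions, not the plugin's code: the function name `check_datanodes`, the "less than or equal" comparisons at the thresholds, and return code 3 for invalid thresholds are guesses. Return codes 0, 1 and 2 follow the usual Nagios plugin convention for OK, WARNING and CRITICAL.

```shell
#!/bin/sh
# Sketch only: function name and exact comparison operators are assumptions.

# check_datanodes COUNT TOTAL_MB USED_MB [WARN] [CRIT]
check_datanodes() {
    count=$1 total=$2 used=$3 warn=$4 crit=$5

    # Thresholds must be given as a pair, and warning must exceed critical.
    if [ -n "$warn" ] && [ -z "$crit" ]; then
        echo "Please also set a critical value when you want to use warning/critical thresholds!"
        return 3
    fi
    if [ -z "$warn" ] && [ -n "$crit" ]; then
        echo "Please also set a warning value when you want to use warning/critical thresholds!"
        return 3
    fi
    if [ -n "$warn" ] && [ "$warn" -le "$crit" ]; then
        echo "Please adjust your warning/critical thresholds. The warning must be higher than the critical level!"
        return 3
    fi

    # Fewer available datanodes is worse, so compare downwards.
    status=OK
    rc=0
    if [ -n "$warn" ]; then
        if [ "$count" -le "$crit" ]; then
            status=CRITICAL
            rc=2
        elif [ "$count" -le "$warn" ]; then
            status=WARNING
            rc=1
        fi
    fi

    # Integer percentage of used space; avoid dividing by zero.
    pct=0
    if [ "$total" -gt 0 ]; then
        pct=$((used * 100 / total))
    fi

    echo "$status - Datanodes up and running: $count, DFS total: $total MB, DFS used: $used MB ($pct%) | 'datanodes_available'=$count 'dfs_total'=$total 'dfs_used'=$used"
    return $rc
}
```

For example, `check_datanodes 50 20147365 0 60 40` prints the WARNING line from the examples above and returns 1.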
Submitted by: Mike Adolphs
Hits: 410
Overall downloads: 153
Added: 2009-06-03 00:00:00
| Filename | Size | Description | DL |
|---|---|---|---|
| check_hadoop-dfs.sh | 4 kb | The script (Version 1.0) | 100 |
| get-dfsreport.sh | 0 kb | Code snippet that needs to be sudo'd | 89 |
| LICENSE | 53 kb | The appropriate license | 77 |