check_hadoop-dfs.sh (Visit this link)

rating from 0 votes
sh-compliant script to check the amount of available datanodes of a Hadoop cluster and its total and used size as well.
= Description =
Highly work in progress since I'm currently occupied with kicking off a large Hadoop cluster and the checks are written en passant. Please note that there's a small shell snippet that needs to be sudo'd by the Nagios user to get the actual dfs statistics. This is because I didn't want to give hadoop related permissions to the Nagios user.
Put the small snippet attached below in a directory of your choice (configurable via -s/--path-sh), but name it get-dfsreport.sh and make it read, write and accessible by root only. Then enable the Nagios user via /etc/sudoers (or better visudo) to run the script.

= Version =
Version 1.0, 2009, Mike Adolphs ([http://www.matejunkie.com/])

= -h/--help Output =
check_hadoop-dfs.sh is a Nagios plugin to check the status of HDFS, Hadoop's
underlying, redundant, distributed file system.

check_hadoop-dfs.sh -s /usr/local/sbin [-w 10] [-c 5]

Options:
-s|--path-sh)
Path to the shell script that is mentioned in the
documentation. Default is: /usr/local/sbin
-w|--warning)
Defines the warning level for available datanodes. Default
is: off
-c|--critical)
Defines the critical level for available datanodes. Default
is: off

= Output Example =
user@host: ~ $ ./check_hadoop-dfs.sh -s /var/nagios/home/bin
OK - Datanodes up and running: 50, DFS total: 20147365 MB, DFS used: 0 MB (0%) | 'datanodes_available'=50 'dfs_total'=20147365 'dfs_used'=0
user@host: ~ $ ./check_hadoop-dfs.sh -s /var/nagios/home/bin -w 40 -c 30
OK - Datanodes up and running: 50, DFS total: 20147365 MB, DFS used: 0 MB (0%) | 'datanodes_available'=50 'dfs_total'=20147365 'dfs_used'=0
user@host: ~ $ ./check_hadoop-dfs.sh -s /var/nagios/home/bin -w 60 -c 40
WARNING - Datanodes up and running: 50, DFS total: 20147365 MB, DFS used: 0 MB (0%) | 'datanodes_available'=50 'dfs_total'=20147365 'dfs_used'=0
user@host: ~ $ ./check_hadoop-dfs.sh -s /var/nagios/home/bin -w 70-c 60
CRITICAL - Datanodes up and running: 50, DFS total: 20147365 MB, DFS used: 0 MB (0%) | 'datanodes_available'=50 'dfs_total'=20147365 'dfs_used'=0
user@host: ~ $ ./check_hadoop-dfs.sh -s /var/nagios/home/bin -w 20 -c 40
Please adjust your warning/critical thresholds. The warning must be higher than the critical level!
user@host: ~ $ ./check_hadoop-dfs.sh -s /var/nagios/home/bin -w 30
Please also set a critical value when you want to use warning/critical thresholds!
user@host: ~ $ ./check_hadoop-dfs.sh -s /var/nagios/home/bin -c 30
Please also set a warning value when you want to use warning/critical thresholds!

Submitted by: Mike Adolphs
Hits: 410
Overall downloads: 153
Added: 2009-06-03 00:00:00

Filename Preview Size Description DL
check_hadoop-dfs.sh Preview 4 kb The script (Version 1.0) 100
get-dfsreport.sh Preview 0 kb Code snippet that needs to be sudo'd 89
LICENSE 53 kb The appropriate license 77

Comments


Be the first to comment this project.

Leave a reply

If you just want to rate the project leave the other field empty.
Even though it would be nice if you would drop a line.

Allowed tags: blockquote, strong, a, b, i, u, code

Captcha Reload Image


  • Check SNMP Citrix

    rating from 0 votes
    Check Citrix Server Connection Information via SNMP. Extra OID's available through snmp4ctx package (http://www.wtcs.org/snmp4tpc/snmp4ctx.htm)
  • check_squid

    rating from 0 votes
    check_squid is a plugin to nagios for monitoring Squid by using squidclient
  • Siebel CRM monitoring for all platforms

    rating from 0 votes
    Monitoring of Siebel CRM Application (check of components, tasks, gateway, application server, logfiles and autorestart for components). So far the only Nagios plugin available for Siebel.
  • Check an IIS Application pool state

    rating from 0 votes
    This plugin checks for the state of an IIS application pool.
  • check_bind.sh

    rating from 0 votes
    sh compliant script to check whether BIND is running and to get the performance data via rndc stats (PNP Template included)
  • FlexLM

    rating4 from 1 votes
    Check flexlm license server and return total license and license in use (Also use Performance data).
  • CA ARCserve Backup r12 Number of Job Error Check

    rating from 0 votes
    CA ARCserve Backup r12 number of job error check using the ca_qmgr.exe cli tool.
  • jmeter invocation plugin

    rating from 0 votes
    This script is a rebuild of (http://www.monitoringexchange.org/cgi-bin/page.cgi?g=2455.html;d=1) that invokes an Apache JMeter (available from http://jakarta.apache.org/jmeter) test plan and parses the results, which are returned to Nagios per plugin API defined exit statuses and performance data.
  • check_crysis.pl

    rating5 from 1 votes
    Monitor the status of a Crysis dedicated server through XML-RPC instead of the standard CryQuery.
  • Check CMSms (CMS made simple) version

    rating from 0 votes
    This plugin checks the version of a locally installed CMSms installation and compares it to the latest available version.
Catalot 1.3.0 (appkit/v0.9.0-dev, Agavi/1.0.0) | www.netways.de | Legal
© 2009 NETWAYS GmbH. The Program is provided AS IS, without warranty. Licensed under GPLv3.
This program is free software; you can redistribute it and/or modify it under the terms of the GNU General Public License version 3.