Ganglia is a scalable distributed monitoring system for high-performance computing systems such as clusters and grids. It is based on a hierarchical design targeted at federations of clusters. Ganglia is currently in use on over 500 clusters around the world and has scaled to handle clusters with 2000 nodes.
PortSensor is a powerful Windows/Unix server monitoring tool with Linux/Mac/Windows clients. Sensors can be created to monitor nearly any TCP/UDP service, such as: HTTP, FTP, POP3, SMTP, MySQL, and DNS. Custom sensors can be created to monitor your other critical server metrics, such as processor loads, mail queue loads, disk space usage, and log activity.
Wackamole is a tool that helps with making a cluster highly available. It manages a bunch of virtual IPs that should be available to the outside world at all times, and ensures that exactly one machine within the cluster is listening on each virtual IP address that Wackamole manages. If it discovers that particular machines within the cluster are not alive, it will almost immediately ensure that other machines acquire the virtual IP addresses the down machines were managing. At no time will more than one connected machine be responsible for any virtual IP.
BigDaddy is a program for monitoring servers. It is similar to Nagios, with the added benefit of also monitoring and controlling the crontab (or any scheduled application) across an entire fleet of servers. The application comes in the form of a daemon for monitoring and reporting as well as an easy-to-use Web-based GUI for controlling monitoring, viewing timelines of incidents, filing incidents and graphing statistics. The application is extensible with any sort of monitoring module and notification is based on a five step escalation process.
The CommandCenter-NOC monitors a complete network system. It provides asset management, security monitoring, bandwidth analysis, and reporting for almost any environment. The CommandCenter-NOC will discover and inventory new hardware and software on your system while providing reports that cover service availability, outages, inventory, security, performance, delta inventory, and open data framework reports. The Web interface allows easy system monitoring. Built-in Snort intrusion detection and Nessus vulnerability scanning will provide immediate notification of any risk to the system via email, SMS, and pager.
The WebReboot Plugin for Nagios is a suite of commands that can be used within Nagios to monitor a server and take corrective action if necessary via the WebReboot line of products. For example, the plugin can be used to alert you if a host is powered down, versus simply not responding to network requests. Likewise, it can be used to reboot a server if a host fails to respond to ping, or to shut down a server when a critical temperature threshold is exceeded. The commands can be mixed-and-matched with all existing Nagios commands, maximizing total network coverage.
simena-io is a Linux tool written in Perl and designed to show ethernet interface statistics in bits/second and packets/second in real time. It requires at least Linux kernel 2.2 and Perl 5. It does not require a root account. There is only one command parameter: the refresh rate in seconds. If no parameter is provided, simena-io will refresh every 2 seconds by default. Detailed documentation can be obtained by running "perldoc simena-io".
Modem.pl is a small script that scrapes the Web interface of a Motorola SURFboard cable modem for various status conditions like signal strength and signal to noise ratio. The results are sent to STDOUT where they can be easily piped into a log file. The modem values are also checked for reasonable operating ranges. If the modem values are outside of reasonable operating ranges, results are also sent to STDERR. When run from a cron job, modem.pl can be used to monitor the condition of the cable service and notify someone before conditions cause service interruptions.
Openwsman provides an implementation of the Web Services Management specification and expose system management information on the Linux operating system using the WS-Management protocol. It is based on a suite of Web services specifications and usage requirements that exposes a set of operations focused on and covering all system management aspects.
Os-Cafe is a system for administering a cybercafe running exclusively Linux for clients and the server. It has been used in The OpenSource Café (France/Lyon) since summer 2006. It provides a set of preconfigured and customized tools (including some pre-existing ones) like openkiosk, nodeview, and bigsister, but it also provides a full framework for administration and a captive portal for Wifi support.