System Watchdog is a program that is designed to monitor a collection of Linux systems. It is configured to collect data on each monitored system at 5-minute intervals and build historical graphs using RRDTool. By default, it will gather statistics on CPU, memory, disk, and network usage. It also will attempt to monitor temperature and power settings. System Watchdog uses the paramiko module to ssh to all monitored systems. To access monitored systems, threads are implemented to access each remote host. After all threads have completed, watchdog will build a summary landing page to display the current status of each host, with links to a hardware inventory and resource graphs of each host.
The System Status Dashboard (SSD) is a status dashboard designed to provide an easy to understand view into an organization's infrastructure health status. It is modeled after status dashboard services like Google's App Status and Amazon's Service Health Dashboard and displays a matrix of services, dates, and their corresponding status in an easy to read calendar view. Services can be Web sites, APIs, internal IT systems, or anything else about which your users or customers require timely status information. It allows system administrators to easily create and track incidents and provides an excellent historical reference for SLA performance calculations.
Nagios has been around for quite some time, but producing output it can consume is something of a black art. Only the plugin documentation actually explains what all the extra semicolons or extended formatting even mean. This is especially onerous when performance consuming add-ons expect a specific structure before operating properly. The NagAconda package strives to greatly simplify the process of actually generating Nagios output using Python.