Ganglia is a scalable distributed monitoring system for high-performance computing systems such as clusters and grids. It is based on a hierarchical design targeted at federations of clusters. Ganglia is currently in use on over 500 clusters around the world and has scaled to handle clusters with 2000 nodes.
Wackamole is a tool that helps make a cluster highly available. It manages a set of virtual IP addresses that should be reachable from the outside world at all times, and ensures that exactly one machine within the cluster is listening on each virtual IP address it manages. If it discovers that particular machines within the cluster are down, it will almost immediately ensure that other machines acquire the virtual IP addresses the down machines were managing. At no time will more than one connected machine be responsible for any virtual IP.
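The failover behavior described above can be illustrated with a minimal Python sketch. It is not Wackamole's actual implementation and does not model how cluster membership is detected; the function name `rebalance`, the host names, and the least-loaded placement rule are all assumptions chosen for illustration. It only shows the invariant: every virtual IP is held by exactly one live machine.

```python
from typing import Dict, Set


def rebalance(assignments: Dict[str, str], alive: Set[str]) -> Dict[str, str]:
    """Return a VIP -> host mapping in which each VIP has exactly one live owner.

    Hypothetical helper: the placement policy (least-loaded host, ties broken
    by host name) is an assumption, not Wackamole's documented behavior.
    """
    if not alive:
        raise ValueError("no live hosts available to hold virtual IPs")

    # VIPs whose current owner is still alive stay where they are.
    result = {vip: host for vip, host in assignments.items() if host in alive}

    # Count how many VIPs each live host already holds.
    load = {host: 0 for host in alive}
    for host in result.values():
        load[host] += 1

    # Hand each orphaned VIP to the least-loaded live host.
    for vip in sorted(assignments):
        if vip not in result:
            target = min(sorted(alive), key=lambda h: load[h])
            result[vip] = target
            load[target] += 1

    return result


if __name__ == "__main__":
    current = {"10.0.0.1": "a", "10.0.0.2": "b", "10.0.0.3": "c"}
    # Host "b" has gone down; its VIP must move to a surviving host.
    print(rebalance(current, alive={"a", "c"}))
```

Because the result is a dictionary keyed by virtual IP, a given address can never map to more than one machine, which mirrors the guarantee stated in the description.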
AutoNOC is a high-performance, production-integrated, peer-to-peer network operations management platform for Windows and Linux. It provides real-time historical analysis, root cause analysis, fault detection, reporting, alerts and alarms, and no-nonsense correlation. It is an interoperable, vendor-independent solution with built-in support for Microsoft, Cisco, Linux, IBM, and other major technologies. Additionally, it offers many novel capabilities, including end-user personalization, easy scalability, compressed historical databases, infinite histories, event archiving (it works as a syslog server), and multi-language support.
Openwsman provides an implementation of the Web Services Management specification and exposes system management information on the Linux operating system using the WS-Management protocol. It is based on a suite of Web services specifications and usage requirements that expose a set of operations covering all aspects of system management.
Os-Cafe is a system for administering a cybercafe that runs Linux exclusively on both the clients and the server. It has been used in The OpenSource Café (Lyon, France) since summer 2006. It provides a set of preconfigured and customized tools (including some pre-existing ones) such as openkiosk, nodeview, and bigsister, along with a full framework for administration and a captive portal for Wi-Fi support.