Moab Grid Suite is a professional grid workload management solution that integrates scheduling, management, monitoring, and reporting of workloads across independent clusters. Moab makes moving to a grid easy by adding grid-optimized job submission and management in a matter of minutes. Moab optimizes data staging and seamlessly integrates with existing security mechanisms or with grid security tool kits such as Globus. It gives sites extensive control over ownership-based access, priority, and service levels. It provides fine-tuned control over the workload and resources to make the grid usable.
Ganglia is a scalable distributed monitoring system for high-performance computing systems such as clusters and grids. It is based on a hierarchical design targeted at federations of clusters. Ganglia is currently in use on over 500 clusters around the world and has scaled to handle clusters with 2000 nodes.
redWall is a bootable CD-ROM firewall which focuses on Web-based reporting of the firewall's status. It includes Snort, snortsam, dansguardian, and support for fwbuilder, squidguard, reporting (using BASE/sarg/ntop/webfwlog), VPN (Openswan/PoPToP/Openvpn), Spam Filtering (spamassassin, dcc, razor2, clamav, amavis-new, dspam and maia mailguard), and mail-based, alerting. Configuration data are stored on a floppy or USB disk.
BixData is a cluster management tool that includes monitoring and system administration features. It monitors services (HTTP, ping, POP3, SMTP), performance, and processes. It has a management console for VMWare and Xen that supports multiple virtual machine hosts and guests. It can create critical notifications and send email alerts for any system event (HTTP, ping, CPU, memory, SMART diagnostics, VM stats). A graphical desktop supports real-time dynamic graphs. The runtime agents and server components are lightweight and easy to set up and run.
Heartbeat is a full-function high-availability system for Linux and other POSIX-like OSes. It monitors services and restarts them on errors. When managing a cluster (more than 1 machine), it will also monitor the members of the cluster and begin recovery of lost services in less than a second. It runs over serial ports and UDP broadcast/multicast, as well as OpenAIS multicast. It is easily adapted to different interconnect media and protocols. When used in a cluster, it can operate using shared disks, data replication, or no data sharing. Versions starting with 2.0 are comparable to any commercial HA package, providing resource monitoring, larger clusters, and detailed dependency information.
Performance Co-Pilot (PCP) is a framework and set of services for supporting system-level performance monitoring and performance management. It provides a unifying abstraction for all of the interesting performance data in a system, and allows client applications to easily retrieve and process any subset of that data using a single API. A client-server architecture allows multiple clients to monitor the same host, and a single client to monitor multiple hosts. Archive logging and replay are integrated so that a client application can use the same API to process real-time data from a host or historical data from an archive.
openMosix is a a set of extensions to the standard Linux kernel allowing you to build a cluster of out of off-the-shelf PC hardware. openMosix scales perfectly up to thousands of nodes. You do not need to modify your applications to benefit from your cluster (unlike PVM, MPI, Linda, etc.). Processes in openMosix migrate transparently between nodes and the cluster will always auto-balance.