Epikscan is a cluster aware, diagnostic script that runs basic health checks and gathers detailed, addressable system information from an RHEL, Scientific Linux, Oracle/OVM2, or CentOS 4, 5, or 6 system. A single archive is created for each host, containing the scan data along with linking information that can be merged with scans from other nodes to produce a cluster oriented report. HTML, text, XML, and SQLite3 database reports can be optionally generated to support both manual and automated fault analysis. The scan and merge process attempts to heuristically identify possible problems with the node or cluster configuration and highlights them in red to speed up the review and resolution process.
ProcMeter3 is a program for monitoring the system status and other information and displaying it in a series of graphs or as text. Most of the information comes from the /proc filesystem (cpu usage, load average , processes information, memory usage, network traffic, interrupts etc.). Other information is available for other sources (date, time, email status, log file length, disk status etc.). The program is modular and highly configurable.
check_oracle_health is a plugin for the Shinken (Nagios) monitoring software that allows you to monitor various metrics of an Oracle database. It includes connection time, SGA data buffer hit ratio, SGA library cache hit ratio, SGA dictionary cache hit ratio, SGA shared pool free, PGA in memory sort ratio, tablespace usage, tablespace fragmentation, tablespace I/O balance, invalid objects, and many more.
check_hpasm is a plugin for Nagios which checks the hardware health of Hewlett-Packard Proliant servers. To accomplish this, you must have installed the hpasm package. The plugin checks the health of processors, power supplies, memory modules, fans, CPU- and board-temperatures, and alerts you if one of these components is faulty or operates outside its normal parameters.
dstat is a versatile replacement for vmstat, iostat, netstat, nfsstat, and ifstat. It includes various counters (in separate plugins) and allows you to select and view all of your system resources instantly; you can, for example, compare disk usage in combination with interrupts from your IDE controller, or compare the network bandwidth numbers directly with the disk throughput (in the same interval).
Network Management Tool makes it possible to quickly find vital information about any of your network devices such as serial numbers and support contact information. A log is kept for each device so that you can enter service information. An automatic export feature that will create a spreadsheet or database-ready file is also provided. Each list is easily edited with a Web interface.
GroundWork Monitor Community Edition can give you insight into your computing infrastructure, allowing you to see the current and historical states of all your computers: servers, desktops, and laptops, all of your network devices, all of your services (like TCP/IP and Web services), and all of your applications (like mail servers and database apps). You can choose to be alerted when something goes awry via pager, SMS, email, or phone, and even set up automatic restarts or fall-overs.