The Scalable Assembler at Notre Dame (SAND) replaces the early stages of the Celera Assembler with scalable versions that can run on collections of commodity computers. By harnessing clusters, clouds, grids, or just random machines in your office, many bioinformatics tasks can be reduced from weeks or months down to minutes or hours.
Son of Grid Engine is a highly-scalable and versatile distributed resource manager for scheduling batch or interactive jobs on clusters or desktop farms. It is a community project to continue Sun's Grid Engine. It is competitive against proprietary systems and provides better scheduling features and scalability than other free DRMs like Torque, SLURM, Condor, and Lava.
Makeflow is a workflow engine for executing large complex applications on clusters, clouds, and grids. It can be used to drive several different distributed computing systems, including Condor, SGE, and the included Work Queue system. It does not require a distributed filesystem, so you can use it to harness whatever collection of machines you have available. It is typically used for scaling up data-intensive scientific applications to hundreds or thousands of cores.
The purpose of the glideinWMS is to provide a simple way to access the Grid resources. To achieve this without reinventing the wheel, glideinWMS is a Glidein-based WMS (Workload Management System) that works on top of Condor. Once setup, final users can submit regular Condor jobs to the local queue and the glidein factory will provide the computing resources behind the scenes. From the final user point of view, the Condor pool just magically grows and shrinks as needed.
The SensorWeb project aims at developing the Open SensorWeb Architecture (OSWA) and implementing a standards-compliant platform and middleware for integration of sensor networks with emerging distributed computing platforms such as Grids. It confirms to the Web Services standard defined by the W3C, and the SensorML (Sensor Model Language) standard defined by the OpenGeospatial Consortium.
The gLite distribution is an integrated set of components designed to enable resource sharing. In other words, this is middleware for building a grid. The gLite middleware is produced by the EGEE project. The distribution model is to construct different services ("node-types") from these components and then ensure easy installation and configuration on the chosen platforms.
DBReplicator is a powerful application for network-based multi-master heterogeneous database replication or filtered synchronization. It supports heterogeneous replication, bi-directional data synchronization between any of the supported database backends, application independence, automatic conflict detection and resolution, a scheduling facility, verbose debugging using Apache log4j, special characters, and automatic table creation.
Mrs is a lightweight but high performance implementation of the MapReduce programming model for parallel computation. It is optimized for computationally intensive programs, especially those which are iterative, though it is applicable to a wide range of applications. It is intended to be simple to install and use. It is written in Python, supports both Python 2 and Python 3, and has no dependencies outside the standard library. The name "Mrs" derives from the practice of referring to MapReduce programs by the title "mister" (e.g., "Mr. Wordcount").