RSS 5 projects tagged "mapreduce"

No download Website Updated 25 Oct 2012 dispy

Screenshot
Pop 68.23
Vit 2.19

dispy is a Python framework for parallel execution of computations by distributing them across multiple processors in a single machine (SMP), or among many machines in a cluster or grid. The computations can be standalone programs or Python functions. dispy is well suited for the data parallel (SIMD) paradigm where a computation is evaluated with different (large) datasets independently (similar to Hadoop, MapReduce, Parallel Python). dispy features include automatic distribution of dependencies (files, Python functions, classes, modules), client-side and server-side fault recovery, scheduling of computations to specific nodes, encryption for security, sharing of computation resources if desired, and more.

Download Website Updated 04 Jun 2012 MapReduce-BitDew

Screenshot
Pop 57.31
Vit 26.13

MapReduce-BitDew is an implementation of the MapReduce programming model proposed by Google for Internet Desktop Grids. Using MapReduce-BitDew, you can execute MapReduce applications on resources like Desktop PCs distributed on the Internet. MapReduce-BitDew features a firewall-friendly protocol, fault-tolerance, result-certification, 2-level schedulers, and more.

No download No website Updated 14 Jul 2011 Beanstalker

Screenshot
Pop 20.42
Vit 31.77

Beanstalker is a set of Maven Plugins for Amazon Web Services (AWS) Elastic Beanstalk and Elastic MapReduce. Plugin Mojos are suitable not only for command-line usage, but for Continuous Integration as well.

No download Website Updated 03 Nov 2011 Plasma

Screenshot
Pop 58.12
Vit 1.76

Plasma implements the map/reduce framework on a compute cluster. It has its own distributed filesystem, PlasmaFS, which is transactional (ACID), reliable, and fast, and which provides a complete set of file operations. PlasmaFS can be accessed via an RPC protocol or via NFS (i.e., it is mountable). Additionally, there is a key/value database on top of PlasmaFS.

No download Website Updated 09 Apr 2010 Hadoop Studio

Screenshot
Pop 150.37
Vit 2.66

Hadoop Studio is a map-reduce development environment (IDE) based on Netbeans. It makes it easy to create, understand, and debug map-reduce applications based on Hadoop, without requiring development-time access to a map-reduce cluster. The studio provides a real-time workflow view of a map-reduce job, which displays the individual inputs, outputs, and interactions between the phases of a map-reduce job. The workflow view of a job updates in real time with the developer's code changes. It then generates Java sources and compiles them into a binary jar file, which can be run on a normal Hadoop cluster.

Screenshot

Project Spotlight

Text2GUI

A program that generates Java GUIs from natural language.

Screenshot

Project Spotlight

Canumb

Software that converts data into something meaningful and interpretable.