RSS 6 projects tagged "hadoop"

No download Website Updated 09 Apr 2010 Hadoop Studio

Screenshot
Pop 150.37
Vit 2.66

Hadoop Studio is a map-reduce development environment (IDE) based on Netbeans. It makes it easy to create, understand, and debug map-reduce applications based on Hadoop, without requiring development-time access to a map-reduce cluster. The studio provides a real-time workflow view of a map-reduce job, which displays the individual inputs, outputs, and interactions between the phases of a map-reduce job. The workflow view of a job updates in real time with the developer's code changes. It then generates Java sources and compiles them into a binary jar file, which can be run on a normal Hadoop cluster.

No download Website Updated 14 May 2013 Gfarm

Screenshot
Pop 98.47
Vit 5.32

Gfarm is a distributed filesystem, generally used for large scale cluster computing. It's implemented in userland, and can be mounted by FUSE. It utilizes locality of a file to access a data node, and supports Globus GSI for Wide Area Network. Users can explicitly control file replica location on Gfarm. Gfarm can be used as an alternative storage system to HDFS for Hadoop, Samba, MPI-IO, and GridFTP. Monitoring via ZABBIX and Ganglia is also supported.

Download No website Updated 17 Mar 2014 Hypertable

Screenshot
Pop 347.03
Vit 27.53

Hypertable is a high performance, scalable database modeled after Google's Bigtable. It is designed to manage the storage and processing of information on a large cluster of commodity servers, providing resilience to machine and component failures.

Download Website Updated 04 Jun 2012 MapReduce-BitDew

Screenshot
Pop 57.31
Vit 26.13

MapReduce-BitDew is an implementation of the MapReduce programming model proposed by Google for Internet Desktop Grids. Using MapReduce-BitDew, you can execute MapReduce applications on resources like Desktop PCs distributed on the Internet. MapReduce-BitDew features a firewall-friendly protocol, fault-tolerance, result-certification, 2-level schedulers, and more.

No download Website Updated 25 Oct 2012 dispy

Screenshot
Pop 68.23
Vit 2.19

dispy is a Python framework for parallel execution of computations by distributing them across multiple processors in a single machine (SMP), or among many machines in a cluster or grid. The computations can be standalone programs or Python functions. dispy is well suited for the data parallel (SIMD) paradigm where a computation is evaluated with different (large) datasets independently (similar to Hadoop, MapReduce, Parallel Python). dispy features include automatic distribution of dependencies (files, Python functions, classes, modules), client-side and server-side fault recovery, scheduling of computations to specific nodes, encryption for security, sharing of computation resources if desired, and more.

Download No website Updated 14 Apr 2014 Infovore

Screenshot
Pop 614.83
Vit 43.40

Infovore is a map/reduce framework for processing large RDF data sets such as Freebase and DBpedia. It is based on Hadoop.

Screenshot

Project Spotlight

SBuild

A Scala-based build system.

Screenshot

Project Spotlight

Code Browser

A folding text editor.