RSS 8 projects tagged "hadoop"

No download Website Updated 09 Apr 2010 Hadoop Studio

Screenshot
Pop 150.62
Vit 2.66

Hadoop Studio is a map-reduce development environment (IDE) based on Netbeans. It makes it easy to create, understand, and debug map-reduce applications based on Hadoop, without requiring development-time access to a map-reduce cluster. The studio provides a real-time workflow view of a map-reduce job, which displays the individual inputs, outputs, and interactions between the phases of a map-reduce job. The workflow view of a job updates in real time with the developer's code changes. It then generates Java sources and compiles them into a binary jar file, which can be run on a normal Hadoop cluster.

No download Website Updated 14 May 2013 Gfarm

Screenshot
Pop 99.32
Vit 5.28

Gfarm is a distributed filesystem, generally used for large scale cluster computing. It's implemented in userland, and can be mounted by FUSE. It utilizes locality of a file to access a data node, and supports Globus GSI for Wide Area Network. Users can explicitly control file replica location on Gfarm. Gfarm can be used as an alternative storage system to HDFS for Hadoop, Samba, MPI-IO, and GridFTP. Monitoring via ZABBIX and Ganglia is also supported.

No download No website Updated 14 Jul 2011 Beanstalker

Screenshot
Pop 21.02
Vit 31.86

Beanstalker is a set of Maven Plugins for Amazon Web Services (AWS) Elastic Beanstalk and Elastic MapReduce. Plugin Mojos are suitable not only for command-line usage, but for Continuous Integration as well.

Download No website Updated 17 Mar 2014 Hypertable

Screenshot
Pop 342.47
Vit 25.29

Hypertable is a high performance, scalable database modeled after Google's Bigtable. It is designed to manage the storage and processing of information on a large cluster of commodity servers, providing resilience to machine and component failures.

Download Website Updated 04 Jun 2012 MapReduce-BitDew

Screenshot
Pop 59.60
Vit 26.24

MapReduce-BitDew is an implementation of the MapReduce programming model proposed by Google for Internet Desktop Grids. Using MapReduce-BitDew, you can execute MapReduce applications on resources like Desktop PCs distributed on the Internet. MapReduce-BitDew features a firewall-friendly protocol, fault-tolerance, result-certification, 2-level schedulers, and more.

No download Website Updated 25 Oct 2012 dispy

Screenshot
Pop 68.64
Vit 2.19

dispy is a Python framework for parallel execution of computations by distributing them across multiple processors in a single machine (SMP), or among many machines in a cluster or grid. The computations can be standalone programs or Python functions. dispy is well suited for the data parallel (SIMD) paradigm where a computation is evaluated with different (large) datasets independently (similar to Hadoop, MapReduce, Parallel Python). dispy features include automatic distribution of dependencies (files, Python functions, classes, modules), client-side and server-side fault recovery, scheduling of computations to specific nodes, encryption for security, sharing of computation resources if desired, and more.

Download No website Updated 14 Apr 2014 Infovore

Screenshot
Pop 613.22
Vit 26.33

Infovore is a map/reduce framework for processing large RDF data sets such as Freebase and DBpedia. It is based on Hadoop.

No download No website Updated 18 Feb 2014 Telepath

Screenshot
Pop 232.74
Vit 2.55

Telepath provides map/reduce code for processing Wikipedia Pagecounts. These contain usage data for all Wikipedia pages in all languages on an hourly basis. Derived from the bakemono toolkit, this project can process this 3TB data set with ease.

Screenshot

Project Spotlight

Wenity

A multi-platform Zenity clone.

Screenshot

Project Spotlight

Profanity

A ncurses-based Jabber client.