RSS 5 projects tagged "hadoop"

No download Website Updated 14 May 2013 Gfarm

Screenshot
Pop 98.16
Vit 5.32

Gfarm is a distributed filesystem, generally used for large scale cluster computing. It's implemented in userland, and can be mounted by FUSE. It utilizes locality of a file to access a data node, and supports Globus GSI for Wide Area Network. Users can explicitly control file replica location on Gfarm. Gfarm can be used as an alternative storage system to HDFS for Hadoop, Samba, MPI-IO, and GridFTP. Monitoring via ZABBIX and Ganglia is also supported.

Download No website Updated 17 Mar 2014 Hypertable

Screenshot
Pop 347.33
Vit 27.97

Hypertable is a high performance, scalable database modeled after Google's Bigtable. It is designed to manage the storage and processing of information on a large cluster of commodity servers, providing resilience to machine and component failures.

Download No website Updated 18 Jun 2012 Syoncloud Logs

Screenshot
Pop 70.25
Vit 1.46

Syoncloud Logs processes log files from various applications and many servers. It can capture business relevant information from everyday log files generated by Web servers, business applications, and back office applications. It uses Flume sinks that run on the machines that produce log files. This data is filtered and relevant events channeled to HBase. The HBase NoSQL database is used for actual data analysis. The number of HBase nodes depends on the amount of processed log files. Syoncloud Logs has an easy to use installer that includes all necessary components such as Hadoop, Flume, Hbase, and Zookeeper.

Download No website Updated 14 Apr 2014 Infovore

Screenshot
Pop 614.23
Vit 51.81

Infovore is a map/reduce framework for processing large RDF data sets such as Freebase and DBpedia. It is based on Hadoop.

No download No website Updated 18 Feb 2014 Telepath

Screenshot
Pop 235.07
Vit 2.61

Telepath provides map/reduce code for processing Wikipedia Pagecounts. These contain usage data for all Wikipedia pages in all languages on an hourly basis. Derived from the bakemono toolkit, this project can process this 3TB data set with ease.

Screenshot

Project Spotlight

Kernel Mode Linux

A factility for executing user processes in kernel mode safely.

Screenshot

Project Spotlight

sshdfilter

A program that automatically blocks ssh brute force attacks.