RSS 13 projects tagged "Clustering"

Download Website Updated 07 Jan 2014 MLPACK

Screenshot
Pop 100.82
Vit 2.65

MLPACK is a C++ machine learning library with an emphasis on scalability, speed, and ease-of-use. Its aim is to make machine learning possible for novice users by means of a simple, consistent API, while simultaneously exploiting C++ language features to provide maximum performance and maximum flexibility for expert users. It contains algorithms such as k-means, Gaussian mixture models, hidden Markov models, density estimation trees, kernel PCA, locality-sensitive hashing, sparse coding, linear regression and least-angle regression.

Download Website Updated 24 Aug 2013 ClodHopper

Screenshot
Pop 32.03
Vit 15.41

ClodHopper is a Java library for high-performance clustering of numerical data. It contains clustering implementations such as K-Means, K-Means++, X-Means, G-Means, Fuzzy C-Means, Jarvis-Patrick, and various forms of hierarchical clustering. ClodHopper's clustering implementations take advantage of the host system's concurrent processing ability to speed clustering. The data structures are also very lean to conserve memory usage. ClodHopper is very extensible. If you are developing a new clustering algorithm, you may save yourself an enormous amount of work by extending a ClodHopper base class.

Download Website Updated 10 Jul 2013 Hados

Screenshot
Pop 61.28
Vit 1.02

Hados stores files in a cluster of servers. Its goal is to handle high availability by storing copies of the same file on several nodes. It provides RESTFUL APIs to easily store, check, or retrieve files. Using the cluster APIs, you can retrieve files from whichever node hosts them. To avoid any single point of failure, it is possible to apply a request to any node of the cluster; there is no master node.

Download Website Updated 28 Jun 2013 HWSD

Screenshot
Pop 77.14
Vit 4.54

HWSD is a daemon and library for the discovery and announcement of hardware resources using ZeroConf. It enables auto-configuration of ad-hoc GPU clusters and multi-GPU machines through multi-platform GPU and network device discovery.

Download Website Updated 04 Jul 2011 K-tree

Screenshot
Pop 75.39
Vit 1.96

K-tree provides a scalable approach to clustering by combining the B+-tree and k-means algorithms. Clustering can be used to solve problems in signal processing, machine learning, and other contexts. It has recently been used to solve document clustering problems on the Wikipedia collection.

Download Website Updated 14 Dec 2010 StarCluster

Screenshot
Pop 83.71
Vit 2.04

StarCluster is a utility for creating traditional computing clusters used in research labs or for general distributed computing applications on Amazon's Elastic Compute Cloud (EC2). It uses a simple configuration file provided by the user to request cloud resources from Amazon and to automatically configure them with a queuing system, an NFS shared /home directory, passwordless SSH, OpenMPI, and ~140GB scratch disk space. It consists of a Python library and a simple command line interface to the library. For end-users, the command line interface provides simple intuitive options for getting started with distributed computing on EC2 (i.e. starting/stopping clusters, managing AMIs, etc). For developers, the library wraps the EC2 API to provide a simplified interface for launching/terminating nodes, executing commands on the nodes, copying files to/from the nodes, etc.

No download No website Updated 02 Jun 2010 Leemba

Screenshot
Pop 15.78
Vit 37.65

Leemba is a new style of application monitor built from the ground up for Terracotta clusters, scalability, and easy online configuration. Leemba is for administrators who have better things to do.

Download Website Updated 29 Mar 2010 Proto Balance Mail

Screenshot
Pop 40.44
Vit 1.43

Proto Balance Mail is an enterprise SMTP cluster solution that supports distribution of email accounts. It scales up to 1,000,000 mailboxes apportioned over up to 125 backend mail servers (8000 mailboxes per server). No NFS or SAN is required. SOA is configurable with SOAP/XML. Anti-spam settings can be set per-user. Grey-listing is supported. Mal-ware is automatically detected and infected client PCs are automatically black-listed. POP load balancing is done. SMTP AUTH is supported. There is a Web-based management interface. Spam blocking is done by on-the-fly connection behavior analysis. It handles up to 10,000 concurrent SMTP connections. Streamlined CRM integration is done with HTTP+XML posts. Email-alias lists, forwarding, and out-of-office auto-reply are supported.

Download Website Updated 18 Feb 2010 jmemcached

Screenshot
Pop 70.88
Vit 2.95

jmemcached is a fast network available cache daemon. It is protocol-compatible with memcached, but written in Java and suitable for applications with portability concerns, where Java is the preferred solution, or for using the memcached protocol in embedded applications with alternate storage engines. Existing clients for memcache work unmodified. It can run as a standalone daemon or be embedded inside an existing Java application.

No download No website Updated 11 Dec 2009 BorderFlow

Screenshot
Pop 16.09
Vit 39.87

BorderFlow implements a general-purpose graph clustering algorithm. It maximizes the inner to outer flow ratio from the border of each cluster to the rest of the graph. The main advantage of the algorithm is that it does not need parametrization to compute results of high accuracy.

Screenshot

Project Spotlight

libHX

A library for quick day-to-day C programming.

Screenshot

Project Spotlight

pride

Poor Richard's Independent anDroid Environment.