RSS 6 projects tagged "Data Mining"

Download Website Updated 24 Aug 2013 ClodHopper

Screenshot
Pop 32.28
Vit 15.38

ClodHopper is a Java library for high-performance clustering of numerical data. It contains clustering implementations such as K-Means, K-Means++, X-Means, G-Means, Fuzzy C-Means, Jarvis-Patrick, and various forms of hierarchical clustering. ClodHopper's clustering implementations take advantage of the host system's concurrent processing ability to speed clustering. The data structures are also very lean to conserve memory usage. ClodHopper is very extensible. If you are developing a new clustering algorithm, you may save yourself an enormous amount of work by extending a ClodHopper base class.

No download No website Updated 18 May 2011 tagger

Screenshot
Pop 25.14
Vit 32.64

tagger is a library for automatic tagging of text documents, implementing different heuristics in order to select the most relevant keywords.

No download No website Updated 22 Mar 2011 Brewery

Screenshot
Pop 24.12
Vit 33.51

Brewery is a Python framework for data streaming, quality measurement, and flow-based data analysis. It can read from and write to various structured data sources such as CSV, XLS files, directories with YAML files, SQL databases, MongoDB, Google Spreadsheet, and more.

No download Website Updated 30 Dec 2012 MyMediaLite

Screenshot
Pop 79.25
Vit 8.04

MyMediaLite is a lightweight, multi-purpose library of recommender system algorithms. It addresses the two most common scenarios in collaborative filtering: rating prediction (e.g. on a scale of 1 to 5 stars), and item prediction from implicit feedback (e.g. from clicks or purchase actions). It contains dozens of recommender engines, including state-of-the-art matrix factorization methods. It also supports real-time updates to the recommender engines, storing engines to disk and reloading them again, and several evaluation measures to compare the accuracy of different recommender system methods. Three command-line programs that offer most of the functionality contained in the library are included.

Download No website Updated 28 Jan 2010 Waffles

Screenshot
Pop 27.13
Vit 39.25

Waffles is a cross-platform C++ library of algorithms for machine learning, artificial intelligence, data mining, etc. It also contains demo apps and command-line wrapper tools that are useful for visualizing, analyzing, and predictively modeling data.

Download Website Updated 01 Jan 2011 MAPDAV

Screenshot
Pop 81.61
Vit 2.96

MAPDAV (More Accurate Password Dictionary Attack Vector) is designed to use what is known about users via the /etc/passwd file on Unix/Linux systems to generate a dynamic dictionary of more accurate guesses as to what their possible password may be. It does this by mangling the user's username and user information in various user-specified ways to look for bad password protection practices.

Screenshot

Project Spotlight

Lernstick Exam Environment

A live Linux distribution for exams.

Screenshot

Project Spotlight

phpMyAdmin

A tool that handles the basic administration of MySQL over the Web.