RSS 6 projects tagged "Data Mining"

Download Website Updated 24 Aug 2013 ClodHopper

Screenshot
Pop 32.03
Vit 15.41

ClodHopper is a Java library for high-performance clustering of numerical data. It contains clustering implementations such as K-Means, K-Means++, X-Means, G-Means, Fuzzy C-Means, Jarvis-Patrick, and various forms of hierarchical clustering. ClodHopper's clustering implementations take advantage of the host system's concurrent processing ability to speed clustering. The data structures are also very lean to conserve memory usage. ClodHopper is very extensible. If you are developing a new clustering algorithm, you may save yourself an enormous amount of work by extending a ClodHopper base class.

No download No website Updated 18 May 2011 tagger

Screenshot
Pop 25.10
Vit 32.66

tagger is a library for automatic tagging of text documents, implementing different heuristics in order to select the most relevant keywords.

No download No website Updated 22 Mar 2011 Brewery

Screenshot
Pop 23.92
Vit 33.53

Brewery is a Python framework for data streaming, quality measurement, and flow-based data analysis. It can read from and write to various structured data sources such as CSV, XLS files, directories with YAML files, SQL databases, MongoDB, Google Spreadsheet, and more.

No download Website Updated 30 Dec 2012 MyMediaLite

Screenshot
Pop 79.12
Vit 8.03

MyMediaLite is a lightweight, multi-purpose library of recommender system algorithms. It addresses the two most common scenarios in collaborative filtering: rating prediction (e.g. on a scale of 1 to 5 stars), and item prediction from implicit feedback (e.g. from clicks or purchase actions). It contains dozens of recommender engines, including state-of-the-art matrix factorization methods. It also supports real-time updates to the recommender engines, storing engines to disk and reloading them again, and several evaluation measures to compare the accuracy of different recommender system methods. Three command-line programs that offer most of the functionality contained in the library are included.

No download No website Updated 30 Sep 2009 allmon

Screenshot
Pop 23.04
Vit 40.76

allmon is a generic system for collecting and storing various runtime metrics collections used for system performance, health, quality, and availability monitoring purposes. The system also provides a set of data-mining algorithms useful for further performance analysis. Allmon is designed to harvest different metrics values coming from many areas of monitoring infrastructure. The collected data are based on quantitative and qualitative performance and availability analysis. Allmon collaborates with other analytical tools for OLAP multidimensional analysis and data mining processing. The tool can be used for production as well as for development (profiling) and QA (load testing) purposes.

Download Website Updated 01 Jan 2011 MAPDAV

Screenshot
Pop 80.75
Vit 2.96

MAPDAV (More Accurate Password Dictionary Attack Vector) is designed to use what is known about users via the /etc/passwd file on Unix/Linux systems to generate a dynamic dictionary of more accurate guesses as to what their possible password may be. It does this by mangling the user's username and user information in various user-specified ways to look for bad password protection practices.

Screenshot

Project Spotlight

JStock - Free Stock Market Software

A stock market application.

Screenshot

Project Spotlight

Devel Live CD

A Live CD to compile programs.