6 projects tagged "Data Mining"

Download Website Updated 24 Aug 2013 ClodHopper

Pop 32.28
Vit 15.28

ClodHopper is a Java library for high-performance clustering of numerical data. It contains clustering implementations such as K-Means, K-Means++, X-Means, G-Means, Fuzzy C-Means, Jarvis-Patrick, and various forms of hierarchical clustering. The implementations use the host system's concurrent processing ability to speed up clustering, and the data structures are kept lean to conserve memory. ClodHopper is also extensible: if you are developing a new clustering algorithm, extending a ClodHopper base class may save you a great deal of work.
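To illustrate the simplest algorithm in that list, here is a minimal sketch of plain K-Means (Lloyd's iteration) in Python. It is not ClodHopper's API or implementation: the function names `kmeans`, `assign`, and `sq_dist`, the random seeding, and the sample points are all assumptions made for illustration, and the sketch is single-threaded.

```python
import random


def sq_dist(p, q):
    """Squared Euclidean distance between two equal-length tuples."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def assign(points, centers):
    """Group each point with its nearest center (ties go to the lower index)."""
    clusters = [[] for _ in centers]
    for p in points:
        nearest = min(range(len(centers)), key=lambda i: sq_dist(p, centers[i]))
        clusters[nearest].append(p)
    return clusters


def kmeans(points, k, iterations=20, seed=0):
    """Hypothetical helper: plain K-Means with k random points as initial centers."""
    rng = random.Random(seed)
    centers = rng.sample(points, k)
    for _ in range(iterations):
        clusters = assign(points, centers)
        new_centers = []
        for i, members in enumerate(clusters):
            if members:
                # New center is the coordinate-wise mean of the cluster members.
                new_centers.append(tuple(sum(c) / len(members) for c in zip(*members)))
            else:
                # Keep the old center if a cluster ends up empty.
                new_centers.append(centers[i])
        if new_centers == centers:
            break  # assignments can no longer change
        centers = new_centers
    return centers, assign(points, centers)


points = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]
centers, clusters = kmeans(points, k=2)
```

K-Means++ differs only in how the initial centers are chosen, so a variant could replace the `rng.sample` call and leave the loop unchanged.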

No download No website Updated 18 May 2011 tagger

Pop 25.69
Vit 32.59

tagger is a library for automatic tagging of text documents. It implements several heuristics for selecting the most relevant keywords.
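As a rough idea of what a keyword-selection heuristic can look like, the sketch below ranks words by raw frequency after dropping a small stopword list. This is not tagger's code or API, and the listing does not say which heuristics tagger uses: the `top_keywords` function, the stopword set, and the length cutoff are assumptions made for illustration.

```python
import re
from collections import Counter

# Tiny illustrative stopword list (an assumption, not taken from tagger).
STOPWORDS = {"the", "and", "for", "that", "with", "this", "are", "from"}


def top_keywords(text, n=3):
    """Return the n most frequent non-stopword words of three or more letters."""
    words = re.findall(r"[a-z]+", text.lower())
    counts = Counter(w for w in words if w not in STOPWORDS and len(w) > 2)
    return [word for word, _ in counts.most_common(n)]


sample = "Clustering groups data. Clustering of data needs fast clustering."
keywords = top_keywords(sample, n=2)
```

Frequency alone favors common words, so a practical tagger would typically combine it with other signals such as word position or rarity across documents.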

No download No website Updated 22 Mar 2011 Brewery

Pop 24.00
Vit 33.47

Brewery is a Python framework for data streaming, quality measurement, and flow-based data analysis. It can read from and write to various structured data sources such as CSV, XLS files, directories with YAML files, SQL databases, MongoDB, Google Spreadsheet, and more.
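The flow-based idea can be sketched with plain Python generators: a source node yields records, and a quality-measurement node counts empty values per field while passing the records downstream. This does not use Brewery itself; `read_rows`, `audit`, and the sample CSV are hypothetical names and data invented for illustration.

```python
import csv
import io


def read_rows(source):
    """Source node: yield each CSV record as a dict keyed by the header row."""
    yield from csv.DictReader(source)


def audit(rows, stats):
    """Quality node: count total and empty values per field, then pass rows on."""
    for row in rows:
        for field, value in row.items():
            entry = stats.setdefault(field, {"total": 0, "empty": 0})
            entry["total"] += 1
            # Treat missing values and whitespace-only strings as empty.
            if not value or not str(value).strip():
                entry["empty"] += 1
        yield row


csv_text = "name,city\nAda,London\nBob,\n"
stats = {}
rows = list(audit(read_rows(io.StringIO(csv_text)), stats))
```

Because each node is a generator, records flow through one at a time, so the same chain could process a large file without loading it into memory.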

Download No website Updated 28 Jan 2010 Waffles

Pop 26.80
Vit 39.21

Waffles is a cross-platform C++ library of algorithms for machine learning, artificial intelligence, data mining, etc. It also contains demo apps and command-line wrapper tools that are useful for visualizing, analyzing, and predictively modeling data.

No download No website Updated 30 Sep 2009 allmon

Pop 23.04
Vit 40.71

allmon is a generic system for collecting and storing runtime metrics used to monitor system performance, health, quality, and availability. The system also provides a set of data-mining algorithms useful for further performance analysis. Allmon is designed to harvest metric values from many areas of the monitoring infrastructure. The collected data serve as the basis for quantitative and qualitative performance and availability analysis. Allmon works with other analytical tools for OLAP multidimensional analysis and data-mining processing. The tool can be used in production as well as for development (profiling) and QA (load testing).

Download Website Updated 01 Jan 2011 MAPDAV

Pop 82.70
Vit 2.96

MAPDAV (More Accurate Password Dictionary Attack Vector) uses what is known about users from the /etc/passwd file on Unix/Linux systems to generate a dynamic dictionary of more accurate guesses at their likely passwords. It does this by mangling each user's username and user information in various user-specified ways to reveal poor password practices.
