Prolog Statistical Machine Translation is a fairly unsophisticated statistical machine translation program. It consists of a language model learner (which takes example sentences in the target language and learns a language model based on trigrams), a dictionary learner (which learns word-for-word translations), and a search program (which uses the data from the first two parts to translate a source sentence into the target language).
FP-Growth-Tiny introduces a space optimization to the FP- Growth algorithm for mining frequent itemsets in a transaction database. The code contains libraries, CLI frontends and a few other tools suited for this task. Frequent itemset or frequency mining is the core of popular mining methods such as association rule mining and sequence mining.
The Kernel-Machine Library is a C++ library to promote the use and progress of kernel machines. It is both for academic use and for developing real world applications. The Kernel-Machine Library draws heavily from features of modern C++ such as template meta-programming to achieve high performance while at the same time offering a comfortable interface. It enables compile-time selection of specialized algorithms on the basis of data types: for example, the specific case of a SVM in combination with a linear kernel can be computed by a specialized efficient algorithm.
N-genes is a Java framework and application for both genetic programming and genetic algorithms. The goal of this software is to offer a flexible system able to speed-up the implementation of research ideas. Complex behaviors like variable size populations or self-adaptive genetic operators can be implemented easily and quickly.
SenseClusters is a natural language processing package that allows you to cluster similar contexts or to identify clusters of related words. It supports its own native methods based on first and second order representations of context, and also supports Latent Semantic Analysis. It is fully unsupervised, and can automatically discover the optimal number of clusters in your text. SenseClusters is a complete system that takes users from preprocessing of raw text to providing clustered output.
CharGer is a conceptual graph editor intended to support research projects and education. It currently is primarily an editor to create visual displays of graphs. It is deliberately and explicitly a research tool meant for conceptual graph researchers to explore implementation issues in conceptual graph interfaces. Using the software will require some familiarity with conceptual graphs, including knowing about concepts and relations, type hierarchies, and type/referent pairs. Knowing about actors will also be very helpful.