SenseClusters is a natural language processing package that allows you to cluster similar contexts or to identify clusters of related words. It supports its own native methods based on first and second order representations of context, and also supports Latent Semantic Analysis. It is fully unsupervised, and can automatically discover the optimal number of clusters in your text. SenseClusters is a complete system that takes users from preprocessing of raw text to providing clustered output.
Red-Piranha is a search system that can actually learn what you are looking for. It can be used as a Web page, command line, or XML-WebService, so it will work with most languages, including Java, Perl, C#/.NET, and PHP. It includes learning abilities for the Desktop/Internet search functionality. All feedback from the user is stored in (editable) XML and RDF, and is used by the system to improve the quality of searches.
Ellogon is a multi-lingual, cross-platform, general-purpose language engineering environment, developed in order to aid both researchers who are doing research in computational linguistics, as well as companies who produce and deliver language engineering systems. As a language engineering platform, it offers an extensive set of facilities, including tools for processing and visualising textual/HTML/XML data and associated linguistic information, support for lexical resources (like creating and embedding lexicons), tools for creating annotated corpora, accessing databases, comparing annotated data, or transforming linguistic information into vectors for use with various machine learning algorithms.
The Orbital library is a Java class library providing object-oriented representations and algorithms for logic, mathematics, and computer science. It comprises theorem proving, computer algebra, search and planning, as well as machine learning algorithms. Generally speaking, the conceptual idea behind the Orbital library is to provide extensional services and components that surround the heart of many scientific applications, hence the name "Orbital library". In order to satisfy the requirements of high reusability, the design of this foundation class library favors flexibility, conceptual simplicity, and generalization. Many sophisticated problems can be solved easily with its adaptable components.
LinkGrammar-WN is a lexicon expansion for the Link Grammar Parser. The Link Grammar Parser is a syntactic parser of the English language that is capable of handling a wide variety of syntactic constructions and is considered quite robust. The LinkGrammar-WN project aims to import lexical information from WordNet in an effort to increase the size of the LGP lexicon. This project is of interest to anyone interested in NLP (natural language parsing) of English text.
The OMCSNet-WordNet project aims to improve the quality of the OMCSNet dataset by using automated processes to map WordNet synonym sets to OMCSNet concepts and import additional semantic linkage data from WordNet. It is based on OMCSNet 1.2, a semantic network and inference toolkit written in Python/Java. OMCSNet currently contains over 280,000 separate pieces of common sense information extracted from the raw OMCS dataset. This project is also based on WordNet, an online lexical reference system that in recent years has become a popular tool for AI researchers.
OMCSNetCPP is a C++ API and inference toolkit for accessing OMCSNet, a semantic network mined out of the Open Mind Common Sense knowledge base. The goal of this project is to provide a class library that allows programmers to easily add common sense reasoning capabilities to C++ applications.
Lush is a Lisp dialect with extensions for object-oriented and array-oriented programming. It is intended as a programming environment for prototyping numerically intensive applications. Unlike alternatives like Python or SciLab, Lush is designed for easy integration of existing C/C++/Fortran codes.