libkdtree++ is a C++ template container implementation of k-dimensional space sorting, using a kd-tree. It sports a theoretically unlimited number of dimensions, and can store any data structure. Provided the data structure, it provides operator[0 - k-1] to access the individual dimensional components (arrays, std::vector already do) and a std::less implementation for the type of dimensional components. It has support for custom allocators, implements iterators, and provides standard find as well as range queries. It has amortised O(lg n) time (O(n lg n) worst case) on most operations (insert/erase/find optimised) and worst-case O(n) space, and also provides a means to rebalance and thus optimise the tree.
HALoGEN is an extremely powerful and easy to use general-purpose natural language generation system. It consists of a symbolic generator, a forest ranker, and some sample inputs. The symbolic generator includes the Sensus Ontology dictionary based on WordNet. The forest ranker includes a 250 million word ngram language model (unigram, bigram, and trigram) trained on the Wall Street Journal newspaper text. The symbolic generator is written in LISP and requires a Lisp interpreter.
CLIPS is a productive development and delivery expert system tool which provides a complete environment for the construction of rule- and/or object-based expert systems. It is used by numerous users throughout the public and private community, including all NASA sites and branches of the military, numerous federal bureaus, government contractors, universities, and many companies.
Agent Farms is a system for modeling and simulation of complex, multi-agent based systems. The system can be used for creating models of multi-agent systems, interactive and distributed simulation, observation and visualisation of the simulation, and population modification and migration. On those models one can learn about the evolution of strategies in the populations of agents.
Virtual Sound provides unlimited effects and modulation for sophisticated audio generation, with unique instrument simulation capabilities to craft your own unique soundscapes. All effects are parametrized and their parameters may be controlled through the outputs of other effects, through samples, or through special modulators. Instrument simulation is based on recursively-linked filter networks. Actual instruments can also be simulated through the statistical learning library incorporated in VSound.
Kognition is an omnifont OCR software for KDE. It is the result of two diploma theses about omnifont optical character recognition which are included as documentation (in German). Due to the fact that each step of the OCR process can be visualized you can get a quick idea of how OCR works and where the problems lie. However the program may be of minor/no use for end users in its current state.
Mimas Toolkit is a C++ computer vision toolkit. It is easy to use and includes tools for edge detection, corner detection, various filters, optic flow, tracking, blob analysis, Web cam tools for real-time applications, and much more. It also includes many implementations of traditional algorithms such as Canny. It was developed for GNU/Linux but as the GUI is largely separate, porting to other platforms should be straightforward.
Genetic Algorithm File Fitter, GAFFitter for short, is a tool based on a genetic algorithm (GA) that tries to fit a collection of items into as few as possible volumes of a specific size. For example, the items might be files/directories and the volumes might be CDs or DVDs. The task of arranging items in such manner that the number of required bins is minimized is called Bin Packing, a NP-hard combinatorial problem for which no deterministic polynomial-time algorithm is known. Using heuristics, such as GAs, it is usually possible to approximate -- and often reach -- the best solution for the problem within a reasonable time.
Grok is a library of Java components for performing various natural language tasks. These include several preprocessing tasks, chart parsing, a large categorial grammar for English (induced from the Penn treebank), and some knowledge representation components (basic coreference, salience tracking, etc.). The library also has a companion kit which provides a GUI interface to the components, several of which are implementations of interfaces in the Quipu OpenNLP API.