SCAN is a personal information retrieval framework, combining search, text analysis, tagging, and metadata functions for document collections management. SCAN is a component-based software using a number of plugins for specific features. The basic SCAN platform can be easily extended with plugins for different document formats and document location types.
WordNet® is an on-line lexical reference system whose design is inspired by current psycholinguistic theories of human lexical memory. English nouns, verbs, adjectives and adverbs are organized into synonym sets, each representing one underlying lexical concept. Different relations link the synonym sets.
Maximum entropy is a powerful method for constructing statistical models of classification tasks, such as part-of-speech tagging in Natural Language Processing. The Quipu Maximum Entropy Package is a Java implementation of the maximum entropy framework. It allows you to train, evaluate, and use maxent models.
Hodie prints the current date and time to stdout in Roman numerals, with grammatically correct Latin. Complete with Id., Kal., Non., pridie, postridie, bis, and all the other nice annoyances. As an option, it even provides you with current date according to Roman calendar -- that is 'ab urbe condita'; after Rome was built.
transtoba2 facilitates the transliteration or transcription of a word or text from the Roman script into the Toba Batak script. Transliterating from the Roman into the Batak script is not an easy undertaking, as the Batak script has a number of peculiarities that complicates the process of transliteration. This program uses a set of algorithms which enables the user to effortlessly transliterate from the Roman to the Toba Batak script.
Grok is a library of Java components for performing various natural language tasks. These include several preprocessing tasks, chart parsing, a large categorial grammar for English (induced from the Penn treebank), and some knowledge representation components (basic coreference, salience tracking, etc.). The library also has a companion kit which provides a GUI interface to the components, several of which are implementations of interfaces in the Quipu OpenNLP API.