193 projects tagged "Linguistic"

SCAN (updated 10 Jun 2008)

Pop 53.60
Vit 2.59

SCAN is a personal information retrieval framework that combines search, text analysis, tagging, and metadata functions for managing document collections. SCAN is component-based software that uses plugins for specific features. The basic SCAN platform can be easily extended with plugins for different document formats and document location types.

WordNet (updated 30 Jul 2007)

Pop 73.85
Vit 2.57

WordNet® is an on-line lexical reference system whose design is inspired by current psycholinguistic theories of human lexical memory. English nouns, verbs, adjectives and adverbs are organized into synonym sets, each representing one underlying lexical concept. Different relations link the synonym sets.
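The organization described above (words grouped into synonym sets, with relations linking the sets) can be pictured with a small data structure. The following is only an illustrative sketch: the `Synset` class, the `synonyms` helper, and the two toy entries are assumptions made for this example, not WordNet's actual file format or API.

```python
from dataclasses import dataclass, field


@dataclass
class Synset:
    """One lexical concept: a set of synonymous words plus typed links (hypothetical class)."""
    words: frozenset
    relations: dict = field(default_factory=dict)  # relation name -> list of Synset


# Toy data for illustration only.
vehicle = Synset(frozenset({"motor vehicle", "automotive vehicle"}))
car = Synset(frozenset({"car", "automobile"}), {"hypernym": [vehicle]})


def synonyms(word, synsets):
    """Collect every other word that shares a synset with `word`."""
    found = set()
    for synset in synsets:
        if word in synset.words:
            found |= synset.words - {word}
    return found


print(synonyms("car", [vehicle, car]))
print(sorted(car.relations["hypernym"][0].words))
```

Keeping relations on the synset rather than on individual words mirrors the description: links connect concepts, and every word in a set inherits them.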

The Quipu Maximum Entropy Package (updated 04 Apr 2003)

Pop 38.26
Vit 2.50

Maximum entropy is a powerful method for constructing statistical models of classification tasks, such as part-of-speech tagging in Natural Language Processing. The Quipu Maximum Entropy Package is a Java implementation of the maximum entropy framework. It allows you to train, evaluate, and use maxent models.
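As a rough illustration of what a trained maxent model computes, a conditional maximum entropy classifier scores each label by summing the weights of the active (feature, label) pairs and then normalizes the exponentiated scores. The sketch below is written in Python for brevity and is not the package's Java API; the function name `maxent_probs`, the feature string, and the weight value are hypothetical.

```python
import math


def maxent_probs(features, weights, labels):
    """Return p(label | features) under a conditional maxent (log-linear) model.

    `weights` maps (feature, label) pairs to real-valued weights; missing pairs count as 0.
    """
    scores = {
        label: sum(weights.get((feat, label), 0.0) for feat in features)
        for label in labels
    }
    # Subtract the maximum score before exponentiating, for numerical stability.
    top = max(scores.values())
    exps = {label: math.exp(score - top) for label, score in scores.items()}
    total = sum(exps.values())
    return {label: value / total for label, value in exps.items()}


# Hypothetical part-of-speech example: one feature that favors VERB.
weights = {("suffix=ing", "VERB"): 2.0}
probs = maxent_probs({"suffix=ing"}, weights, ["NOUN", "VERB"])
print(probs)
```

Training, which this sketch omits, consists of choosing the weights so that the model's expected feature counts match those observed in the training data.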

Mguesser (updated 06 Mar 2008)

Pop 30.13
Vit 2.50

Mguesser is a tool to guess a text's character set and language. It is a standalone part of the mnoGoSearch engine. More than 100 character set and language combinations are supported.

QuickTranslate Online (updated 26 Dec 2004)

Pop 35.67
Vit 2.50

QuickTranslate is a tool for translating text in 16 languages using the Systranbox service, which is used by Google, Altavista, Lycos, Terra, AOL, and others.

hodie (updated 30 Jan 2001)

Pop 38.14
Vit 2.48

Hodie prints the current date and time to stdout in Roman numerals, with grammatically correct Latin. Complete with Id., Kal., Non., pridie, postridie, bis, and all the other nice annoyances. As an option, it even provides the current date according to the Roman calendar, that is, 'ab urbe condita' (counted from the founding of Rome).
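Two of the pieces mentioned above, Roman numerals and 'ab urbe condita' years, are easy to sketch. The code below is an independent illustration and not hodie's implementation; the function names are made up for this example, it assumes the traditional founding date of 753 BC, and it ignores the Latin day names (Kal., Non., Id.) and any difference in when the Roman year began.

```python
def to_roman(n: int) -> str:
    """Convert an integer in the range 1..3999 to a Roman numeral."""
    if not 0 < n < 4000:
        raise ValueError("value out of range for standard Roman numerals")
    pairs = [
        (1000, "M"), (900, "CM"), (500, "D"), (400, "CD"),
        (100, "C"), (90, "XC"), (50, "L"), (40, "XL"),
        (10, "X"), (9, "IX"), (5, "V"), (4, "IV"), (1, "I"),
    ]
    parts = []
    for value, symbol in pairs:
        # Take as many of this symbol as fit, then continue with the remainder.
        count, n = divmod(n, value)
        parts.append(symbol * count)
    return "".join(parts)


def auc_year(ad_year: int) -> int:
    """AD year to 'ab urbe condita' year, assuming Rome was founded in 753 BC."""
    return ad_year + 753


print(to_roman(2001))            # MMI
print(to_roman(auc_year(2001)))  # MMDCCLIV
```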

transtoba2 (updated 28 Jul 2008, no download)

Pop 11.31
Vit 2.46

transtoba2 facilitates the transliteration or transcription of a word or text from the Roman script into the Toba Batak script. Transliterating from the Roman into the Batak script is not an easy undertaking, as the Batak script has a number of peculiarities that complicate the process. This program uses a set of algorithms that enable the user to transliterate effortlessly from the Roman to the Toba Batak script.

Grok (updated 05 Dec 2001)

Pop 53.74
Vit 2.37

Grok is a library of Java components for performing various natural language tasks. These include several preprocessing tasks, chart parsing, a large categorial grammar for English (induced from the Penn treebank), and some knowledge representation components (basic coreference, salience tracking, etc.). The library also has a companion kit that provides a GUI for the components. Several of the components are implementations of interfaces in the Quipu OpenNLP API.

The Quipu OpenNLP API (updated 05 Dec 2001)

Pop 36.67
Vit 2.37

The Quipu OpenNLP API is a preliminary collection of Java interfaces for standardizing how natural language processing components interact.

CocoaCEDICT (updated 31 May 2005)

Pop 20.37
Vit 2.35

CocoaCEDICT is a Cocoa interface to the CEDICT Chinese (Mandarin) English dictionary. It supports looking up words by their English definition, Pinyin pronunciation, traditional Chinese text, or simplified Chinese text, and more.
