193 projects tagged "Linguistic"

Download No website Updated 31 Mar 2004 jgram

Screenshot
Pop 19.39
Vit 1.00

jgram is a simple markov-chain library suitable for building rudimentary n-gram representations of sequences of any Java objects. It has an extensible scoring mechanism and a generator for traversing a random path through n-gram states based on transition score.

No download Website Updated 18 Apr 2004 Mind AI

Screenshot
Pop 42.43
Vit 1.00

The purpose of Mind AI is to build an artificial mind based on some advanced concepts: machine learning, representation and meta representation of concepts, concept reflection, reification (concept to meta concept), and denotation (meta concept to concept), and to explore some new concepts. Interaction with the AI is done via IRC.

Download Website Updated 25 May 2004 hiercat

Screenshot
Pop 26.94
Vit 1.00

Hiercat is an automatic text classifier that uses a keyword hierarchy to improve categorization. It uses the generative probabilistic model of Gaussier, et. al 2001 as its document model.

Download Website Updated 31 May 2005 CocoaCEDICT

Screenshot
Pop 20.37
Vit 2.35

CocoaCEDICT is a Cocoa interface to the CEDICT Chinese (Mandarin) English dictionary. It supports looking up words by their English definition, Pinyin pronunciation, traditional Chinese text, or simplified Chinese text, and more.

No download Website Updated 10 Apr 2010 Hesperides

Screenshot
Pop 16.34
Vit 2.68

Hesperides is an adaptation for Mac OS X of Didier Willis' Dragon Flame, a dictionary for Sindarin (the grey-elven language, invented by J.R.R. Tolkien). It integrates Didier's Sindarin dictionary with a state-of-the-art interface and the Narmacil engine (to display the words using Tengwar).

Download Website Updated 21 Jul 2004 Mobile i2e

Screenshot
Pop 23.24
Vit 1.00

Mobile i2e is a MIDP adaptation of the Linux translator "i2e". It supports i2e and idp dictionary file types. Support for new dictionary file types can be added with little effort.

Download Website Updated 05 Oct 2013 Apache Lucene

Screenshot
Pop 287.97
Vit 19.47

Apache Lucene is a high-performance, full-featured text search engine library written entirely in Java. It is suitable for nearly any application that requires full-text search, especially cross-platform.

No download Website Updated 02 Sep 2004 Babel

Screenshot
Pop 15.10
Vit 59.64

Babel is a linguistics tool that uses dynamic data structuring of words and context spaces. Models can be mixed and references added to created terms. It includes the markov, ngram, and ispell tools.

Download Website Updated 16 Sep 2004 buckwalter2unicode

Screenshot
Pop 12.53
Vit 1.41

buckwalter2unicode is a fairly simple Python script designed to convert Arabic text that is written using Buckwalter's transliteration system to a Unicode encoding (and vice-versa).

Download Website Updated 06 Oct 2004 Dowser

Screenshot
Pop 55.48
Vit 2.24

Dowser is a Web research and archiving tool that clusters results from search engines, associates words that appear in previous searches, and keeps a local cache of all the results you click on in a searchable database along with summaries and links to related information. It helps you to keep track of what you find, with no advertising.

Screenshot

Project Spotlight

CT-gui/CT-synth/CT-farfisa

A GUI toolkit for Linux and Android.

Screenshot

Project Spotlight

fcron

A command scheduler for non-permanently-running systems.