RSS 27 projects tagged "Linguistic"

Download Website Updated 30 Jan 2001 Ciao Prolog

Screenshot
Pop 110.27
Vit 1.00

Ciao is a complete Prolog system subsuming ISO-Prolog with a novel modular design which allows both restricting and extending the language. Ciao extensions currently include feature terms (records), higher-order, functions, constraints, objects, persistent predicates, a good base for distributed execution (agents), and concurrency. Libraries also support WWW programming, sockets, and external interfaces (C, Java, TCL/Tk, relational databases, etc.). An Emacs-based environment, a stand-alone compiler, and a toplevel shell are also provided.

Download Website Updated 05 Dec 2001 Grok

Screenshot
Pop 54.41
Vit 2.37

Grok is a library of Java components for performing various natural language tasks. These include several preprocessing tasks, chart parsing, a large categorial grammar for English (induced from the Penn treebank), and some knowledge representation components (basic coreference, salience tracking, etc.). The library also has a companion kit which provides a GUI interface to the components, several of which are implementations of interfaces in the Quipu OpenNLP API.

No download Website Updated 22 Dec 2008 Open Translation Engine

Screenshot
Pop 67.75
Vit 5.50

Open Translation Engine (OTE) is a Web-based system to enable community management of translation dictionaries.

Download Website Updated 04 Apr 2003 The Quipu Maximum Entropy Package

Screenshot
Pop 36.99
Vit 2.51

Maximum entropy is a powerful method for constructing statistical models of classification tasks, such as part-of-speech tagging in Natural Language Processing. The Quipu Maximum Entropy Package is a Java implementation of the maximum entropy framework. It allows you to train, evaluate, and use maxent models.

Download Website Updated 05 Dec 2001 The Quipu OpenNLP API

Screenshot
Pop 34.06
Vit 2.37

The Quipu OpenNLP API is a preliminary collection of Java interfaces for standardizing how natural language processing components interact.

Download Website Updated 22 Jul 2002 Linguaphile

Screenshot
Pop 63.74
Vit 1.49

Linguaphile is a simple command line language translator. It is open source, platform independent, and programmed in Perl. Linguaphile currently supports the following languages: Afrikaans, Alawa, Albanian, Arrernte, Basque, Belarusian, Bulgarian, Catalan, Croatian, Czech, Danish, Dutch, English, Esperanto, Estonian, Finnish, French, Galician, German, Greek, Hawaiian, Hungarian, Icelandic, Indonesian, Interlingua, Irish, Italian, Kala Lagaw Ya, Korean, Kriol, Latvian, Lithuanian, Malay, Maltese, Maori, Norwegian, Pitjantjatjara, Polish, Portuguese, Romanian, Russian, Samoan, Serbian, Slovak, Slovenian, Spanish, Swahili, Swedish, Thai, Tok Pisin, Turkish, Ukrainian, Warlpiri, and Welsh. The Spanish to English translation is the most useful at this stage.

Download Website Updated 19 Aug 2002 FramerD

Screenshot
Pop 47.24
Vit 1.77

FramerD is a semi-structured object database integrated with a Scheme-based scripting language which supports multi-lingual programming (with pervasive Unicode), a stable module system for programming in the large, distributed applications (via an extensible RPC protocol), non-deterministic (PROLOG-like) evaluation for search and set operations, multi-threaded program execution, extensive tools for text and language analysis, built-in HTML/XML/MIME parsers, and intuitive (CGI- and FastCGI-based) Web scripting. The built-in object database robustly supports millions of objects and indexed access to those objects, both through disk files and networked servers.

Download Website Updated 26 Mar 2006 dbacl

Screenshot
Pop 174.48
Vit 4.91

dbacl is a digramic Bayesian text classifier. Given some text, it calculates the posterior probabilities that the input resembles one of any number of previously learned document collections. It can be used to sort incoming email into arbitrary categories such as spam, work, and play, or simply to distinguish an English text from a French text. It fully supports international character sets, and uses sophisticated statistical models based on the Maximum Entropy Principle.

Download Website Updated 09 Feb 2003 DadaDodo

Screenshot
Pop 43.45
Vit 1.00

DadaDodo is a program that analyses texts for word probabilities, and then generates random cut-up sentences based on that. It is a travesty generator similar to Dissociated Press, but based on a Markov Chain of length 1.

Download Website Updated 19 Sep 2004 HumAn Language GENerator

Screenshot
Pop 60.40
Vit 1.51

HALoGEN is an extremely powerful and easy to use general-purpose natural language generation system. It consists of a symbolic generator, a forest ranker, and some sample inputs. The symbolic generator includes the Sensus Ontology dictionary based on WordNet. The forest ranker includes a 250 million word ngram language model (unigram, bigram, and trigram) trained on the Wall Street Journal newspaper text. The symbolic generator is written in LISP and requires a Lisp interpreter.

Screenshot

Project Spotlight

openModeller

An ecological niche modeling framework.

Screenshot

Project Spotlight

Barman for PostgreSQL

A backup and recovery manager for PostgreSQL.