193 projects tagged "Linguistic"

Download Website Updated 14 Sep 2005 Dacco

Pop 25.06
Vit 2.72

Dacco is a collaborative English-Catalan, Catalan-English dictionary project. It seeks to provide an up-to-date, comprehensive, bilingual dictionary that will be of benefit to learners of both languages. The dictionaries are downloadable and customizable (using XSLT) and contain audio files.

Download Website Updated 08 Oct 2003 LinkGrammar-WN

Pop 49.09
Vit 1.00

LinkGrammar-WN is a lexicon expansion for the Link Grammar Parser (LGP). The Link Grammar Parser is a syntactic parser of the English language that is capable of handling a wide variety of syntactic constructions and is considered quite robust. The LinkGrammar-WN project aims to import lexical information from WordNet in an effort to increase the size of the LGP lexicon. This project should interest anyone working on NLP (natural language parsing) of English text.

Download Website Updated 05 Mar 2004 MegaLettering

Pop 18.11
Vit 1.44

MegaLettering is a PHP engine created to manage the Italian translation of www.megatokyo.com, but it is written with general use in mind, so it can support any number of languages. Text in balloons can be translated by using a MySQL database that defines the balloon shapes, the translated text, and the fonts used to render the new text.

No download Website Updated 19 Nov 2003 JScript Logic

Pop 24.33
Vit 1.00

JScript Logic implements core routines for solving logic puzzles. Advanced features are being added for derivations using rules of formal logic.

Download Website Updated 28 Dec 2003 Convert character set

Pop 19.29
Vit 1.00

Convert character set converts text strings between different character set encodings. It supports conversion between single-byte character sets, from single-byte to multi-byte character sets (UTF-8), and from multi-byte to single-byte. All conversion output can be saved with numeric entities, which makes it independent of the browser's character set. The main requirement is that each character must exist in both character sets; otherwise the conversion returns an error.
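The conversion described above can be illustrated with a minimal Python sketch. This is not the project's own code or API: the function name `convert_charset` and its `use_entities` flag are hypothetical, and the sketch assumes that "saved with numeric entities" means writing every non-ASCII character as an `&#NNN;` reference.

```python
def convert_charset(data: bytes, source: str, target: str,
                    use_entities: bool = False) -> bytes:
    """Re-encode `data` from the `source` encoding to the `target` encoding.

    Hypothetical helper for illustration only.
    """
    # Decode the input bytes; invalid input raises UnicodeDecodeError.
    text = data.decode(source)

    if use_entities:
        # Assumption: entity output is plain ASCII, with every other
        # character written as a numeric entity such as &#233;.
        return text.encode("ascii", errors="xmlcharrefreplace")

    # Strict encoding raises UnicodeEncodeError when a character does not
    # exist in the target character set.
    return text.encode(target, errors="strict")


if __name__ == "__main__":
    latin1_bytes = "café".encode("latin-1")
    print(convert_charset(latin1_bytes, "latin-1", "utf-8"))
    print(convert_charset(latin1_bytes, "latin-1", "utf-8", use_entities=True))
```

Under these assumptions, converting a euro sign from UTF-8 to Latin-1 fails with an error because the character is missing from the target set, while the entity mode writes it as a numeric reference instead.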

Download Website Updated 08 Jan 2004 HindiDict

Pop 15.49
Vit 1.00

HindiDict creates a LaTeX-formatted Hindi dictionary from a text file. The entries are sorted both by Hindi and by English.

Download Website Updated 03 Feb 2004 Polygen

Pop 26.91
Vit 61.04

PolyGen is a program for generating random sentences according to a grammar definition, that is, following custom syntactic and lexical rules. Formally, it is an interpreter for a language that is itself designed to define languages, where interpreting means executing a source program in real time and eventually outputting its result. Here, a source program is a grammar definition. Execution consists of exploring the grammar by selecting a random path, and the result is the sentence built along the way.
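The idea of exploring a grammar along a random path can be sketched in a few lines of Python. This is not PolyGen's grammar language or implementation: the toy `GRAMMAR` dictionary and the `generate` function are assumptions made purely for illustration.

```python
import random

# Hypothetical toy grammar: each nonterminal maps to a list of alternative
# productions, and each production is a sequence of symbols.
GRAMMAR = {
    "S": [["NP", "VP"]],
    "NP": [["the", "N"], ["a", "N"]],
    "VP": [["V", "NP"], ["V"]],
    "N": [["parser"], ["dictionary"], ["linguist"]],
    "V": [["reads"], ["builds"], ["sleeps"]],
}


def generate(symbol, grammar, rng):
    """Expand `symbol` into a list of words by following one random path."""
    if symbol not in grammar:
        # Terminal symbol: it is already a word of the sentence.
        return [symbol]
    # Pick one alternative at random, then expand each of its parts in order.
    production = rng.choice(grammar[symbol])
    words = []
    for part in production:
        words.extend(generate(part, grammar, rng))
    return words


if __name__ == "__main__":
    print(" ".join(generate("S", GRAMMAR, random.Random())))
```

Each run picks one alternative per nonterminal, so repeated calls produce different sentences from the same grammar definition.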

Download Website Updated 18 Aug 2008 An Gramadóir

Pop 50.38
Vit 2.68

An Gramadóir is a grammar checking engine that is designed for the rapid development of grammar checkers for minority languages and other languages with limited computational resources. Rule specifications are given according to a simple syntax combining XML and regular expressions. Part-of-speech tagging can be learned from text corpora using statistical methods. It is currently implemented for Irish (Gaeilge).

No download Website Updated 05 May 2012 Text-Tokenizer

Pop 64.76
Vit 7.69

Text-Tokenizer is a Perl module based on a flex-generated lexical analyzer that can be used for parsing text (configuration) files. With this module, a simple but full-featured configuration parser can be written very easily.
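To show what tokenizing a configuration file involves, here is a small Python sketch. It does not reproduce the Perl module's interface: the token classes, the `TOKEN_RE` pattern, and the `tokenize` function are hypothetical, and a regular expression stands in for the flex-generated analyzer.

```python
import re

# Hypothetical token classes for a simple "key = value;" configuration syntax.
TOKEN_RE = re.compile(r"""
    (?P<COMMENT>\#[^\n]*)            # comment running to the end of the line
  | (?P<STRING>"(?:[^"\\]|\\.)*")    # double-quoted string with escapes
  | (?P<NUMBER>\d+(?:\.\d+)?)        # integer or decimal number
  | (?P<WORD>[A-Za-z_][\w.-]*)       # bare identifier
  | (?P<PUNCT>[=;{}])                # single-character punctuation
  | (?P<SPACE>\s+)                   # whitespace, including newlines
""", re.VERBOSE)


def tokenize(text):
    """Return a list of (kind, value) pairs, skipping whitespace and comments."""
    tokens = []
    pos = 0
    while pos < len(text):
        match = TOKEN_RE.match(text, pos)
        if match is None:
            raise ValueError(f"unexpected character {text[pos]!r} at offset {pos}")
        if match.lastgroup not in ("SPACE", "COMMENT"):
            tokens.append((match.lastgroup, match.group()))
        pos = match.end()
    return tokens


if __name__ == "__main__":
    for token in tokenize('port = 8080; # listen port\nname = "my host";'):
        print(token)
```

A configuration parser can then be layered on top of the token stream, for example by reading WORD, "=", value, ";" sequences into a dictionary.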

Download Website Updated 15 Mar 2005 Ellogon

Pop 53.22
Vit 1.82

Ellogon is a multi-lingual, cross-platform, general-purpose language engineering environment, developed to aid both researchers in computational linguistics and companies that produce and deliver language engineering systems. As a language engineering platform, it offers an extensive set of facilities, including tools for processing and visualising textual/HTML/XML data and associated linguistic information, support for lexical resources (such as creating and embedding lexicons), and tools for creating annotated corpora, accessing databases, comparing annotated data, and transforming linguistic information into vectors for use with various machine learning algorithms.
