RSS 81 projects tagged "Linguistic"

Download Website Updated 07 Apr 2014 Verbiste

Screenshot
Pop 336.12
Vit 135.35

Verbiste is a French conjugation system implemented as a C++ library, a GNOME applet, and two command-line tools. It can conjugate verbs and analyze conjugated verbs to determine their mode, tense, and person. The knowledge base contains over 6700 verbs.

No download Website Updated 23 Feb 2000 Mueller English-Russian Dictionary Kit

Screenshot
Pop 29.29
Vit 71.87

This is the first respectable English-Russian Dictionary with transcription (IPA) under GNU GPL. The dictionary has 46233 word articles and is 5.5MB. "MOVA" is a set of Bash and Tcl/TK scripts for dictionary management under UNIX.

Download Website Updated 07 Mar 2003 lacy

Screenshot
Pop 31.14
Vit 63.70

lacy is a conversion tool that changes Latin letters to Cyrillic.

Download Website Updated 03 Feb 2004 Polygen

Screenshot
Pop 27.02
Vit 61.03

PolyGen is a program for generating random sentences according to a grammar definition, that is following custom syntactical and lexical rules. Formally, it is an interpreter of a language itself designed to define languages, where to interpret means executing a source program in real time and eventually outputting its result. Here, a source program is a grammar definition. The execution consists of the exploration of such grammar by selecting a random path, and the result is the sentence built on the way.

Download Website Updated 31 Aug 2009 Apertium

Screenshot
Pop 36.95
Vit 46.56

Apertium is a machine translation platform, initially aimed at related-language pairs, but recently expanded to deal with more divergent language pairs (such as English-Catalan). The platform provides a language-independent machine translation engine, tools to manage the linguistic data necessary to build a machine translation system for a given language pair, and linguistic data for a growing number of language pairs.

Download Website Updated 04 Jul 2011 Emdros

Screenshot
Pop 158.51
Vit 16.83

Emdros is a corpus query system for storing and searching linguistically annotated text. It is very generic, supporting almost any kind of annotation from almost any linguistic theory. All linguistic levels of analysis are supported, including phonology, morphology, the lexical level, syntax, and discourse. The core libraries act as a middleware layer between a client and an underlying SQL database. MySQL, PostgreSQL, and SQLite are supported.

Download Website Updated 15 May 2011 uni2ascii

Screenshot
Pop 187.70
Vit 12.22

uni2ascii and ascii2uni provide conversion in both directions between UTF-8 Unicode and more than thirty 7-bit ASCII equivalents, including RFC 2396 URI format and RFC 2045 Quoted Printable format, the representations used in HTML, SGML, XML, OOXML, the Unicode standard, Rich Text Format, POSIX portable charmaps, POSIX locale specifications, and Apache log files. It can also convert between the escapes used for Unicode in languages such as Ada, C, Common Lisp, Java, Pascal, Perl, Postscript, Python, Scheme, and Tcl.

Download Website Updated 11 Jan 2010 msort

Screenshot
Pop 183.80
Vit 11.43

Msort sorts files in sophisticated ways. Records may be fixed size, newline-separated blocks, or terminated by any specified character. Key fields may be selected by position, tag, or character range. For each key, distinct exclusions, multigraphs, substitutions, and a sort order may be defined or locale collation rules used. Comparisons may be lexicographic, numeric, numeric string, hybrid, random, by string length, angle, domain name, date, time, month name, or ISO8601 timestamp. Keys may be reversed so as to generate reverse dictionaries. Optional keys are supported. Unicode is supported, including full case-folding. Msort itself has a somewhat complex command line interface, but may be driven by an optional GUI.

No download Website Updated 02 Aug 2012 Poliqarp

Screenshot
Pop 54.63
Vit 7.84

Poliqarp is a universal suite of utilities for processing large corpora. It includes a concordancer that works on binary corpora compiled for efficient searching and a corpus builder. It supports positional tagsets, ambiguities in the texts, and Unicode.

Download Website Updated 28 Feb 2012 Hspell

Screenshot
Pop 69.97
Vit 7.62

Hspell is a Hebrew linguistic project. It features a Hebrew spell-checker, and aims to use the databases and algorithms developed as a morphology engine (for example, for search engines), and in the future for advanced things like Hebrew speech synthesis.

Screenshot

Project Spotlight

Wenity

A multi-platform Zenity clone.

Screenshot

Project Spotlight

EC2Box

A Web-based multi-terminal ssh tool for EC2 instances.