193 projects tagged "Linguistic"

Download Website Updated 31 May 2005 IPA-CXS/X-Sampa Converter

Screenshot
Pop 41.05
Vit 1.42

IPA-CXS/X-Sampa Converter is a selection of modules for various programming languages (C, Perl, Lisp, and Python) for translating between IPA (International Phonetic Alphabet) and ASCII versions, in particular CXS, which is a close relative to X-Sampa. The project homepage contains a demo for using the Perl script as an online converter.

Download Website Updated 02 Jan 2006 Full Text for SQLite3

Screenshot
Pop 31.42
Vit 1.42

Full Text for SQLite3 is a full text indexer for data stored into a sqlite3 database. The text fields stored in the database can be split word-by-word and stored in a form suitable for lookup.

No download No website Updated 20 Feb 2011 Linguas OS

Screenshot
Pop 23.92
Vit 1.42

Linguas OS is a Linux live CD that includes OpenOffice.org, Omega T, Evince, and other basic tools that translators use on a daily basis for translation work. It includes Internet browsing and e-mail software.

Download Website Updated 16 Sep 2004 buckwalter2unicode

Screenshot
Pop 12.53
Vit 1.41

buckwalter2unicode is a fairly simple Python script designed to convert Arabic text that is written using Buckwalter's transliteration system to a Unicode encoding (and vice-versa).

Download Website Updated 06 Sep 2003 polcnv

Screenshot
Pop 16.43
Vit 1.15

polcnv is designed to convert files between different encoding methods used for Polish texts. It can be also used to covert plain text documents in any language using supported character encoding methods. The program uses ISO-10646 UCS-4 (equivalent to Unicode UTF-32) as internal representation.

Download Website Updated 18 Dec 2010 xgrk

Screenshot
Pop 22.69
Vit 1.09

xgrk provides the possibility to change keyboard mapping with alt-shift or meta-shift combinations or by clicking on the flag image. You will be able to write greek in X programs like netscape or xedit. Keycodes are auto-loaded on startup so it should work with all unices and keyboard layouts. Fonts are not included.

Download Website Updated 13 Oct 2006 Uplug

Screenshot
Pop 26.72
Vit 1.07

Uplug is a collection of tools for linguistic corpus processing, word alignment, and term extraction from parallel corpora. Several tools have been integrated in Uplug. Pre-processing tools include a sentence splitter, tokenizer, and external part-of-speech tagger and shallow parsers. The following external tools are used: the Grok system for English (tagging and chunking) and the morphological analyzer ChaSen for Japanese. Other tools such as the TreeTagger can easily be added. Translated documents can be sentence aligned using the length-based approach by Gale & Church. Words and phrases can be aligned using the clue alignment approach and the toolbox for training statistical alignment models GIZA++.

Download Website Updated 05 Nov 2007 OmegaT

Screenshot
Pop 28.90
Vit 1.07

OmegaT is a translation memory application intended for professional translators. It does not translate for you (software that does this is called "machine translation"). It features fuzzy matching, match propagation, simultaneous processing of multiple-file projects, simultaneous use of multiple translation memories, and external glossaries. Document file formats include plain text, HTML, and OpenOffice.org/StarOffice. It has Unicode (UTF-8) support (can be used with non-Latin alphabets). It is compatible with other translation memory applications (TMX Level 1).

Download Website Updated 20 Sep 2001 lexica

Screenshot
Pop 21.21
Vit 1.05

Lexica is a graphical interface to Unix/Linux dictionary resources. It is implemented in TCL/Tk and provides access to dict, wn, and grep.

No download Website Updated 08 Jun 2004 PAiN

Screenshot
Pop 16.00
Vit 1.03

PAiN is a new MUD codebase written in Java. It provides a general purpose persistence engine (PAiN DB) and the ability to do dynamic code reloading.

Screenshot

Project Spotlight

CT-gui/CT-synth/CT-farfisa

A GUI toolkit for Linux and Android.

Screenshot

Project Spotlight

fcron

A command scheduler for non-permanently-running systems.