193 projects tagged "Linguistic"

Download Website Updated 12 May 2005 Tomabaem

Pop 13.27
Vit 1.00

Tomabaem is a substitute for the system's Character Palette, at least for people focusing on the so-called CJKV languages (Chinese, Japanese, Korean, and Vietnamese). Tomabaem, like Unicode, is cross-language. Whatever you want to know about a Chinese character, there is a good chance that Tomabaem can look it up, whether by the Cantonese pronunciation, the UTF-16 codepoint, the radical, the meaning, or the character itself, which you can paste or drag and drop in from another document. It uses the UniHan.txt file from the Unicode Consortium as the basis for the data shown.
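As a rough illustration of this kind of lookup (not Tomabaem's actual code), the Unihan data files store one tab-separated record per line in the form `U+XXXX<TAB>kField<TAB>value`. The sketch below assumes that layout; the function names `parse_unihan` and `find_by` are hypothetical, and the sample records are illustrative rather than copied from the real file.

```python
def parse_unihan(lines):
    """Build {character: {field: value}} from Unihan-style tab-separated lines."""
    table = {}
    for line in lines:
        line = line.rstrip("\n")
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        code, field, value = line.split("\t", 2)
        char = chr(int(code[2:], 16))  # "U+4E2D" -> the character itself
        table.setdefault(char, {})[field] = value
    return table


def find_by(table, field, value):
    """Return characters whose space-separated `field` contains `value`."""
    return [c for c, f in table.items() if value in f.get(field, "").split()]


# Illustrative sample records (assumed values, for demonstration only).
sample = [
    "# comment line",
    "U+4E2D\tkCantonese\tzung1",
    "U+4E2D\tkDefinition\tcentral; center, middle",
    "U+6587\tkCantonese\tman4",
]
table = parse_unihan(sample)
matches = find_by(table, "kCantonese", "zung1")
```

A reverse index keyed on field values would be faster for repeated queries; the linear scan here just keeps the sketch short.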

No download Website Updated 02 Mar 2013 jsesh

Pop 54.99
Vit 7.64

JSesh is an editor for ancient Egyptian hieroglyphic texts. It can export the text into picture formats, such as WMF files for easy inclusion in word processors. JSesh can also be used as a library for other projects concerning ancient Egyptian.

No download No website Updated 07 Apr 2005 Lost in Translation

Pop 16.25
Vit 1.00

Lost in Translation is a steganographic encoder that embeds information in the "noise" created by automatic translation of natural language documents. Because natural language translation inherently leaves plenty of room for variation, it is well suited to steganographic applications. Also, because legitimate automatic translations frequently contain errors, additional errors inserted by an information hiding mechanism are plausibly undetectable and would appear to be part of the normal noise associated with translation.
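The general idea of hiding bits in translation variation can be sketched as follows. This is not the tool's actual algorithm, only a minimal illustration under the assumption that sender and receiver can both produce the same ordered list of candidate translations for each sentence; the names `embed` and `extract` and the sample data are hypothetical.

```python
def capacity(options):
    """Number of whole bits that a choice among `options` can carry."""
    return len(options).bit_length() - 1


def embed(bits, variants):
    """Pick one candidate per sentence so that the choices encode `bits`."""
    chosen, pos = [], 0
    for options in variants:
        k = capacity(options)
        chunk = bits[pos:pos + k].ljust(k, "0")  # pad with zeros when bits run out
        chosen.append(options[int(chunk, 2) if k else 0])
        pos += k
    return chosen


def extract(sentences, variants):
    """Recover the bit string from which candidate was chosen."""
    bits = ""
    for sentence, options in zip(sentences, variants):
        k = capacity(options)
        if k:
            bits += format(options.index(sentence), f"0{k}b")
    return bits


# Hypothetical candidate translations for three source sentences.
variants = [
    ["a1", "a2"],
    ["b1", "b2", "b3", "b4"],
    ["c1"],
]
stego_text = embed("110", variants)
recovered = extract(stego_text, variants)
```

A sentence with only one candidate carries no bits, and a practical scheme would also need a shared key so that an observer cannot simply regenerate the candidate lists.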

Download Website Updated 20 Nov 2009 po for anything

Pop 41.86
Vit 3.19

The goal of po4a (po for anything) is to ease the creation and maintenance of translations using gettext tools in areas where they were not expected, such as documentation.

Download Website Updated 22 Mar 2007 SlpTK

Pop 21.33
Vit 1.60

SlpTK is an ANSI C library, together with a set of utilities and scripts, for natural language processing. It provides data structures and processing routines for the lexical and syntactic levels.

Download Website Updated 18 Mar 2005 Universal Text Recognizer and Converter

Pop 39.24
Vit 1.00

The Universal Text Recognizer and Converter (Utrac) is a command-line tool and a C library that recognizes the encoding of an input file (UTF-8, ISO-8859-1, CP437, etc.) and its end-of-line type (CR, LF, or CRLF). It features automatic recognition (which depends on the file and on the system's locale, and is reliable in most cases), assistance for verification or manual recognition, and conversion to another charset and/or end-of-line type.
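To make the two recognition tasks concrete, here is a deliberately naive sketch. It is not Utrac's algorithm: it counts line-ending byte sequences and tries candidate encodings in order, and the names `detect_eol` and `guess_encoding` are hypothetical. Since ISO-8859-1 accepts every byte sequence, it only works as a last-resort fallback, which is one reason real detectors use statistics and locale hints.

```python
from typing import Optional, Sequence


def detect_eol(data: bytes) -> str:
    """Return the dominant end-of-line type: 'CRLF', 'CR', 'LF', or 'none'."""
    crlf = data.count(b"\r\n")
    cr = data.count(b"\r") - crlf  # lone CRs only
    lf = data.count(b"\n") - crlf  # lone LFs only
    counts = {"CRLF": crlf, "CR": cr, "LF": lf}
    best = max(counts, key=counts.get)
    return best if counts[best] > 0 else "none"


def guess_encoding(
    data: bytes, candidates: Sequence[str] = ("utf-8", "iso-8859-1")
) -> Optional[str]:
    """Return the first candidate encoding that decodes `data` without error."""
    for name in candidates:
        try:
            data.decode(name)
            return name
        except UnicodeDecodeError:
            continue
    return None


def convert(data: bytes, source: str, target: str, eol: bytes = b"\n") -> bytes:
    """Re-encode `data` and normalize every line ending to `eol`."""
    text = data.decode(source)
    lines = text.replace("\r\n", "\n").replace("\r", "\n").split("\n")
    return eol.decode("ascii").join(lines).encode(target)
```

For example, `convert(b"caf\xe9\r\n", "iso-8859-1", "utf-8")` re-encodes the text as UTF-8 and replaces the CRLF with a single LF.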

Download Website Updated 15 Nov 2009 minpair

Pop 43.08
Vit 4.53

Minpair consists of two programs, a C command-line program and a Tcl/Tk GUI, each of which can independently generate a complete list of minimal pairs (words differing in exactly one segment) for use in linguistic research. The GUI may also be used to control the faster CLI program. Both allow sequences of characters to be defined as single segments. Unicode is fully supported. It is also possible to obtain a list of pairs differing in exactly two positions for use in finding phonological rules.
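The core search can be sketched in a few lines. This is an illustration rather than Minpair's implementation, and it assumes that a pair must have the same number of segments and differ by substitution only (no insertions or deletions). The names `segment` and `pairs_differing` are hypothetical.

```python
from itertools import combinations


def segment(word, multi=()):
    """Split `word` into segments, treating each string in `multi` as one unit."""
    units = sorted(multi, key=len, reverse=True)  # prefer the longest match
    out, i = [], 0
    while i < len(word):
        for unit in units:
            if word.startswith(unit, i):
                out.append(unit)
                i += len(unit)
                break
        else:
            out.append(word[i])  # no multi-character unit matched here
            i += 1
    return tuple(out)


def pairs_differing(words, multi=(), k=1):
    """Return sorted word pairs whose segment lists differ in exactly k positions."""
    segs = {w: segment(w, multi) for w in words}
    result = []
    for a, b in combinations(sorted(segs), 2):
        sa, sb = segs[a], segs[b]
        if len(sa) == len(sb) and sum(x != y for x, y in zip(sa, sb)) == k:
            result.append((a, b))
    return result


words = ["pat", "bat", "pit", "chat", "cat"]
minimal = pairs_differing(words, multi=("ch",))
two_apart = pairs_differing(words, multi=("ch",), k=2)
```

With "ch" defined as a single segment, "chat" and "pat" count as a minimal pair; without that definition "chat" has four segments and pairs with none of the three-segment words. Comparing all pairs is quadratic in the word count, so a large lexicon would call for indexing words by their segments with one position masked out.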

Download Website Updated 30 Jan 2005 GNOME Translate

Pop 25.10
Vit 1.00

GNOME Translate is a GNOME interface to libtranslate. It can translate a text or Web page between several natural languages, and it can automatically detect the source language as you type in text.

No download Website Updated 30 Jan 2005 libtranslate

Pop 21.10
Vit 1.00

libtranslate is a library for translating text and Web pages between natural languages. Its modular infrastructure allows the user to implement new translation services separately from the core library. libtranslate is shipped with a generic module that supports Web-based translation services such as Babel Fish, Google Language Tools, and SYSTRAN. Moreover, the generic module allows new services to be added simply by adding a few lines to an XML file. The libtranslate distribution includes a powerful command line interface.

Download Website Updated 28 Jan 2007 ByteName

Pop 46.58
Vit 3.92

ByteName is a tool that, for each byte of the input, prints a line consisting of the byte offset, the byte in hex, octal, binary, and decimal, and its description in a selected single-byte encoding. A command-line flag suppresses the lines for ASCII characters, which is useful for locating stray non-ASCII codes. It can also generate a chart for a specified encoding or, for a specified codepoint, generate descriptions in all known encodings.
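The per-byte report described above might look roughly like the sketch below. It is not ByteName's code or output format: the column layout, the `describe_bytes` name, and the `skip_ascii` parameter are assumptions, and character descriptions come from Python's `unicodedata` names instead of the tool's own tables.

```python
import unicodedata


def describe_bytes(data: bytes, encoding: str = "iso-8859-1", skip_ascii: bool = False):
    """Return one tab-separated line per byte: offset, hex, octal, binary, decimal, name."""
    lines = []
    for offset, byte in enumerate(data):
        if skip_ascii and byte < 0x80:
            continue  # suppress ASCII so that stray high bytes stand out
        try:
            char = bytes([byte]).decode(encoding)
            name = unicodedata.name(char, "<unnamed>")  # control characters have no name
        except UnicodeDecodeError:
            name = "<undefined>"  # byte is not assigned in this encoding
        lines.append(
            f"{offset}\t0x{byte:02X}\t0o{byte:03o}\t{byte:08b}\t{byte}\t{name}"
        )
    return lines


report = describe_bytes(b"A\xe9", skip_ascii=True)
```

Here only the byte at offset 1 is reported, described as LATIN SMALL LETTER E WITH ACUTE under ISO-8859-1.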
