193 projects tagged "Linguistic"

No download No website Updated 07 Apr 2005 Lost in Translation

Screenshot
Pop 16.25
Vit 1.00

Lost in Translation is a steganographic encoder that exploits the possibilities of steganographically embedding information in the "noise'' created by automatic translation of natural language documents. Because natural language translation inherently creates plenty of room for variation, it is ideal for steganographic applications. Also, because there are frequent errors in legitimate automatic text translations, additional errors inserted by an information hiding mechanism are plausibly undetectable and would appear to be part of the normal noise associated with translation.

No download Website Updated 02 Mar 2013 jsesh

Screenshot
Pop 54.99
Vit 7.64

JSesh is an editor for ancient Egyptian hieroglyphic texts. It can export the text into picture formats, such as WMF files for easy inclusion in word processors. JSesh can also be used as a library for other projects concerning ancient Egyptian.

Download Website Updated 12 May 2005 Tomabaem

Screenshot
Pop 13.27
Vit 1.00

Tomabaem is a substitute for the System's Character Palette, at least for people focusing on the so-called CJKV languages (Chinese, Japanese, Korean, and Vietnamese). Tomabaem, like Unicode, is cross-language. Whatever you are looking for related to Chinese characters, there's a high chance that Tomabaem has a way of looking it up, whether it's the Cantonese pronunciation, the UTF-16 codepoint, the radical, the meaning, or the character itself, which you can copy/paste or drag'n'drop from another document. It uses UniHan.txt file from the Unicode Consortium as the basis of the data shown.

Download Website Updated 30 Dec 2008 xlit

Screenshot
Pop 66.61
Vit 5.33

Xlit converts text from one writing system into another. It allows the user to define a transliteration simply by typing the input strings in one window and the strings to which they are to be mapped in another. Transliteration may be restricted to regions bounded by specified delimiters or their complements. Transliteration may also be performed by external commands or plugins. Xlit can also convert one type of delimiter to another, e.g. from HZ escapes to XML. Xlit can read and write transliteration definitions in its own format and as Yudit keymaps. It can be run in batch mode without the GUI.

Download Website Updated 31 May 2005 IPA-CXS/X-Sampa Converter

Screenshot
Pop 41.05
Vit 1.42

IPA-CXS/X-Sampa Converter is a selection of modules for various programming languages (C, Perl, Lisp, and Python) for translating between IPA (International Phonetic Alphabet) and ASCII versions, in particular CXS, which is a close relative to X-Sampa. The project homepage contains a demo for using the Perl script as an online converter.

Download Website Updated 22 Apr 2007 Computational Linguistics Toolset

Screenshot
Pop 50.65
Vit 2.97

The Computational Linguistics Toolset is a set of tools for computational linguistics. It contains re-usable code for cleaning, splitting, refining, and taking samples from corpora (ICE, Penn, and a native one), for tagging them using the TnT-tagger, for doing permutation statistics on N-grams (useful for finding statistically significant syntactical differences between any two sets of tagged texts), and various examination-tools. The tools themselves are well documented.

No download Website Updated 05 Jul 2005 Pure PHP Spell Check

Screenshot
Pop 21.63
Vit 1.00

Pure PHP Spell Check performs spell-checking of text using only base PHP functions, without using specific spell check PHP extensions such as aspell or pspell. The class uses a dictionary that is implemented as an array-based binary search table. The binary search table declaration is saved to a file for speed and can be updated easily by the developer.

Download Website Updated 06 Dec 2011 tlgu

Screenshot
Pop 19.80
Vit 4.24

tlgu is a utility for converting an input file in Thesaurus Linguae Graeca (TLG) or Packard Humanities Institute (PHI) representation (beta code text and citation information) into Unicode (UTF-8). A companion GNU/Linux Hellenic Polytonic HOWTO may also be found in the tlgu site.

Download Website Updated 21 Aug 2005 Transolution

Screenshot
Pop 51.26
Vit 2.25

Transolution is a Computer Aided Translation (CAT) suite supporting the XLIFF standard. It provides the open source community with features and concepts that have been used by commercial offerings for years to improve translation efficiency and quality. The suite is modular to make it flexible and provides an XLIFF Editor, translation memory engine and filters to convert different formats to and from XLIFF. The use of XLIFF means that almost any content can be localized as long as there is a filter for it (XML, SGML, PO, RTF, StarOffice/OpenOffice, etc.).

No download Website Updated 27 Jul 2005 NL Stego

Screenshot
Pop 18.65
Vit 56.82

NL Stego is a system for text generation and text-based steganography. It combines Markov Models of several orders to generate random text resembling a given training text (or text corpus). It can also embed secret messages into pseudo-random generated text.

Screenshot

Project Spotlight

CT-gui/CT-synth/CT-farfisa

A GUI toolkit for Linux and Android.

Screenshot

Project Spotlight

fcron

A command scheduler for non-permanently-running systems.