193 projects tagged "Linguistic"

Download Website Updated 16 Jul 2003 TKLTrans

Screenshot
Pop 11.92
Vit 1.00

TKLTrans is a dictionary program to help translate to/from English, German, or Hungarian. All combinations of these three languages are supported.

Download Website Updated 27 May 2007 Tamil Converters

Screenshot
Pop 20.25
Vit 2.14

Tamil Converters is a collection of programs for converting among a variety of encodings and transliterations of Tamil, including: Unicode, ISCII, TSCII, ITRANS, the International Phonetic Alphabet, the Koln, Penn, and Colloquial Tamil romanizations, ISO-15919 transliteration, and Unicode character names enclosed in angle brackets (as in POSIX locale source files).

Download No website Updated 05 Jan 2007 Tamil Karuvi

Screenshot
Pop 9.80
Vit 1.00

Tamizh Karuvi is an English to Tamil transliteration tool. It can be used to provide Unicode UTF-8 input to programs within GNOME/GTK+ environments. The software provides a nice GUI and a command line program. The encoding table used is from the standard JaffnaLibrary.

No download Website Updated 05 May 2012 Text-Tokenizer

Screenshot
Pop 65.31
Vit 7.54

Text-Tokenizer is Perl module based on the flex generated lexical analyzer that can be used for parsing of text (configuration) files. With this module, a simple full-featured configuration parser can be written very easily.

Download Website Updated 10 Jul 2007 TextSearch

Screenshot
Pop 28.02
Vit 1.00

TextSearch is a program to search through a set of text files in a directory structure. Each document is searched using a regular expression and an overview of the results is shown as a tree structure. By clicking on a file, it can be viewed, with matches being highlighted. As opposed to other programs out there, its focus is not so much on statistics, i.e. how often a word would occur in an entire corpus of files, but rather on occurrences in single files.

Download Website Updated 04 Apr 2003 The Quipu Maximum Entropy Package

Screenshot
Pop 38.26
Vit 2.50

Maximum entropy is a powerful method for constructing statistical models of classification tasks, such as part-of-speech tagging in Natural Language Processing. The Quipu Maximum Entropy Package is a Java implementation of the maximum entropy framework. It allows you to train, evaluate, and use maxent models.

Download Website Updated 05 Dec 2001 The Quipu OpenNLP API

Screenshot
Pop 36.67
Vit 2.37

The Quipu OpenNLP API is a preliminary collection of Java interfaces for standardizing how natural language processing components interact.

Download Website Updated 12 May 2005 Tomabaem

Screenshot
Pop 13.27
Vit 1.00

Tomabaem is a substitute for the System's Character Palette, at least for people focusing on the so-called CJKV languages (Chinese, Japanese, Korean, and Vietnamese). Tomabaem, like Unicode, is cross-language. Whatever you are looking for related to Chinese characters, there's a high chance that Tomabaem has a way of looking it up, whether it's the Cantonese pronunciation, the UTF-16 codepoint, the radical, the meaning, or the character itself, which you can copy/paste or drag'n'drop from another document. It uses UniHan.txt file from the Unicode Consortium as the basis of the data shown.

Download Website Updated 21 Aug 2005 Transolution

Screenshot
Pop 51.26
Vit 2.25

Transolution is a Computer Aided Translation (CAT) suite supporting the XLIFF standard. It provides the open source community with features and concepts that have been used by commercial offerings for years to improve translation efficiency and quality. The suite is modular to make it flexible and provides an XLIFF Editor, translation memory engine and filters to convert different formats to and from XLIFF. The use of XLIFF means that almost any content can be localized as long as there is a filter for it (XML, SGML, PO, RTF, StarOffice/OpenOffice, etc.).

Download Website Updated 16 Jul 2005 Transtalo

Screenshot
Pop 17.18
Vit 3.71

Transtalo is an automatic translator. It consists of a library interface and modules for source and destination languages (called input and output modules). These modules communicate to each other through sentence files in an XML format.

Screenshot

Project Spotlight

CT-gui/CT-synth/CT-farfisa

A GUI toolkit for Linux and Android.

Screenshot

Project Spotlight

fcron

A command scheduler for non-permanently-running systems.