193 projects tagged "Linguistic"

Download Website Updated 28 Feb 2012 Hspell

Screenshot
Pop 66.81
Vit 7.46

Hspell is a Hebrew linguistic project. It features a Hebrew spell-checker, and aims to use the databases and algorithms developed as a morphology engine (for example, for search engines), and in the future for advanced things like Hebrew speech synthesis.

Download Website Updated 19 Sep 2004 HumAn Language GENerator

Screenshot
Pop 60.08
Vit 1.51

HALoGEN is an extremely powerful and easy to use general-purpose natural language generation system. It consists of a symbolic generator, a forest ranker, and some sample inputs. The symbolic generator includes the Sensus Ontology dictionary based on WordNet. The forest ranker includes a 250 million word ngram language model (unigram, bigram, and trigram) trained on the Wall Street Journal newspaper text. The symbolic generator is written in LISP and requires a Lisp interpreter.

No download Website Updated 10 Sep 2005 I18N

Screenshot
Pop 12.21
Vit 56.42

I18N is a class that gets translation texts from flat files or from an SQL database. The system supports variables in translated strings and has a conversion facility to move data from one container to another. An included tool checks programs against sets of translated strings to detect references without strings or unused strings. Each call checks that referenced variables exist.

Download Website Updated 17 Feb 2006 IPA Zounds

Screenshot
Pop 25.14
Vit 2.63

IPA Zounds models language sound changes by applying a given set of sound change rules to a given lexicon. It has a built-in model of the International Phonetic Alphabet, allowing users to write input words in IPA characters and rules using those characters or the distinctive features of the model.

Download Website Updated 31 May 2005 IPA-CXS/X-Sampa Converter

Screenshot
Pop 41.05
Vit 1.42

IPA-CXS/X-Sampa Converter is a selection of modules for various programming languages (C, Perl, Lisp, and Python) for translating between IPA (International Phonetic Alphabet) and ASCII versions, in particular CXS, which is a close relative to X-Sampa. The project homepage contains a demo for using the Perl script as an online converter.

Download Website Updated 05 Oct 2005 ISCII Utilities

Screenshot
Pop 23.43
Vit 1.81

ISCII Utilities is two programs for analyzing text files encoded according to the Indian Script Code for Information Interchange (ISCII), the Indian national standard. IsciiName identifies each code, printing the byte offset, the code in hex, and an explanation of the meaning of the code. ATR codes for writing system transition and display mode are interpreted. CountIsciiChars counts the codes in an ISCII file and classifies them according to their type and function. The original purpose was computing accurate letter counts for reading studies, but this information is also useful when processing ISCII-encoded text.

Download Website Updated 24 Apr 2014 International Components for Unicode (C/C++)

Screenshot
Pop 463.84
Vit 69.24

ICU provides a Unicode implementation, with functions for formatting numbers, dates, times, and currencies (according to locale conventions, transliteration, and parsing text in those formats). It provides flexible patterns for formatting messages, where the pattern determines the order of the variable parts of the messages, and the format for each of those variables. These patterns can be stored in resource files for translation to different languages. Included are more than 100 codepage converters for interaction with non-unicode systems.

Download Website Updated 15 Aug 2006 JBootCat

Screenshot
Pop 18.55
Vit 1.00

JBootCat is an implemention of the BootCat scripts for acquiring corpora from the Internet, which is of interest to linguists and translators. The main goal is to encapsulate the BootCat functionality within a user-friendly desktop application.

Download Website Updated 23 Aug 2007 JDing

Screenshot
Pop 12.41
Vit 49.71

JDing is a clone of the Unix translation tool "Ding". It has been ported to Java to make it platform independent. Ding dictionaries can be used. JDing is a simple but powerful dictionary.

Download Website Updated 21 Mar 2013 JOrtho

Screenshot
Pop 71.25
Vit 8.06

JOrtho is a spell checker for Java. The library works with any JTextComponent from the Swing framework and checks as you type. The dictionary is based on the free Wiktionary.org, and is applicable for multiple languages. You can select the spell checking language via a context menu. The Features of JOrtho are the highlighting of potentially wrongly spelled words, a context menu with suggestions for correct forms of the word, and a context menu with option to change the checking language. At the moment there are nine languages for spell checking available: English, German, French, Spanish, Italian, Russian, Polish, Dutch, and Arabic.

Screenshot

Project Spotlight

CT-gui/CT-synth/CT-farfisa

A GUI toolkit for Linux and Android.

Screenshot

Project Spotlight

fcron

A command scheduler for non-permanently-running systems.