193 projects tagged "Linguistic"

No download Website Updated 21 Aug 2005 French Verb Conjugation Rules

Pop 16.06
Vit 56.59

French Verb Conjugation Rules establishes a concise and accurate set of computer-readable French verb conjugation rules. The rules are stored in an xBase database for easy access. The project is aimed at language students, developers of computer-assisted language learning software, and computational linguists.

Download Website Updated 13 Oct 2006 Uplug

Pop 26.72
Vit 1.07

Uplug is a collection of tools for linguistic corpus processing, word alignment, and term extraction from parallel corpora. Several tools have been integrated into Uplug. Pre-processing tools include a sentence splitter, a tokenizer, and external part-of-speech taggers and shallow parsers. The following external tools are used: the Grok system for English (tagging and chunking) and the morphological analyzer ChaSen for Japanese. Other tools, such as the TreeTagger, can easily be added. Translated documents can be sentence-aligned using the length-based approach of Gale & Church. Words and phrases can be aligned using the clue alignment approach and GIZA++, a toolbox for training statistical alignment models.

Download Website Updated 12 Aug 2005 Konjugator

Pop 11.45
Vit 1.00

Konjugator helps with learning or interpreting verb forms in Welsh. It produces a list of around 200,000 inflected verb forms for almost 4,000 Welsh verbs, along with English glosses and parsing information. It attempts to conjugate Welsh verbs that are unknown to it, and gives parsing details for any Welsh verb form that is known to it.

No download Website Updated 27 Jul 2005 NL Stego

Pop 18.65
Vit 56.82

NL Stego is a system for text generation and text-based steganography. It combines Markov models of several orders to generate random text resembling a given training text (or text corpus). It can also embed secret messages into the pseudo-randomly generated text.
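As an illustration of generating text from Markov models of several orders, here is a minimal Python sketch. It is not NL Stego's code: the backoff strategy (try the longest context first, then shorter ones) and the names `build_models` and `generate` are assumptions made for this example, and the listing does not say how NL Stego actually combines its models.

```python
import random
from collections import defaultdict

def build_models(words, max_order):
    """Build one follower table per order: context tuple -> words seen after it."""
    models = {}
    for order in range(1, max_order + 1):
        model = defaultdict(list)
        for i in range(len(words) - order):
            model[tuple(words[i:i + order])].append(words[i + order])
        models[order] = model
    return models

def generate(words, max_order=2, length=12, seed=0):
    """Generate up to `length` words, backing off from long to short contexts.

    The backoff scheme is an assumption for illustration only.
    """
    rng = random.Random(seed)
    models = build_models(words, max_order)
    output = [rng.choice(words)]
    while len(output) < length:
        for order in range(min(max_order, len(output)), 0, -1):
            followers = models[order].get(tuple(output[-order:]))
            if followers:
                output.append(rng.choice(followers))
                break
        else:
            break  # no context matched: the last word only occurs at the corpus end
    return output

if __name__ == "__main__":
    corpus = "the cat sat on the mat and the cat ran".split()
    print(" ".join(generate(corpus, max_order=2, length=12, seed=1)))
```

Because every appended word followed its context somewhere in the training text, each adjacent word pair in the output also occurs in the corpus, which is what makes the result resemble the source.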

Download Website Updated 21 Aug 2005 Transolution

Pop 51.26
Vit 2.25

Transolution is a Computer-Aided Translation (CAT) suite supporting the XLIFF standard. It provides the open source community with features and concepts that commercial offerings have used for years to improve translation efficiency and quality. The suite is modular for flexibility and provides an XLIFF editor, a translation memory engine, and filters to convert different formats to and from XLIFF. The use of XLIFF means that almost any content can be localized as long as there is a filter for it (XML, SGML, PO, RTF, StarOffice/OpenOffice, etc.).

Download Website Updated 06 Dec 2011 tlgu

Pop 19.80
Vit 4.24

tlgu is a utility for converting an input file in Thesaurus Linguae Graecae (TLG) or Packard Humanities Institute (PHI) representation (beta code text and citation information) into Unicode (UTF-8). A companion GNU/Linux Hellenic Polytonic HOWTO can also be found on the tlgu site.

No download Website Updated 05 Jul 2005 Pure PHP Spell Check

Pop 21.63
Vit 1.00

Pure PHP Spell Check performs spell-checking of text using only base PHP functions, without using specific spell check PHP extensions such as aspell or pspell. The class uses a dictionary that is implemented as an array-based binary search table. The binary search table declaration is saved to a file for speed and can be updated easily by the developer.
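The idea of an array-based binary search table can be sketched as follows. The project itself is written in PHP; this Python version only illustrates the lookup technique, and the function names (`build_table`, `is_known`, `misspelled`) and the punctuation handling are hypothetical, not taken from the class.

```python
from bisect import bisect_left

def build_table(words):
    """Return a sorted, de-duplicated, lowercased word list for binary search."""
    return sorted({w.lower() for w in words})

def is_known(table, word):
    """Binary-search the sorted table for `word` (case-insensitive)."""
    word = word.lower()
    i = bisect_left(table, word)
    return i < len(table) and table[i] == word

def misspelled(table, text):
    """Return the tokens of `text` that are not in the table."""
    tokens = [w.strip(".,;:!?") for w in text.split()]
    return [t for t in tokens if t and not is_known(table, t)]

if __name__ == "__main__":
    table = build_table(["Hello", "world", "spell"])
    print(misspelled(table, "hello wrold, spell"))
```

Keeping the table sorted ahead of time is what makes each lookup logarithmic in the dictionary size, which is presumably why the class saves the table declaration to a file instead of rebuilding it on every request.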

Download Website Updated 22 Apr 2007 Computational Linguistics Toolset

Pop 50.65
Vit 2.97

The Computational Linguistics Toolset is a set of tools for computational linguistics. It contains reusable code for cleaning, splitting, refining, and sampling corpora (ICE, Penn, and a native format), for tagging them with the TnT tagger, and for running permutation statistics on N-grams (useful for finding statistically significant syntactic differences between any two sets of tagged texts), as well as various examination tools. The tools themselves are well documented.

Download Website Updated 31 May 2005 IPA-CXS/X-Sampa Converter

Pop 41.05
Vit 1.42

IPA-CXS/X-Sampa Converter is a selection of modules for various programming languages (C, Perl, Lisp, and Python) for translating between IPA (International Phonetic Alphabet) and ASCII versions of it, in particular CXS, which is a close relative of X-Sampa. The project homepage contains a demo that uses the Perl script as an online converter.
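A table-driven conversion of this kind might look like the sketch below. It is not the project's Python module: the table holds only a handful of standard X-SAMPA symbols (CXS differs in some details that are not covered here), and the names `XSAMPA_TO_IPA` and `to_ipa` are made up for this example.

```python
# A small subset of standard X-SAMPA symbols; a real table is much larger.
XSAMPA_TO_IPA = {
    "S": "ʃ",
    "Z": "ʒ",
    "T": "θ",
    "D": "ð",
    "N": "ŋ",
    "I": "ɪ",
    "@": "ə",
    "r\\": "ɹ",  # two-character symbol: r followed by a backslash
    ":": "ː",
}

def to_ipa(text, table=XSAMPA_TO_IPA):
    """Convert ASCII notation to IPA, preferring the longest matching symbol."""
    keys = sorted(table, key=len, reverse=True)
    out = []
    i = 0
    while i < len(text):
        for key in keys:
            if text.startswith(key, i):
                out.append(table[key])
                i += len(key)
                break
        else:
            out.append(text[i])  # unmapped characters pass through unchanged
            i += 1
    return "".join(out)

if __name__ == "__main__":
    print(to_ipa("TIN"))  # the word "thing"
```

Trying longer symbols first matters because some ASCII symbols are prefixes of others, for example `r` and `r\`.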

Download Website Updated 30 Dec 2008 xlit

Pop 66.61
Vit 5.33

Xlit converts text from one writing system into another. It allows the user to define a transliteration simply by typing the input strings in one window and the strings to which they are to be mapped in another. Transliteration may be restricted to regions bounded by specified delimiters or their complements. Transliteration may also be performed by external commands or plugins. Xlit can also convert one type of delimiter to another, e.g. from HZ escapes to XML. Xlit can read and write transliteration definitions in its own format and as Yudit keymaps. It can be run in batch mode without the GUI.
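The mapping-plus-delimiters idea can be sketched in a few lines of Python. This is an illustration under assumptions, not Xlit's implementation: the function names, the longest-match rule, and the sample Latin-to-Cyrillic mapping are all hypothetical.

```python
import re

def transliterate(text, mapping):
    """Replace every input string in `mapping` with its output string."""
    if not mapping:
        return text
    # Longer input strings go first so "sh" wins over "s".
    keys = sorted(mapping, key=len, reverse=True)
    pattern = re.compile("|".join(re.escape(k) for k in keys))
    return pattern.sub(lambda m: mapping[m.group(0)], text)

def transliterate_regions(text, mapping, start, end):
    """Transliterate only the text between `start` and `end` delimiters."""
    region = re.compile(re.escape(start) + "(.*?)" + re.escape(end), re.DOTALL)
    return region.sub(
        lambda m: start + transliterate(m.group(1), mapping) + end, text
    )

if __name__ == "__main__":
    mapping = {"sh": "ш", "s": "с", "a": "а"}
    print(transliterate("sasha", mapping))
    print(transliterate_regions("sa <sa> sa", mapping, "<", ">"))
```

Handling the complement of the delimited regions, which the description also mentions, would apply the same mapping to the text outside the matches instead.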
