RSS 144 projects tagged "Linguistic"

Download Website Updated 10 Mar 2008 spell-norwegian

Screenshot
Pop 16.55
Vit 3.08

spell-norwegian provides spell checking and thesaurus services for both Norwegian Bokmål and Norwegian Nynorsk for ispell, aspell, and myspell. This project was previously called ispell-norsk and norwegian.

Download Website Updated 18 May 2006 File2XLIFF4j

Screenshot
Pop 16.73
Vit 1.77

File2XLIFF4j is a modular implementation of a tool that converts files to and from the OASIS standard XLIFF (XML Localization Interchange File Format).

Download Website Updated 27 Dec 2005 Gentium fonts

Screenshot
Pop 37.55
Vit 1.00

Gentium is a typeface family designed to enable the diverse ethnic groups around the world who use the Latin script to produce readable, high-quality publications. It supports a wide range of Latin-based alphabets, and includes glyphs that correspond to all the Latin ranges of Unicode.

Download Website Updated 02 Jan 2006 Full Text for SQLite3

Screenshot
Pop 30.17
Vit 1.42

Full Text for SQLite3 is a full text indexer for data stored into a sqlite3 database. The text fields stored in the database can be split word-by-word and stored in a form suitable for lookup.

Download Website Updated 25 Jun 2008 libucd

Screenshot
Pop 37.71
Vit 3.17

libucd is a C library interface to the Unicode Character Database, which contains properties of all the Unicode characters.

No download Website Updated 29 Nov 2005 isobel

Screenshot
Pop 24.41
Vit 1.00

Isobel is a framework to build complex information retrieval and analysis systems. Isobel can be functionally divided in two subsytems, Isobel Gatherer (the crawling and filtering subsystem) and Isobel Analyzer (the analysis subsystem). The two subsytems can also be used separately. Isobel Gatherer offers ready-to-use services like content fetching, scheduling, document format conversion, Hyperlink graph storage and analysis, content storage and indexing. A programmer may easily add new services. Isobel Analyzer uses the IBM UIMA architecture to reuse the analysis components developed for this architecture.

Download Website Updated 12 Dec 2008 WordGenerator

Screenshot
Pop 62.33
Vit 3.96

WordGenerator generates hypothetical words from specifications of their syllable structure. The user specifies the maximum length of the words in syllables, the abstract structure of syllables in the language (in terms of such units as consonants and vowels or onsets and rhymes), and the actual sounds that comprise each abstract class (e.g. the list of vowels in the language); WordGenerator then generates the words that conform to this specification. Such lists are useful to field linguists exploring the vocabulary of a language, and to designers of artificial languages.

Download Website Updated 18 Sep 2010 Linguistic Tree Constructor

Screenshot
Pop 65.36
Vit 5.52

Linguistic Tree Constructor is an application for drawing linguistic syntax trees. Its main strength is assisting in data production by quickly analyzing large amounts of text. "Generic" trees are supported, as well as RRG and X-Bar trees. Node-categories are user-definable, and additional user-definable labels can also be applied to each node. Publication-quality, high-resolution, horizontal trees can be drawn. The file format is based on TIGER-XML.

Download Website Updated 30 Jul 2007 prbeditor

Screenshot
Pop 53.07
Vit 4.19

prbeditor is an editor for Java property resource bundle files. The application's intent is to help in the localization (l10n) of those programs that have been internationalized with Java's standard i18n mechanism. In contrast to other similar tools, it shows the keys and values of several languages at the same time in a spreadsheet, giving a global view of the resource files. The tool relies on the application of regular expresions to organize the keys and filter the visibility of the files. It includes a spell checker for several languages, based on word lists which may be downloaded separately.

No download Website Updated 19 Jan 2009 Unicode.php

Screenshot
Pop 17.55
Vit 3.11

The CentralNic Unicode Library (Unicode.php) provides some PHP classes for manipulating Unicode data. These classes are general purpose, but are intended for use when working with Internationalised Domain Names (IDNs).

Screenshot

Project Spotlight

Wing IDE

An IDE for Python.

Screenshot

Project Spotlight

Teddy Templating Engine

An easy-to-read, HTML-based, mostly logic-less DOM templating engine.