193 projects tagged "Linguistic"

No download Website Updated 29 Nov 2005 isobel

Screenshot
Pop 24.17
Vit 1.00

Isobel is a framework to build complex information retrieval and analysis systems. Isobel can be functionally divided in two subsytems, Isobel Gatherer (the crawling and filtering subsystem) and Isobel Analyzer (the analysis subsystem). The two subsytems can also be used separately. Isobel Gatherer offers ready-to-use services like content fetching, scheduling, document format conversion, Hyperlink graph storage and analysis, content storage and indexing. A programmer may easily add new services. Isobel Analyzer uses the IBM UIMA architecture to reuse the analysis components developed for this architecture.

Download Website Updated 25 Jun 2008 libucd

Screenshot
Pop 37.39
Vit 3.16

libucd is a C library interface to the Unicode Character Database, which contains properties of all the Unicode characters.

Download Website Updated 02 Jan 2006 Full Text for SQLite3

Screenshot
Pop 31.42
Vit 1.42

Full Text for SQLite3 is a full text indexer for data stored into a sqlite3 database. The text fields stored in the database can be split word-by-word and stored in a form suitable for lookup.

Download Website Updated 27 Dec 2005 Gentium fonts

Screenshot
Pop 38.47
Vit 1.00

Gentium is a typeface family designed to enable the diverse ethnic groups around the world who use the Latin script to produce readable, high-quality publications. It supports a wide range of Latin-based alphabets, and includes glyphs that correspond to all the Latin ranges of Unicode.

Download Website Updated 18 May 2006 File2XLIFF4j

Screenshot
Pop 16.85
Vit 1.77

File2XLIFF4j is a modular implementation of a tool that converts files to and from the OASIS standard XLIFF (XML Localization Interchange File Format).

Download Website Updated 10 Mar 2008 spell-norwegian

Screenshot
Pop 15.56
Vit 3.08

spell-norwegian provides spell checking and thesaurus services for both Norwegian Bokmål and Norwegian Nynorsk for ispell, aspell, and myspell. This project was previously called ispell-norsk and norwegian.

Download Website Updated 15 May 2010 OmegaT+

Screenshot
Pop 53.39
Vit 4.55

OmegaT+ is a Computer-Assisted Translation (CAT) tools platform. It includes a translation processor with translation memory and projects support, a bitext aligner, and a TMX validator. It has various other tools to process documents for translation.

No download Website Updated 26 Jan 2006 Verticrawl Seek Site Search

Screenshot
Pop 20.64
Vit 1.00

Verticrawl Seek Site Search is a search engine technology for making powerful, fast, and customizable search solutions. It features parsing of multiple document formats, an admin interface, compatibility with sitemaps, and a search interface for HTML, XML, and PHP.

No download Website Updated 02 Aug 2012 Poliqarp

Screenshot
Pop 55.43
Vit 7.64

Poliqarp is a universal suite of utilities for processing large corpora. It includes a concordancer that works on binary corpora compiled for efficient searching and a corpus builder. It supports positional tagsets, ambiguities in the texts, and Unicode.

Download Website Updated 06 Apr 2006 XChestival

Screenshot
Pop 17.20
Vit 1.00

XChestival is an improved version of xchat_speak designed for Italian. It lets xchat and irssi "speak" through festival. It comes with a script for xchat and irssi and the Italian phonemes. The scripts have some useful features like channels and query filtering and string substitution.

Screenshot

Project Spotlight

CT-gui/CT-synth/CT-farfisa

A GUI toolkit for Linux and Android.

Screenshot

Project Spotlight

fcron

A command scheduler for non-permanently-running systems.