RSS 11 projects tagged "Linguistic"

Download Website Updated 05 Dec 2001 The Quipu OpenNLP API

Screenshot
Pop 34.28
Vit 2.37

The Quipu OpenNLP API is a preliminary collection of Java interfaces for standardizing how natural language processing components interact.

Download Website Updated 19 Aug 2002 FramerD

Screenshot
Pop 46.39
Vit 1.77

FramerD is a semi-structured object database integrated with a Scheme-based scripting language which supports multi-lingual programming (with pervasive Unicode), a stable module system for programming in the large, distributed applications (via an extensible RPC protocol), non-deterministic (PROLOG-like) evaluation for search and set operations, multi-threaded program execution, extensive tools for text and language analysis, built-in HTML/XML/MIME parsers, and intuitive (CGI- and FastCGI-based) Web scripting. The built-in object database robustly supports millions of objects and indexed access to those objects, both through disk files and networked servers.

Download Website Updated 28 Nov 2004 GCC Introspector

Screenshot
Pop 85.28
Vit 1.96

The GCC XML Tree Node Introspector project consists of a patch to the gcc compiler to output the internal compiler tree nodes in RDF/XML and programs to process that RDF/XML. The tree nodes are complex data structures which represent the source code inside the compiler. Through these tree nodes, users are able to extract information from their programs that would be otherwise very difficult to obtain. Modules exist to store these nodes in Redland RDF using a Berkley database. The long-term goal of the project is create a high-level API that will make the programmatic manipulation of programs easier than it is now.

Download Website Updated 25 Oct 2002 SILGraphite

Screenshot
Pop 40.00
Vit 64.76

SILGraphite (formerly OpenGraphite) is a project within SIL's Non-Roman Script Initiative and Language Software Development groups to provide extensible cross-platform rendering capabilities for complex non-Roman writing systems. It consists of a rule-based programming language, Graphite Description Language (GDL), that can be used to describe the behavior of a writing system, a compiler for that language, and a rendering engine that can serve as the backend of a text processing application. SILGraphite renders TrueType fonts that have been extended by means of compiling a GDL program. It is currently being integrated into Gecko/Mozilla through the SILA project, a GNU/Linux port is also underway, and there are plans for OpenOffice.org and Abiword integration.

Download Website Updated 14 Sep 2005 Dacco

Screenshot
Pop 25.38
Vit 2.72

Dacco is a collaborative English-Catalan, Catalan-English dictionary project. It seeks to provide an up-to-date, comprehensive, bilingual dictionary that will be of benefit to learners of both languages. The dictionaries are downloadable and customizable (using XSLT) and contain audio files.

Download Website Updated 13 Oct 2006 Uplug

Screenshot
Pop 26.15
Vit 1.07

Uplug is a collection of tools for linguistic corpus processing, word alignment, and term extraction from parallel corpora. Several tools have been integrated in Uplug. Pre-processing tools include a sentence splitter, tokenizer, and external part-of-speech tagger and shallow parsers. The following external tools are used: the Grok system for English (tagging and chunking) and the morphological analyzer ChaSen for Japanese. Other tools such as the TreeTagger can easily be added. Translated documents can be sentence aligned using the length-based approach by Gale & Church. Words and phrases can be aligned using the clue alignment approach and the toolbox for training statistical alignment models GIZA++.

No download Website Updated 17 Sep 2005 Sikher

Screenshot
Pop 13.64
Vit 56.00

Sikher is a desktop program designed to archive, search, and display the Sikh scriptures using advanced functions. It allows the common person to understand and read the messages contained in the Sikh scriptures through translations and transliterations in different languages, thereby breaking the language and geographical barrier between Gurbani (Sikh Scriptures) and the world. Sikher is a robust, future proof, and cross-platform application which may be used by developers to create similar internationalized and localized search applications.

Download Website Updated 18 May 2006 File2XLIFF4j

Screenshot
Pop 16.73
Vit 1.77

File2XLIFF4j is a modular implementation of a tool that converts files to and from the OASIS standard XLIFF (XML Localization Interchange File Format).

Download Website Updated 18 Oct 2008 Cypher

Screenshot
Pop 27.93
Vit 3.50

Cypher is an AI program that generates the RDF graph and SPARQL query representations of plain language input, allowing users to speak plain language to update and query databases. With robust definition languages, Cypher's grammar and lexicon can quickly and easily be extended to process highly complex sentences and phrases of any natural language, and can cover any vocabulary. Equipped with Cypher, programmers can begin building next generation semantic Web applications that harness natural language.

No download Website Updated 14 Apr 2008 bitext2tmx

Screenshot
Pop 17.38
Vit 1.00

Bitext2tmx is a cross-platform Java application to align bitext (of a corresponding original text and its translation) and generate a TMX translation memory for use in computer-assisted translation.

Screenshot

Project Spotlight

Mantle Business Artifacts

Business artifacts including data model and service library for common business processes.

Screenshot

Project Spotlight

Xtables-addons

Additional Netfilter/iptables modules.