193 projects tagged "Linguistic"

Download Website Updated 06 Jun 2009 libalinga

Screenshot
Pop 26.46
Vit 1.52

libalinga is a C++ implementation of a multi-stream codec for the ALingA (Aligned Linguistic Annotation) format. It makes use of libogg++. Each ALingA stream holds at least one stream of annotation data, which is in the LingA format. It may also interleave the signal stream(s) against which the LingA streams are aligned, or it may simply reference such streams. It also provides metadata about the underlying manifold for the signals and the annotations. The metadata is ordered for runtime parsing of the number and type of signal and LingA codecs to enable decoding of the multiple logical streams in one pass.

No download Website Updated 28 Jul 2008 transtoba2

Screenshot
Pop 11.31
Vit 2.46

transtoba2 facilitates the transliteration or transcription of a word or text from the Roman script into the Toba Batak script. Transliterating from the Roman into the Batak script is not an easy undertaking, as the Batak script has a number of peculiarities that complicates the process of transliteration. This program uses a set of algorithms which enables the user to effortlessly transliterate from the Roman to the Toba Batak script.

Download Website Updated 31 Aug 2009 Apertium

Screenshot
Pop 34.42
Vit 47.02

Apertium is a machine translation platform, initially aimed at related-language pairs, but recently expanded to deal with more divergent language pairs (such as English-Catalan). The platform provides a language-independent machine translation engine, tools to manage the linguistic data necessary to build a machine translation system for a given language pair, and linguistic data for a growing number of language pairs.

No download Website Updated 14 Apr 2008 bitext2tmx

Screenshot
Pop 16.31
Vit 1.00

Bitext2tmx is a cross-platform Java application to align bitext (of a corresponding original text and its translation) and generate a TMX translation memory for use in computer-assisted translation.

Download Website Updated 17 Mar 2008 Free Logic Form

Screenshot
Pop 11.05
Vit 1.00

Free Logic Form is a system for generating logic forms of English sentences. . It works by postprocessing the output of the Enju predicate parser.

Download Website Updated 11 Mar 2008 Algraeph

Screenshot
Pop 30.69
Vit 1.00

Algraeph is a tool for manual alignment of linguistic graphs, such as phrase structure trees or dependency structures, where each node corresponds to a subsequence of the analyzed input sentence. It allows you to express the similarity between two graphs by aligning their nodes and attaching relation labels to these alignments. Graphs are read from one or more graphbanks (or treebanks) in the GraphML or Alpino formats. Alignment relations are user-defined and are stored in a simple XML format, which can be used for further processing. The resulting parallel graph corpus is a useful data set for many tasks in computational linguistics and natural language processing.

No download Website Updated 29 Feb 2008 Grammar Browser

Screenshot
Pop 35.50
Vit 1.00

Grammar Browser provides a simple-to-use graphical interface to the grammatical structure and relations of any text, as parsed by the Stanford Parser. It contains a grammatical relation editor to modify, import, and export grammatical relation definitions (tregex patterns and features).

Download Website Updated 18 Oct 2008 Cypher

Screenshot
Pop 27.02
Vit 3.50

Cypher is an AI program that generates the RDF graph and SPARQL query representations of plain language input, allowing users to speak plain language to update and query databases. With robust definition languages, Cypher's grammar and lexicon can quickly and easily be extended to process highly complex sentences and phrases of any natural language, and can cover any vocabulary. Equipped with Cypher, programmers can begin building next generation semantic Web applications that harness natural language.

Download Website Updated 07 Oct 2011 OpenEphyra

Screenshot
Pop 47.43
Vit 2.67

OpenEphyra is a question answering (QA) system. It retrieves answers to natural language questions from the Web and other sources. OpenEphyra comes with implementations of algorithms that proved effective in Carnegie Mellon's Ephyra system, which participated in the TREC evaluations. It is platform independent and can be set up in just a few minutes. The goal of this project is to give researchers the opportunity to develop new QA techniques without worrying about the end-to-end system.

Download Website Updated 24 Jul 2009 Unicode Data Browser

Screenshot
Pop 36.88
Vit 2.81

UnicodeDataBrowser is a browser for the UnicodeData.txt file, which contains much useful information but is not easily read by humans. It creates a scrollable table in which columns represent properties. The table may be sorted on any column. Abbreviations are expanded and characters cross-referenced in decomposition and casing fields are named. Regular expression search restricted to a selected column is available. The set of characters for which information is displayed may be restricted to those characters matching a regular expression on a specified property.

Screenshot

Project Spotlight

CT-gui/CT-synth/CT-farfisa

A GUI toolkit for Linux and Android.

Screenshot

Project Spotlight

fcron

A command scheduler for non-permanently-running systems.