193 projects tagged "Linguistic"

Download Website Updated 07 Nov 2007 PottyMouth

Screenshot
Pop 30.69
Vit 1.00

PottyMouth transforms completely unstructured and untrusted text to valid, nice-looking, completely safe XHTML. PottyMouth is designed to handle input text from non-technical, potentially careless, or malicious users. It produces HTML that is completely safe, programmatically and visually, to include on any Web page. You don't need to make your users read any instructions before they start typing. They don't even need to know that PottyMouth is being used.

Download Website Updated 21 Mar 2013 JOrtho

Screenshot
Pop 71.25
Vit 8.06

JOrtho is a spell checker for Java. The library works with any JTextComponent from the Swing framework and checks as you type. The dictionary is based on the free Wiktionary.org, and is applicable for multiple languages. You can select the spell checking language via a context menu. The Features of JOrtho are the highlighting of potentially wrongly spelled words, a context menu with suggestions for correct forms of the word, and a context menu with option to change the checking language. At the moment there are nine languages for spell checking available: English, German, French, Spanish, Italian, Russian, Polish, Dutch, and Arabic.

No download No website Updated 20 Feb 2011 Linguas OS

Screenshot
Pop 23.92
Vit 1.42

Linguas OS is a Linux live CD that includes OpenOffice.org, Omega T, Evince, and other basic tools that translators use on a daily basis for translation work. It includes Internet browsing and e-mail software.

Download Website Updated 24 Jul 2009 Unicode Data Browser

Screenshot
Pop 36.88
Vit 2.81

UnicodeDataBrowser is a browser for the UnicodeData.txt file, which contains much useful information but is not easily read by humans. It creates a scrollable table in which columns represent properties. The table may be sorted on any column. Abbreviations are expanded and characters cross-referenced in decomposition and casing fields are named. Regular expression search restricted to a selected column is available. The set of characters for which information is displayed may be restricted to those characters matching a regular expression on a specified property.

Download Website Updated 07 Oct 2011 OpenEphyra

Screenshot
Pop 47.43
Vit 2.67

OpenEphyra is a question answering (QA) system. It retrieves answers to natural language questions from the Web and other sources. OpenEphyra comes with implementations of algorithms that proved effective in Carnegie Mellon's Ephyra system, which participated in the TREC evaluations. It is platform independent and can be set up in just a few minutes. The goal of this project is to give researchers the opportunity to develop new QA techniques without worrying about the end-to-end system.

Download Website Updated 18 Oct 2008 Cypher

Screenshot
Pop 27.02
Vit 3.50

Cypher is an AI program that generates the RDF graph and SPARQL query representations of plain language input, allowing users to speak plain language to update and query databases. With robust definition languages, Cypher's grammar and lexicon can quickly and easily be extended to process highly complex sentences and phrases of any natural language, and can cover any vocabulary. Equipped with Cypher, programmers can begin building next generation semantic Web applications that harness natural language.

No download Website Updated 29 Feb 2008 Grammar Browser

Screenshot
Pop 35.50
Vit 1.00

Grammar Browser provides a simple-to-use graphical interface to the grammatical structure and relations of any text, as parsed by the Stanford Parser. It contains a grammatical relation editor to modify, import, and export grammatical relation definitions (tregex patterns and features).

Download Website Updated 11 Mar 2008 Algraeph

Screenshot
Pop 30.69
Vit 1.00

Algraeph is a tool for manual alignment of linguistic graphs, such as phrase structure trees or dependency structures, where each node corresponds to a subsequence of the analyzed input sentence. It allows you to express the similarity between two graphs by aligning their nodes and attaching relation labels to these alignments. Graphs are read from one or more graphbanks (or treebanks) in the GraphML or Alpino formats. Alignment relations are user-defined and are stored in a simple XML format, which can be used for further processing. The resulting parallel graph corpus is a useful data set for many tasks in computational linguistics and natural language processing.

Download Website Updated 17 Mar 2008 Free Logic Form

Screenshot
Pop 11.05
Vit 1.00

Free Logic Form is a system for generating logic forms of English sentences. . It works by postprocessing the output of the Enju predicate parser.

No download Website Updated 14 Apr 2008 bitext2tmx

Screenshot
Pop 16.31
Vit 1.00

Bitext2tmx is a cross-platform Java application to align bitext (of a corresponding original text and its translation) and generate a TMX translation memory for use in computer-assisted translation.

Screenshot

Project Spotlight

CT-gui/CT-synth/CT-farfisa

A GUI toolkit for Linux and Android.

Screenshot

Project Spotlight

fcron

A command scheduler for non-permanently-running systems.