193 projects tagged "Linguistic"

OmegaT (updated 05 Nov 2007; download available)

Pop 28.90
Vit 1.07

OmegaT is a translation memory application intended for professional translators. It does not translate for you (software that does this is called "machine translation"). It features fuzzy matching, match propagation, simultaneous processing of multiple-file projects, simultaneous use of multiple translation memories, and external glossaries. Document file formats include plain text, HTML, and OpenOffice.org/StarOffice. It has Unicode (UTF-8) support (can be used with non-Latin alphabets). It is compatible with other translation memory applications (TMX Level 1).
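
As a rough illustration of what fuzzy matching against a translation memory means, the sketch below scores a new segment against stored source segments and returns the close ones. It is not OmegaT's algorithm, which this listing does not describe: the similarity measure (Python's `difflib.SequenceMatcher`), the `fuzzy_matches` function, the 0.7 threshold, and the sample memory are all assumptions made for the example.

```python
from difflib import SequenceMatcher


def fuzzy_matches(segment, memory, threshold=0.7):
    """Return (score, source, target) tuples for stored segments similar to `segment`.

    `memory` maps source-language segments to their stored translations.
    The threshold is an arbitrary illustrative value, not an OmegaT setting.
    """
    results = []
    for source, target in memory.items():
        # ratio() is a similarity score between 0.0 and 1.0
        score = SequenceMatcher(None, segment.lower(), source.lower()).ratio()
        if score >= threshold:
            results.append((score, source, target))
    # Best match first
    return sorted(results, key=lambda r: r[0], reverse=True)


# Hypothetical two-entry translation memory (English to French)
memory = {
    "Open the file.": "Ouvrez le fichier.",
    "Close the window.": "Fermez la fenêtre.",
}

matches = fuzzy_matches("Open the files.", memory)
for score, source, target in matches:
    print(f"{score:.2f}  {source}  ->  {target}")
```

The translator would then adapt the suggested translation rather than start from scratch, which is the point of a translation memory as opposed to machine translation.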

OmegaT+ (updated 15 May 2010; download available)

Pop 53.39
Vit 4.55

OmegaT+ is a Computer-Assisted Translation (CAT) tools platform. It includes a translation processor with translation memory and project support, a bitext aligner, and a TMX validator. It also has various other tools for processing documents for translation.

Open Translation Engine (updated 22 Dec 2008; no download)

Pop 67.86
Vit 5.47

Open Translation Engine (OTE) is a Web-based system to enable community management of translation dictionaries.

OpenEphyra (updated 07 Oct 2011; download available)

Pop 47.43
Vit 2.67

OpenEphyra is a question answering (QA) system. It retrieves answers to natural language questions from the Web and other sources. OpenEphyra comes with implementations of algorithms that proved effective in Carnegie Mellon's Ephyra system, which participated in the TREC evaluations. It is platform independent and can be set up in just a few minutes. The goal of this project is to give researchers the opportunity to develop new QA techniques without worrying about the end-to-end system.

PAiN (updated 08 Jun 2004; no download)

Pop 16.00
Vit 1.03

PAiN is a new MUD codebase written in Java. It provides a general purpose persistence engine (PAiN DB) and the ability to do dynamic code reloading.

Poliqarp (updated 02 Aug 2012; no download)

Pop 55.43
Vit 7.64

Poliqarp is a universal suite of utilities for processing large corpora. It includes a concordancer that works on binary corpora compiled for efficient searching and a corpus builder. It supports positional tagsets, ambiguities in the texts, and Unicode.

PolyGen (updated 03 Feb 2004; download available)

Pop 27.20
Vit 61.38

PolyGen is a program for generating random sentences according to a grammar definition, that is, following custom syntactic and lexical rules. Formally, it is an interpreter for a language that is itself designed to define languages, where interpreting means executing a source program in real time and then outputting its result. Here, the source program is a grammar definition. Execution consists of exploring the grammar by selecting a random path through it, and the result is the sentence built along the way.
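
The idea of exploring a grammar along a random path can be sketched in a few lines. This is only an illustration of the general technique, not PolyGen's grammar syntax or implementation: the toy `GRAMMAR` dictionary, the `generate` function, and the seed are all made up for the example.

```python
import random

# Hypothetical toy grammar: each nonterminal maps to a list of alternative
# productions; any symbol that is not a key is treated as a terminal word.
GRAMMAR = {
    "S": [["NP", "VP"]],
    "NP": [["the", "N"], ["a", "N"]],
    "VP": [["V", "NP"], ["V"]],
    "N": [["cat"], ["dog"], ["linguist"]],
    "V": [["sees"], ["sleeps"]],
}


def generate(symbol, grammar, rng):
    """Expand `symbol` by picking one production at random at each step."""
    if symbol not in grammar:
        return [symbol]  # terminal: emit the word itself
    production = rng.choice(grammar[symbol])
    words = []
    for part in production:
        words.extend(generate(part, grammar, rng))
    return words


# A seeded generator makes the random path reproducible between runs
sentence = " ".join(generate("S", GRAMMAR, random.Random(0)))
print(sentence)
```

Each call picks one alternative per nonterminal, so different seeds walk different paths through the same grammar and yield different sentences.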

PottyMouth (updated 07 Nov 2007; download available)

Pop 30.69
Vit 1.00

PottyMouth transforms completely unstructured and untrusted text to valid, nice-looking, completely safe XHTML. PottyMouth is designed to handle input text from non-technical, potentially careless, or malicious users. It produces HTML that is completely safe, programmatically and visually, to include on any Web page. You don't need to make your users read any instructions before they start typing. They don't even need to know that PottyMouth is being used.

Pure PHP Spell Check (updated 05 Jul 2005; no download)

Pop 21.63
Vit 1.00

Pure PHP Spell Check performs spell-checking of text using only base PHP functions, without using specific spell check PHP extensions such as aspell or pspell. The class uses a dictionary that is implemented as an array-based binary search table. The binary search table declaration is saved to a file for speed and can be updated easily by the developer.
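
A dictionary held as a sorted array and queried by binary search might look like the sketch below. It is written in Python rather than PHP and is not the class's actual code: `build_table`, `is_known`, `misspelled`, and the three-word dictionary are hypothetical names and data chosen for the example.

```python
from bisect import bisect_left


def build_table(words):
    """Build the lookup table: a sorted, de-duplicated list of lowercase words."""
    return sorted({w.lower() for w in words})


def is_known(word, table):
    """Binary-search the sorted table for `word` (case-insensitive)."""
    w = word.lower()
    i = bisect_left(table, w)  # leftmost position where w could be inserted
    return i < len(table) and table[i] == w


def misspelled(text, table):
    """Return the words in `text` that are not found in the table."""
    # Strip common trailing/leading punctuation before lookup
    words = (token.strip(".,;:!?") for token in text.split())
    return [w for w in words if w and not is_known(w, table)]


table = build_table(["the", "cat", "sat"])
print(misspelled("The cat szat.", table))
```

Because the table is sorted once up front, each lookup takes logarithmic time in the dictionary size, which is presumably why the sorted table is saved to a file instead of being rebuilt on every request.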

PyBabelPhish (updated 22 Jan 2004; download available)

Pop 37.79
Vit 1.80

PyBabelPhish is a GTK-based program providing fast translations from one natural language to another. Texts translated to Spanish can be read aloud in Spanish through optional text-to-speech support.
