193 projects tagged "Linguistic"

Download Website Updated 10 Jun 2008 SCAN

Screenshot
Pop 53.60
Vit 2.59

SCAN is a personal information retrieval framework, combining search, text analysis, tagging, and metadata functions for document collections management. SCAN is a component-based software using a number of plugins for specific features. The basic SCAN platform can be easily extended with plugins for different document formats and document location types.

Download Website Updated 15 May 2010 OmegaT+

Screenshot
Pop 53.39
Vit 4.55

OmegaT+ is a Computer-Assisted Translation (CAT) tools platform. It includes a translation processor with translation memory and projects support, a bitext aligner, and a TMX validator. It has various other tools to process documents for translation.

Download Website Updated 18 Aug 2008 An Gramadóir

Screenshot
Pop 52.39
Vit 2.67

An Gramadóir is a grammar checking engine that is designed for the rapid development of grammar checkers for minority languages and other languages with limited computational resources. Rule specifications are given according to a simple syntax combining XML and regular expressions. Part-of-speech tagging can be learned from text corpora using statistical methods. It is currently implemented for Irish (Gaeilge).

Download Website Updated 15 Mar 2005 Ellogon

Screenshot
Pop 51.70
Vit 1.82

Ellogon is a multi-lingual, cross-platform, general-purpose language engineering environment, developed in order to aid both researchers who are doing research in computational linguistics, as well as companies who produce and deliver language engineering systems. As a language engineering platform, it offers an extensive set of facilities, including tools for processing and visualising textual/HTML/XML data and associated linguistic information, support for lexical resources (like creating and embedding lexicons), tools for creating annotated corpora, accessing databases, comparing annotated data, or transforming linguistic information into vectors for use with various machine learning algorithms.

Download Website Updated 21 Aug 2005 Transolution

Screenshot
Pop 51.26
Vit 2.25

Transolution is a Computer Aided Translation (CAT) suite supporting the XLIFF standard. It provides the open source community with features and concepts that have been used by commercial offerings for years to improve translation efficiency and quality. The suite is modular to make it flexible and provides an XLIFF Editor, translation memory engine and filters to convert different formats to and from XLIFF. The use of XLIFF means that almost any content can be localized as long as there is a filter for it (XML, SGML, PO, RTF, StarOffice/OpenOffice, etc.).

Download Website Updated 22 Apr 2007 Computational Linguistics Toolset

Screenshot
Pop 50.65
Vit 2.97

The Computational Linguistics Toolset is a set of tools for computational linguistics. It contains re-usable code for cleaning, splitting, refining, and taking samples from corpora (ICE, Penn, and a native one), for tagging them using the TnT-tagger, for doing permutation statistics on N-grams (useful for finding statistically significant syntactical differences between any two sets of tagged texts), and various examination-tools. The tools themselves are well documented.

Download Website Updated 24 Mar 2001 SyNTeX - Syntactic tree drawing program

Screenshot
Pop 49.36
Vit 69.38

SyNTeX is a LaTeX preprocessor that draws syntactic trees using the LaTeX picture environment. The preprocessor reads the comments in a LaTeX file and draws the tree based on commands that it finds in the comments.

Download Website Updated 19 Aug 2002 FramerD

Screenshot
Pop 49.32
Vit 1.77

FramerD is a semi-structured object database integrated with a Scheme-based scripting language which supports multi-lingual programming (with pervasive Unicode), a stable module system for programming in the large, distributed applications (via an extensible RPC protocol), non-deterministic (PROLOG-like) evaluation for search and set operations, multi-threaded program execution, extensive tools for text and language analysis, built-in HTML/XML/MIME parsers, and intuitive (CGI- and FastCGI-based) Web scripting. The built-in object database robustly supports millions of objects and indexed access to those objects, both through disk files and networked servers.

Download Website Updated 07 Oct 2011 OpenEphyra

Screenshot
Pop 47.43
Vit 2.67

OpenEphyra is a question answering (QA) system. It retrieves answers to natural language questions from the Web and other sources. OpenEphyra comes with implementations of algorithms that proved effective in Carnegie Mellon's Ephyra system, which participated in the TREC evaluations. It is platform independent and can be set up in just a few minutes. The goal of this project is to give researchers the opportunity to develop new QA techniques without worrying about the end-to-end system.

Download Website Updated 14 May 2005 WorldPrint

Screenshot
Pop 47.25
Vit 4.84

WorldPrint is a filter for Mozilla (Galeon, etc.), Htmldoc, and Netscape PostScript output that uses TrueType fonts to allow the printing of pages written in Unicode, Big5, SJIS, KOI-8, ISO-8859*, and other charsets.

Screenshot

Project Spotlight

CT-gui/CT-synth/CT-farfisa

A GUI toolkit for Linux and Android.

Screenshot

Project Spotlight

fcron

A command scheduler for non-permanently-running systems.