RSS 28 projects tagged "Linguistic"

Download Website Updated 24 Mar 2001 SyNTeX - Syntactic tree drawing program

Screenshot
Pop 59.80
Vit 66.66

SyNTeX is a LaTeX preprocessor that draws syntactic trees using the LaTeX picture environment. The preprocessor reads the comments in a LaTeX file and draws the tree based on commands that it finds in the comments.

No download Website Updated 16 May 2009 Stance

Screenshot
Pop 15.43
Vit 38.34

Stance is a script that generates random sentences in Dutch, which can be used by teachers to create translation exercises for students of this language. In its finished version, it should be able to generate only gramatically correct sentences.

Download Website Updated 28 Feb 2012 Hspell

Screenshot
Pop 81.49
Vit 9.60

Hspell is a Hebrew linguistic project. It features a Hebrew spell-checker, and aims to use the databases and algorithms developed as a morphology engine (for example, for search engines), and in the future for advanced things like Hebrew speech synthesis.

No download Website Updated 05 May 2012 Text-Tokenizer

Screenshot
Pop 76.12
Vit 9.50

Text-Tokenizer is Perl module based on the flex generated lexical analyzer that can be used for parsing of text (configuration) files. With this module, a simple full-featured configuration parser can be written very easily.

Download Website Updated 22 Oct 2007 Diogenes

Screenshot
Pop 115.41
Vit 4.92

Diogenes is a tool for searching and browsing the Latin and ancient Greek texts published on CD-ROM by the Packard Humanities Institute and the Thesaurus Linguae Graecae. It comes as an easy-to-install stand-alone application for GNU/Linux, Mac OS X, and Windows, based on the Firefox browser (i.e. Xulrunner). Alternatively, it can be installed by a network administrator as a server on a local network, and users then access it via an ordinary Web browser. There is also a command-line tool which can optionally format output as LaTeX instead of HTML.

Download Website Updated 04 Feb 2007 Japana

Screenshot
Pop 56.77
Vit 4.17

Japana is a small HTTP proxy written in Perl. It converts Japanese characters (Hiragana, Katakana, and Kanji) into ASCII (Romaji) on the fly. The translation is done with the kakasi library (an older version without the need for kakasi still exists).

Download Website Updated 13 Mar 2009 WordNet-Similarity

Screenshot
Pop 30.50
Vit 3.47

WordNet-Similarity is a collection of Perl modules for the WordNet system. They are designed as object classes with methods that take two word senses as input and return the semantic relatedness of these word senses.

Download Website Updated 20 Nov 2009 po for anything

Screenshot
Pop 45.43
Vit 3.41

The goal of po4a (po for anything) is to ease the creation and maintenance of translations using gettext tools on areas where they were not expected, like documentation.

Download Website Updated 22 Apr 2007 Computational Linguistics Toolset

Screenshot
Pop 62.09
Vit 3.02

The Computational Linguistics Toolset is a set of tools for computational linguistics. It contains re-usable code for cleaning, splitting, refining, and taking samples from corpora (ICE, Penn, and a native one), for tagging them using the TnT-tagger, for doing permutation statistics on N-grams (useful for finding statistically significant syntactical differences between any two sets of tagged texts), and various examination-tools. The tools themselves are well documented.

Download Website Updated 11 Mar 2009 Lingua::PT::Nums2Words

Screenshot
Pop 20.62
Vit 2.80

Lingua::PT::Nums2Words is a module for Perl that computes Portuguese (Brazilian) verbage from integer numerical values.

Screenshot

Project Spotlight

OpenDocMan

A Web-based document management system.

Screenshot

Project Spotlight

STK/Unit

Unit tests for MariaDB and MySQL.