193 projects tagged "Linguistic"

Download No website Updated 06 May 2014 yawl

Screenshot
Pop 106.16
Vit 3.50

This is a comprehensive "word game" word list for UNIX/Linux. It is a superset of the author's ENABLE list, the "OSW", and various lists researched by the author's colleague, Alan Beale. At 264,093 words, it is the largest list of its kind, suitable for use in all manners of crossword-type board games and word construction games, as well as for a spell checker dictionary. The YAWL package now includes two anagramming utilities (supplied as source code, handled by the included Makefile). There is also a shell script that extends the UNIX "strings" system command. This is the word list package recommended for the author's Quackey word game.

Download Website Updated 02 Feb 2003 Old-Russian Ispell Set

Screenshot
Pop 24.00
Vit 2.86

Old-Russian Ispell is a superset of A.I. Lebedev's rus-ispell package that enables spellchecking of texts in old Russian orthography (pre-1918). The KOI8-C encoding is used for representing old Russian characters.

Download Website Updated 24 Mar 2001 SyNTeX - Syntactic tree drawing program

Screenshot
Pop 49.36
Vit 69.38

SyNTeX is a LaTeX preprocessor that draws syntactic trees using the LaTeX picture environment. The preprocessor reads the comments in a LaTeX file and draws the tree based on commands that it finds in the comments.

Download Website Updated 22 Jul 2002 Linguaphile

Screenshot
Pop 65.45
Vit 1.49

Linguaphile is a simple command line language translator. It is open source, platform independent, and programmed in Perl. Linguaphile currently supports the following languages: Afrikaans, Alawa, Albanian, Arrernte, Basque, Belarusian, Bulgarian, Catalan, Croatian, Czech, Danish, Dutch, English, Esperanto, Estonian, Finnish, French, Galician, German, Greek, Hawaiian, Hungarian, Icelandic, Indonesian, Interlingua, Irish, Italian, Kala Lagaw Ya, Korean, Kriol, Latvian, Lithuanian, Malay, Maltese, Maori, Norwegian, Pitjantjatjara, Polish, Portuguese, Romanian, Russian, Samoan, Serbian, Slovak, Slovenian, Spanish, Swahili, Swedish, Thai, Tok Pisin, Turkish, Ukrainian, Warlpiri, and Welsh. The Spanish to English translation is the most useful at this stage.

Download Website Updated 05 May 2011 Faroese Spell Checking Dictionary

Screenshot
Pop 24.19
Vit 10.97

This Faroese spell checking dictionary is intended to be used with programs like aspell and ispell.

Download Website Updated 04 Feb 2007 Japana

Screenshot
Pop 62.77
Vit 4.03

Japana is a small HTTP proxy written in Perl. It converts Japanese characters (Hiragana, Katakana, and Kanji) into ASCII (Romaji) on the fly. The translation is done with the kakasi library (an older version without the need for kakasi still exists).

Download Website Updated 05 Dec 2001 Saxogram Vocabulary List Builder

Screenshot
Pop 18.97
Vit 2.04

Saxogram is a script which generates vocabulary lists from texts written in foreign languages. So far it has dictionaries for Latin, German, and Italian. Saxogram was written with extensibility in mind. Thus, other languages can be "plugged in" as modules. All that is needed is a dictionary and a small amount of code. The purpose of the program is to speed up language learning. Too much time is spent looking up every third word. This isn't practical when learning a new language. Generating a vocabulary list that can be used in parallel to ones reading is helpful.

Download Website Updated 06 Mar 2008 Mguesser

Screenshot
Pop 30.13
Vit 2.50

Mguesser is a tool to guess a text's character set and language. It is a standalone part of the mnoGoSearch engine. More than 100 various character set and language combinations are supported.

Download Website Updated 01 Dec 2001 euc2html

Screenshot
Pop 18.17
Vit 1.44

euc2html is a simple application to convert any double-byte Japanese (and maybe Chinese/Korean) EUC-encoded characters to HTML/4.0 Unicode entities. It operates using stdin/stdout only, so is useful for batch updating Web sites, content, etc.

No download Website Updated 13 May 2014 Emdros

Screenshot
Pop 362.32
Vit 137.96

Emdros is a corpus query system for storing and searching linguistically annotated text. It is very generic, supporting almost any kind of annotation from almost any linguistic theory. All linguistic levels of analysis are supported, including phonology, morphology, the lexical level, syntax, and discourse. The core libraries act as a middleware layer between a client and an underlying SQL database. MySQL, PostgreSQL, and SQLite (2 and 3) are supported.

Screenshot

Project Spotlight

CT-gui/CT-synth/CT-farfisa

A GUI toolkit for Linux and Android.

Screenshot

Project Spotlight

fcron

A command scheduler for non-permanently-running systems.