RSS 48 projects tagged "Linguistic"

No download Website Updated 23 Feb 2000 Mueller English-Russian Dictionary Kit

Screenshot
Pop 29.29
Vit 71.93

This is the first respectable English-Russian Dictionary with transcription (IPA) under GNU GPL. The dictionary has 46233 word articles and is 5.5MB. "MOVA" is a set of Bash and Tcl/TK scripts for dictionary management under UNIX.

Download Website Updated 30 Jan 2001 ssct

Screenshot
Pop 17.32
Vit 1.00

ssct is a command-line utility, humble of intent, that takes a single word, spell checks it, takes the result(s) and then translates them. It works to/from english only. From/to languages are limited by ispell in the first instance, and by the IDP (Internet Dictionary Project) files in the second. Currently the latter includes Spanish, Portuguese (minimal), Latin, German, French and Italian. These files are included with this package. This utility was originally created to make it easier to decode badly-scrawled postcards from Spain.

Download Website Updated 13 May 2008 yawl

Screenshot
Pop 84.99
Vit 3.52

This is a comprehensive "word game" word list for UNIX/Linux. It is a superset of the author's ENABLE list, the "OSW", and various lists researched by the author's colleague, Alan Beale. At 264,093 words, it is the largest list of its kind, suitable for use in all manners of crossword-type board games and word construction games, as well as for a spell checker dictionary. The YAWL package now includes two anagramming utilities (supplied as source code, handled by the included Makefile). There is also a shell script that extends the UNIX "strings" system command. This is the word list package recommended for the author's Quackey word game.

Download Website Updated 04 Jul 2011 Emdros

Screenshot
Pop 155.52
Vit 16.78

Emdros is a corpus query system for storing and searching linguistically annotated text. It is very generic, supporting almost any kind of annotation from almost any linguistic theory. All linguistic levels of analysis are supported, including phonology, morphology, the lexical level, syntax, and discourse. The core libraries act as a middleware layer between a client and an underlying SQL database. MySQL, PostgreSQL, and SQLite are supported.

Download Website Updated 19 Aug 2002 FramerD

Screenshot
Pop 47.33
Vit 1.77

FramerD is a semi-structured object database integrated with a Scheme-based scripting language which supports multi-lingual programming (with pervasive Unicode), a stable module system for programming in the large, distributed applications (via an extensible RPC protocol), non-deterministic (PROLOG-like) evaluation for search and set operations, multi-threaded program execution, extensive tools for text and language analysis, built-in HTML/XML/MIME parsers, and intuitive (CGI- and FastCGI-based) Web scripting. The built-in object database robustly supports millions of objects and indexed access to those objects, both through disk files and networked servers.

Download Website Updated 28 Nov 2004 GCC Introspector

Screenshot
Pop 85.14
Vit 1.96

The GCC XML Tree Node Introspector project consists of a patch to the gcc compiler to output the internal compiler tree nodes in RDF/XML and programs to process that RDF/XML. The tree nodes are complex data structures which represent the source code inside the compiler. Through these tree nodes, users are able to extract information from their programs that would be otherwise very difficult to obtain. Modules exist to store these nodes in Redland RDF using a Berkley database. The long-term goal of the project is create a high-level API that will make the programmatic manipulation of programs easier than it is now.

Download Website Updated 08 Mar 2006 kanjidrill

Screenshot
Pop 70.65
Vit 3.40

kdrill helps people learn Japanese 'Kanji' characters. Its includes a multiple-choice Kanji quiz program that helps people learn Japanese characters with different guess formats and history options. It also has a suite of dictionary lookup functions. Words can be found using a variety of methods including Romaji, SKIP, four-corner, cut-n-paste, radical lookup, and English search.

Download Website Updated 26 Dec 2004 QuickTranslate Online

Screenshot
Pop 42.43
Vit 2.50

QuickTranslate is a tool for translating text in 16 languages using the Systranbox service, which is used by Google, Altavista, Lycos, Terra, AOL, and others.

No download Website Updated 22 Oct 2002 trans (DE-EN)

Screenshot
Pop 11.45
Vit 1.00

trans is a small bidirectional dictionary lookup tool for Linux PDAs (Zaurus, IPaq). It uses grep to lookup words in a gzipped dictionary file and uses opie-sh to show the results. It comes with a German English wordlist that contains around 116000 words and phrases.

No download Website Updated 25 Sep 2006 Connexor Machinese

Screenshot
Pop 30.35
Vit 3.90

Connexor Machinese analyzers process sequences of written words, identify and classify the various entities in them, and show how these relate to each other, marking the language with a simple and systematic notation. Currently, the Machinese product family includes: Machinese Phrase Tagger, a fast, light-weight morphosyntactic tagger; Machinese Syntax, a full-scale dependency parser; Machinese Semantics, a dependency parser with semantic analysis; and Machinese Metadata, an entity extractor.

Screenshot

Project Spotlight

Mroonga

A fast full-text search engine for MySQL.

Screenshot

Project Spotlight

Razer device configuration tool

A Razer device configuration tool.