RSS 18 projects tagged "Linguistic"

Download Website Updated 30 Jan 2001 Ciao Prolog

Screenshot
Pop 117.19
Vit 1.00

Ciao is a complete Prolog system subsuming ISO-Prolog with a novel modular design which allows both restricting and extending the language. Ciao extensions currently include feature terms (records), higher-order, functions, constraints, objects, persistent predicates, a good base for distributed execution (agents), and concurrency. Libraries also support WWW programming, sockets, and external interfaces (C, Java, TCL/Tk, relational databases, etc.). An Emacs-based environment, a stand-alone compiler, and a toplevel shell are also provided.

Download Website Updated 17 Feb 2001 Cryptic Muse

Screenshot
Pop 43.82
Vit 1.75

Cryptic Muse is a library for performing fast word pattern searches including anagrams, wildcards, overlap, containment, and masks. It provides searches for entry files like dictionaries.

Download Website Updated 05 Dec 2001 Grok

Screenshot
Pop 52.50
Vit 2.38

Grok is a library of Java components for performing various natural language tasks. These include several preprocessing tasks, chart parsing, a large categorial grammar for English (induced from the Penn treebank), and some knowledge representation components (basic coreference, salience tracking, etc.). The library also has a companion kit which provides a GUI interface to the components, several of which are implementations of interfaces in the Quipu OpenNLP API.

Download Website Updated 04 Apr 2003 The Quipu Maximum Entropy Package

Screenshot
Pop 42.57
Vit 2.53

Maximum entropy is a powerful method for constructing statistical models of classification tasks, such as part-of-speech tagging in Natural Language Processing. The Quipu Maximum Entropy Package is a Java implementation of the maximum entropy framework. It allows you to train, evaluate, and use maxent models.

Download Website Updated 05 Dec 2001 The Quipu OpenNLP API

Screenshot
Pop 41.35
Vit 2.38

The Quipu OpenNLP API is a preliminary collection of Java interfaces for standardizing how natural language processing components interact.

No download Website Updated 14 Apr 2004 Name::Find Perl Module

Screenshot
Pop 23.41
Vit 2.24

Name::Find Perl Module is a Perl module for finding names in a text string. It doesn't look for a particular name, but variations of names in the form: Honorific, GivenName1, GivenName2, Surname, Suffix (where some parts may not exist, and GivenName1 and 2 may be initials). It uses a dictionary based approach, so names not in the dictionary will not be found. There is a separate dictionary for each of the word positions in the name, so you don't have to have a list of every possible name combination.

Download Website Updated 19 Aug 2002 FramerD

Screenshot
Pop 51.30
Vit 1.77

FramerD is a semi-structured object database integrated with a Scheme-based scripting language which supports multi-lingual programming (with pervasive Unicode), a stable module system for programming in the large, distributed applications (via an extensible RPC protocol), non-deterministic (PROLOG-like) evaluation for search and set operations, multi-threaded program execution, extensive tools for text and language analysis, built-in HTML/XML/MIME parsers, and intuitive (CGI- and FastCGI-based) Web scripting. The built-in object database robustly supports millions of objects and indexed access to those objects, both through disk files and networked servers.

Download Website Updated 11 Apr 2007 spell-uk

Screenshot
Pop 27.93
Vit 2.94

spell-uk is a Ukrainian dictionary for aspell, myspell, and ispell.

Download Website Updated 25 Oct 2002 SILGraphite

Screenshot
Pop 42.33
Vit 62.14

SILGraphite (formerly OpenGraphite) is a project within SIL's Non-Roman Script Initiative and Language Software Development groups to provide extensible cross-platform rendering capabilities for complex non-Roman writing systems. It consists of a rule-based programming language, Graphite Description Language (GDL), that can be used to describe the behavior of a writing system, a compiler for that language, and a rendering engine that can serve as the backend of a text processing application. SILGraphite renders TrueType fonts that have been extended by means of compiling a GDL program. It is currently being integrated into Gecko/Mozilla through the SILA project, a GNU/Linux port is also underway, and there are plans for OpenOffice.org and Abiword integration.

Download Website Updated 14 Sep 2005 Dacco

Screenshot
Pop 29.24
Vit 2.75

Dacco is a collaborative English-Catalan, Catalan-English dictionary project. It seeks to provide an up-to-date, comprehensive, bilingual dictionary that will be of benefit to learners of both languages. The dictionaries are downloadable and customizable (using XSLT) and contain audio files.

Screenshot

Project Spotlight

FrontAccounting

A user-friendly, Web-based accounting system.

Screenshot

Project Spotlight

Genson

A Java library for doing conversion between JSON and Java.