RSS 193 projects tagged "Linguistic"

No download Website Updated 05 Jul 2005 Pure PHP Spell Check

Screenshot
Pop 20.10
Vit 1.00

Pure PHP Spell Check performs spell-checking of text using only base PHP functions, without using specific spell check PHP extensions such as aspell or pspell. The class uses a dictionary that is implemented as an array-based binary search table. The binary search table declaration is saved to a file for speed and can be updated easily by the developer.

Download Website Updated 12 Aug 2005 Konjugator

Screenshot
Pop 11.45
Vit 1.00

Konjugator helps with learning or interpreting verb forms in Welsh. It produces a list of around 200,000 inflected verb forms for almost 4,000 Welsh verbs, along with English glosses and parsing information. It attempts to conjugate Welsh verbs that are unknown to it, and will give parsing details for random Welsh verb forms if these are known to it.

Download Website Updated 01 Oct 2005 text2phonome

Screenshot
Pop 28.90
Vit 1.00

text2phonome is a class can be used to convert English text to the respective phoneme representation. It provides options for delimiting sentences, words, and phonemes themseleves for further processing.

No download Website Updated 29 Nov 2005 isobel

Screenshot
Pop 24.82
Vit 1.00

Isobel is a framework to build complex information retrieval and analysis systems. Isobel can be functionally divided in two subsytems, Isobel Gatherer (the crawling and filtering subsystem) and Isobel Analyzer (the analysis subsystem). The two subsytems can also be used separately. Isobel Gatherer offers ready-to-use services like content fetching, scheduling, document format conversion, Hyperlink graph storage and analysis, content storage and indexing. A programmer may easily add new services. Isobel Analyzer uses the IBM UIMA architecture to reuse the analysis components developed for this architecture.

Download Website Updated 27 Dec 2005 Gentium fonts

Screenshot
Pop 37.42
Vit 1.00

Gentium is a typeface family designed to enable the diverse ethnic groups around the world who use the Latin script to produce readable, high-quality publications. It supports a wide range of Latin-based alphabets, and includes glyphs that correspond to all the Latin ranges of Unicode.

No download Website Updated 26 Jan 2006 Verticrawl Seek Site Search

Screenshot
Pop 19.60
Vit 1.00

Verticrawl Seek Site Search is a search engine technology for making powerful, fast, and customizable search solutions. It features parsing of multiple document formats, an admin interface, compatibility with sitemaps, and a search interface for HTML, XML, and PHP.

Download Website Updated 06 Apr 2006 XChestival

Screenshot
Pop 17.66
Vit 1.00

XChestival is an improved version of xchat_speak designed for Italian. It lets xchat and irssi "speak" through festival. It comes with a script for xchat and irssi and the Italian phonemes. The scripts have some useful features like channels and query filtering and string substitution.

Download Website Updated 21 May 2006 generictionary

Screenshot
Pop 27.75
Vit 1.00

GenericTionary is a multilingual dictionary software which makes it possible to generate dictionaries from flat files, to import the dictionaries which are generated with itself, and to export them back to flat files. It can handle multiple dictionaries.

Download Website Updated 15 Aug 2006 JBootCat

Screenshot
Pop 18.71
Vit 1.00

JBootCat is an implemention of the BootCat scripts for acquiring corpora from the Internet, which is of interest to linguists and translators. The main goal is to encapsulate the BootCat functionality within a user-friendly desktop application.

No download Website Updated 27 Aug 2006 SenseClusters

Screenshot
Pop 17.44
Vit 1.00

SenseClusters is a natural language processing package that allows you to cluster similar contexts or to identify clusters of related words. It supports its own native methods based on first and second order representations of context, and also supports Latent Semantic Analysis. It is fully unsupervised, and can automatically discover the optimal number of clusters in your text. SenseClusters is a complete system that takes users from preprocessing of raw text to providing clustered output.

Screenshot

Project Spotlight

CuteMarkEd

A MarkDown editor with live HTML previews.

Screenshot

Project Spotlight

Performance Co-Pilot

performance monitoring toolkit and API