RSS 16 projects tagged "Linguistic"

Download Website Updated 04 Jul 2011 Emdros

Screenshot
Pop 155.95
Vit 16.77

Emdros is a corpus query system for storing and searching linguistically annotated text. It is very generic, supporting almost any kind of annotation from almost any linguistic theory. All linguistic levels of analysis are supported, including phonology, morphology, the lexical level, syntax, and discourse. The core libraries act as a middleware layer between a client and an underlying SQL database. MySQL, PostgreSQL, and SQLite are supported.

No download No website Updated 21 Feb 2005 Zoe Intertwingle

Screenshot
Pop 162.27
Vit 7.12

Zoe is a Web based email client with a built in SMTP and POP3 server and Google-like search functionality that lives on your desktop. It is written in Java and uses Lucene technology to provided instant searching and threading of your email messages.

Download Website Updated 26 Mar 2006 dbacl

Screenshot
Pop 174.65
Vit 4.91

dbacl is a digramic Bayesian text classifier. Given some text, it calculates the posterior probabilities that the input resembles one of any number of previously learned document collections. It can be used to sort incoming email into arbitrary categories such as spam, work, and play, or simply to distinguish an English text from a French text. It fully supports international character sets, and uses sophisticated statistical models based on the Maximum Entropy Principle.

Download Website Updated 19 Sep 2004 HumAn Language GENerator

Screenshot
Pop 59.45
Vit 1.51

HALoGEN is an extremely powerful and easy to use general-purpose natural language generation system. It consists of a symbolic generator, a forest ranker, and some sample inputs. The symbolic generator includes the Sensus Ontology dictionary based on WordNet. The forest ranker includes a 250 million word ngram language model (unigram, bigram, and trigram) trained on the Wall Street Journal newspaper text. The symbolic generator is written in LISP and requires a Lisp interpreter.

Download Website Updated 06 Apr 2010 Glossword

Screenshot
Pop 136.61
Vit 7.23

Glossword is a system to publish dictionaries, glossaries, and encyclopedias. It features an installation wizard, support for multiple languages, visual themes, multi-domain installation, an administrative interface with multi-user support, built-in search and cache engines, the ability to export/import dictionaries in XML format, and W3C-validated code. Glossword is useful for any sort of dictionary-like content, including sites with game cheat codes, online translators, references, and various kinds of CMS solutions.

Download Website Updated 15 Mar 2005 Ellogon

Screenshot
Pop 52.65
Vit 1.82

Ellogon is a multi-lingual, cross-platform, general-purpose language engineering environment, developed in order to aid both researchers who are doing research in computational linguistics, as well as companies who produce and deliver language engineering systems. As a language engineering platform, it offers an extensive set of facilities, including tools for processing and visualising textual/HTML/XML data and associated linguistic information, support for lexical resources (like creating and embedding lexicons), tools for creating annotated corpora, accessing databases, comparing annotated data, or transforming linguistic information into vectors for use with various machine learning algorithms.

Download Website Updated 06 Oct 2004 Dowser

Screenshot
Pop 48.93
Vit 2.24

Dowser is a Web research and archiving tool that clusters results from search engines, associates words that appear in previous searches, and keeps a local cache of all the results you click on in a searchable database along with summaries and links to related information. It helps you to keep track of what you find, with no advertising.

Download Website Updated 09 Nov 2004 freli

Screenshot
Pop 30.58
Vit 1.42

FRELI (the Free Repository of English Lexical Information) is a freely redistributable list of English words with associated information (parts of speech, alternate spellings, etc.).

No download Website Updated 29 Nov 2005 isobel

Screenshot
Pop 24.41
Vit 1.00

Isobel is a framework to build complex information retrieval and analysis systems. Isobel can be functionally divided in two subsytems, Isobel Gatherer (the crawling and filtering subsystem) and Isobel Analyzer (the analysis subsystem). The two subsytems can also be used separately. Isobel Gatherer offers ready-to-use services like content fetching, scheduling, document format conversion, Hyperlink graph storage and analysis, content storage and indexing. A programmer may easily add new services. Isobel Analyzer uses the IBM UIMA architecture to reuse the analysis components developed for this architecture.

No download Website Updated 26 Jan 2006 Verticrawl Seek Site Search

Screenshot
Pop 19.75
Vit 1.00

Verticrawl Seek Site Search is a search engine technology for making powerful, fast, and customizable search solutions. It features parsing of multiple document formats, an admin interface, compatibility with sitemaps, and a search interface for HTML, XML, and PHP.

Screenshot

Project Spotlight

MASTIFF

A static analysis automation framework.

Screenshot

Project Spotlight

SYINF

A portable, cross-platform program for brief system information.