3 projects tagged "Linguistic"

Marko (updated 03 Nov 2002)

Pop 30.17
Vit 1.42

Marko is a simple toolset for building Markov chain databases from a corpus (or two) of text and comparing unknown texts against those databases. For any two Marko databases, you can calculate the probability that an unknown text is related to one rather than the other. Possible applications include intelligent mail filtering, plagiarism detection, and historical research.
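
The comparison described above can be illustrated with a minimal sketch. It is not Marko's code or file format: it assumes a first-order, word-level Markov chain with add-one smoothing, and the names train_chain, log_likelihood, and log_odds are hypothetical. A positive score means the unknown text fits the first corpus better than the second.

```python
import math
from collections import Counter, defaultdict

def train_chain(text):
    """Count word-to-word transitions in a corpus (hypothetical helper)."""
    words = text.lower().split()
    chain = defaultdict(Counter)
    for prev, nxt in zip(words, words[1:]):
        chain[prev][nxt] += 1
    return chain

def log_likelihood(chain, text, vocab_size):
    """Log-probability of the text's transitions, with add-one smoothing."""
    words = text.lower().split()
    total = 0.0
    for prev, nxt in zip(words, words[1:]):
        counts = chain.get(prev, Counter())
        seen = sum(counts.values())
        total += math.log((counts[nxt] + 1) / (seen + vocab_size))
    return total

def log_odds(chain_a, chain_b, text, vocab_size):
    """Positive when the text is more likely under chain_a than chain_b."""
    return (log_likelihood(chain_a, text, vocab_size)
            - log_likelihood(chain_b, text, vocab_size))

# Toy corpora, invented for illustration only.
corpus_a = "the cat sat on the mat the cat ate the fish"
corpus_b = "stocks fell as the market closed lower and stocks fell again"
unknown = "the cat sat on the mat"

vocab_size = len(set((corpus_a + " " + corpus_b + " " + unknown).lower().split()))
chain_a = train_chain(corpus_a)
chain_b = train_chain(corpus_b)
score = log_odds(chain_a, chain_b, unknown, vocab_size)
print("closer to corpus A" if score > 0 else "closer to corpus B")
```

Working in log space keeps long texts from underflowing, and the smoothing keeps a single unseen transition from driving the probability to zero.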

Zoe Intertwingle (updated 21 Feb 2005; no download or website listed)

Pop 163.34
Vit 7.12

Zoe is a Web-based email client with a built-in SMTP and POP3 server and Google-like search functionality that lives on your desktop. It is written in Java and uses Lucene to provide instant searching and threading of your email messages.

dbacl (updated 26 Mar 2006)

Pop 175.19
Vit 4.91

dbacl is a digramic Bayesian text classifier. Given some text, it calculates the posterior probabilities that the input resembles one of any number of previously learned document collections. It can be used to sort incoming email into arbitrary categories such as spam, work, and play, or simply to distinguish an English text from a French text. It fully supports international character sets, and uses sophisticated statistical models based on the Maximum Entropy Principle.
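
The idea of posterior probabilities over learned categories can be sketched as follows. This is not dbacl's implementation or command-line interface: it swaps in a plain naive Bayes model over single words and adjacent word pairs, with add-one smoothing and equal priors, instead of dbacl's maximum entropy models. The names features, learn, and posteriors are hypothetical.

```python
import math
from collections import Counter

def features(text):
    """Single words plus adjacent word pairs (digrams)."""
    words = text.lower().split()
    return words + [a + " " + b for a, b in zip(words, words[1:])]

def learn(text):
    """Feature counts for one category (hypothetical helper)."""
    return Counter(features(text))

def posteriors(categories, text):
    """Posterior probability of each category, assuming equal priors."""
    feats = features(text)
    vocab = set(feats)
    for counts in categories.values():
        vocab.update(counts)
    log_scores = {}
    for name, counts in categories.items():
        total = sum(counts.values())
        score = math.log(1.0 / len(categories))
        for f in feats:
            # Add-one smoothing so unseen features do not zero the score.
            score += math.log((counts[f] + 1) / (total + len(vocab)))
        log_scores[name] = score
    # Normalize in log space to avoid underflow on long inputs.
    top = max(log_scores.values())
    norm = sum(math.exp(s - top) for s in log_scores.values())
    return {name: math.exp(s - top) / norm for name, s in log_scores.items()}

# Toy training texts, invented for illustration only.
categories = {
    "english": learn("the dog and the cat are in the house"),
    "french": learn("le chien et le chat sont dans la maison"),
}
result = posteriors(categories, "the cat is in the house")
print(max(result, key=result.get))
```

The same function handles any number of categories, so sorting mail into spam, work, and play would only require learning one count table per category.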
