RSS 13 projects tagged "Linguistic"

Download Website Updated 09 Feb 2003 DadaDodo

Screenshot
Pop 44.81
Vit 1.00

DadaDodo is a program that analyses texts for word probabilities, and then generates random cut-up sentences based on that. It is a travesty generator similar to Dissociated Press, but based on a Markov Chain of length 1.

Download Website Updated 29 Dec 2007 GNU Talk Filters

Screenshot
Pop 110.84
Vit 4.80

The GNU Talk Filters are filter programs that convert ordinary English text into text that mimics a stereotyped or otherwise humorous dialect. Some of these filters have been in the public domain for many years, but here they are provided as a single integrated package. The filters include austro, b1ff, brooklyn, chef, cockney, drawl, dubya, fudd, funetak, jethro, jive, kraut, pansy, pirate, postmodern, redneck, valspeak, and warez. This package provides the filters both as individual executables and collectively as a C library, so they can be easily embedded in other programs.

Download Website Updated 22 Jul 2002 Linguaphile

Screenshot
Pop 64.27
Vit 1.49

Linguaphile is a simple command line language translator. It is open source, platform independent, and programmed in Perl. Linguaphile currently supports the following languages: Afrikaans, Alawa, Albanian, Arrernte, Basque, Belarusian, Bulgarian, Catalan, Croatian, Czech, Danish, Dutch, English, Esperanto, Estonian, Finnish, French, Galician, German, Greek, Hawaiian, Hungarian, Icelandic, Indonesian, Interlingua, Irish, Italian, Kala Lagaw Ya, Korean, Kriol, Latvian, Lithuanian, Malay, Maltese, Maori, Norwegian, Pitjantjatjara, Polish, Portuguese, Romanian, Russian, Samoan, Serbian, Slovak, Slovenian, Spanish, Swahili, Swedish, Thai, Tok Pisin, Turkish, Ukrainian, Warlpiri, and Welsh. The Spanish to English translation is the most useful at this stage.

Download Website Updated 03 Nov 2002 Marko

Screenshot
Pop 30.28
Vit 1.42

Marko is a simple toolset that allows you to create markov chain databases of a corpus (or two) of text and then allows you to compare unknown texts to these databases. For any two marko databases you can calculate the probability that the unknown body is related to one over the other. Possible applications include intelligent mail filtering, plagiarism detection, and historical research.

Download Website Updated 06 Mar 2008 Mguesser

Screenshot
Pop 32.25
Vit 2.51

Mguesser is a tool to guess a text's character set and language. It is a standalone part of the mnoGoSearch engine. More than 100 various character set and language combinations are supported.

Download Website Updated 05 Apr 2003 Novelwriting

Screenshot
Pop 29.19
Vit 1.00

Novelwriting is a Python program to generate random structured documents based on a grammar. It is similar to the Dada Engine, but is more extensible.

Download Website Updated 07 Nov 2007 PottyMouth

Screenshot
Pop 29.60
Vit 1.00

PottyMouth transforms completely unstructured and untrusted text to valid, nice-looking, completely safe XHTML. PottyMouth is designed to handle input text from non-technical, potentially careless, or malicious users. It produces HTML that is completely safe, programmatically and visually, to include on any Web page. You don't need to make your users read any instructions before they start typing. They don't even need to know that PottyMouth is being used.

Download Website Updated 12 Sep 2008 Redet

Screenshot
Pop 229.25
Vit 11.02

Redet is a tool for developing and executing regular expressions using any of more than 50 search programs, editors, and programming languages, intended both for developing regular expressions for use elsewhere and as a search tool in its own right. For each program in each locale, a palette showing the available constructs is provided. The properties of each program are determined by runtime tests, which guarantees that they will be correct for the program version and locale. Additional features include persistent history, extensive help, a variety of character entry tools, and the ability to change locale while running. Redet is highly configurable and fully supports Unicode.

Download Website Updated 24 Mar 2001 SyNTeX - Syntactic tree drawing program

Screenshot
Pop 59.85
Vit 69.08

SyNTeX is a LaTeX preprocessor that draws syntactic trees using the LaTeX picture environment. The preprocessor reads the comments in a LaTeX file and draws the tree based on commands that it finds in the comments.

Download Website Updated 14 May 2005 WorldPrint

Screenshot
Pop 48.29
Vit 4.85

WorldPrint is a filter for Mozilla (Galeon, etc.), Htmldoc, and Netscape PostScript output that uses TrueType fonts to allow the printing of pages written in Unicode, Big5, SJIS, KOI-8, ISO-8859*, and other charsets.

Screenshot

Project Spotlight

Tcpreplay

A tool which edits and replays captured network traffic back onto the wire.

Screenshot

Project Spotlight

Nuxis

An integrated solution for virtualization management.