193 projects tagged "Linguistic"

Download Website Updated 21 Dec 2013 queXC

Pop 74.93
Vit 7.73

queXC is a Web-based data cleaning and coding/classification system that takes a data file (such as data collected from a questionnaire) and cleans the text input fields by fixing their spacing and spell-checking them. It allows operators to code text fields to existing coding schemes, or to create a coding scheme on the fly. Multiple operators can code and clean simultaneously, and operators can be assigned to particular codes. The queXC system includes some coding schemes created from ABS (Australian Bureau of Statistics) data. It can be used as an open source replacement for NVivo in some situations.

No download Website Updated 02 Mar 2013 jsesh

Pop 54.99
Vit 7.64

JSesh is an editor for ancient Egyptian hieroglyphic texts. It can export the text into picture formats, such as WMF files for easy inclusion in word processors. JSesh can also be used as a library for other projects concerning ancient Egyptian.

No download Website Updated 02 Aug 2012 Poliqarp

Pop 55.43
Vit 7.64

Poliqarp is a universal suite of utilities for processing large corpora. It includes a concordancer that works on binary corpora compiled for efficient searching and a corpus builder. It supports positional tagsets, ambiguities in the texts, and Unicode.

No download Website Updated 05 May 2012 Text-Tokenizer

Pop 65.31
Vit 7.54

Text-Tokenizer is a Perl module based on a flex-generated lexical analyzer that can be used to parse text (configuration) files. With this module, a simple, full-featured configuration parser can be written very easily.

Download Website Updated 28 Feb 2012 Hspell

Pop 66.81
Vit 7.46

Hspell is a Hebrew linguistic project. It features a Hebrew spell-checker, and aims to use the databases and algorithms developed for it as a morphology engine (for example, for search engines) and, in the future, for more advanced applications such as Hebrew speech synthesis.

Download Website Updated 06 Apr 2010 Glossword

Pop 137.06
Vit 7.18

Glossword is a system to publish dictionaries, glossaries, and encyclopedias. It features an installation wizard, support for multiple languages, visual themes, multi-domain installation, an administrative interface with multi-user support, built-in search and cache engines, the ability to export/import dictionaries in XML format, and W3C-validated code. Glossword is useful for any sort of dictionary-like content, including sites with game cheat codes, online translators, references, and various kinds of CMS solutions.

No download No website Updated 21 Feb 2005 Zoe Intertwingle

Pop 158.29
Vit 7.12

Zoe is a Web-based email client with a built-in SMTP and POP3 server and Google-like search functionality that lives on your desktop. It is written in Java and uses Lucene technology to provide instant searching and threading of your email messages.

Download Website Updated 18 Feb 2009 Unicode Utilities

Pop 89.97
Vit 5.83

The Unicode Utilities are a set of programs for manipulating and analyzing Unicode text. uniname prints any combination of the character offset of each character, its byte offset, its hex code value, its encoding, the glyph itself, and its name. unidesc reports the character ranges to which different portions of the text belong. unihist generates a histogram of the characters in its input. ExplicateUTF8 determines and explains the validity of a sequence of bytes as a UTF-8 encoding. unirev reverses UTF-8 strings. unifuzz tests other programs' Unicode handling.
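
To make the per-character fields in the uniname description concrete, here is a minimal Python sketch of that kind of report. It is not uniname's implementation or output format; the function name `describe` and the row layout are assumptions chosen for illustration, and it relies only on Python's standard `unicodedata` module.

```python
import unicodedata

def describe(text):
    """Return one row per character (hypothetical layout, not uniname's):
    (char offset, byte offset, code point, UTF-8 bytes, glyph, name)."""
    rows = []
    byte_offset = 0
    for char_offset, ch in enumerate(text):
        encoded = ch.encode("utf-8")
        rows.append((
            char_offset,                               # position counted in characters
            byte_offset,                               # position counted in UTF-8 bytes
            f"U+{ord(ch):04X}",                        # hex code value
            " ".join(f"{b:02x}" for b in encoded),     # UTF-8 encoding as hex bytes
            ch,                                        # the glyph itself
            unicodedata.name(ch, "<unnamed>"),         # Unicode character name
        ))
        byte_offset += len(encoded)
    return rows

# Three characters that take 1, 2, and 3 bytes in UTF-8.
for row in describe("a\u00e9\u20ac"):
    print(*row, sep="\t")
```

The character offset and byte offset diverge as soon as a character needs more than one byte in UTF-8, which is why a report like this lists both.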

Download Website Updated 18 Sep 2010 Linguistic Tree Constructor

Pop 75.10
Vit 5.48

Linguistic Tree Constructor is an application for drawing linguistic syntax trees. Its main strength is assisting in data production by quickly analyzing large amounts of text. "Generic" trees are supported, as well as RRG and X-Bar trees. Node-categories are user-definable, and additional user-definable labels can also be applied to each node. Publication-quality, high-resolution, horizontal trees can be drawn. The file format is based on TIGER-XML.

No download Website Updated 22 Dec 2008 Open Translation Engine

Pop 67.86
Vit 5.47

Open Translation Engine (OTE) is a Web-based system to enable community management of translation dictionaries.
