RSS 18 projects tagged "Linguistic"

No download Website Updated 17 Sep 2005 Sikher

Screenshot
Pop 13.11
Vit 52.96

Sikher is a desktop program designed to archive, search, and display the Sikh scriptures using advanced functions. It allows the common person to understand and read the messages contained in the Sikh scriptures through translations and transliterations in different languages, thereby breaking the language and geographical barrier between Gurbani (Sikh Scriptures) and the world. Sikher is a robust, future proof, and cross-platform application which may be used by developers to create similar internationalized and localized search applications.

Download Website Updated 04 Jul 2011 Emdros

Screenshot
Pop 186.41
Vit 19.70

Emdros is a corpus query system for storing and searching linguistically annotated text. It is very generic, supporting almost any kind of annotation from almost any linguistic theory. All linguistic levels of analysis are supported, including phonology, morphology, the lexical level, syntax, and discourse. The core libraries act as a middleware layer between a client and an underlying SQL database. MySQL, PostgreSQL, and SQLite are supported.

Download Website Updated 22 Jul 2012 Apache Lucene

Screenshot
Pop 222.58
Vit 15.56

Apache Lucene is a high-performance, full-featured text search engine library written entirely in Java. It is suitable for nearly any application that requires full-text search, especially cross-platform.

No download Website Updated 02 Aug 2012 Poliqarp

Screenshot
Pop 76.68
Vit 10.76

Poliqarp is a universal suite of utilities for processing large corpora. It includes a concordancer that works on binary corpora compiled for efficient searching and a corpus builder. It supports positional tagsets, ambiguities in the texts, and Unicode.

No download Website Updated 22 Jul 2012 Apache Solr

Screenshot
Pop 139.43
Vit 9.27

Solr is an enterprise search platform from the Apache Lucene project. Its major features include powerful full-text search, hit highlighting, faceted search, dynamic clustering, database integration, and rich document (e.g. Word and PDF) handling. Solr is highly scalable, providing distributed search and index replication, and it powers the search and navigation features of many of the world's largest internet sites. Solr is written in Java and runs as a standalone full-text search server within a servlet container such as Tomcat. Solr uses the Lucene Java search library at its core for full-text indexing and search, and has REST-like HTTP/XML and JSON APIs that make it easy to use from virtually any programming language. Solr's powerful external configuration allows it to be tailored to almost any type of application without Java coding, and it has an extensive plugin architecture when more advanced customization is required.

No download No website Updated 21 Feb 2005 Zoe Intertwingle

Screenshot
Pop 187.51
Vit 7.21

Zoe is a Web based email client with a built in SMTP and POP3 server and Google-like search functionality that lives on your desktop. It is written in Java and uses Lucene technology to provided instant searching and threading of your email messages.

No download Website Updated 25 Sep 2006 Connexor Machinese

Screenshot
Pop 30.98
Vit 3.99

Connexor Machinese analyzers process sequences of written words, identify and classify the various entities in them, and show how these relate to each other, marking the language with a simple and systematic notation. Currently, the Machinese product family includes: Machinese Phrase Tagger, a fast, light-weight morphosyntactic tagger; Machinese Syntax, a full-scale dependency parser; Machinese Semantics, a dependency parser with semantic analysis; and Machinese Metadata, an entity extractor.

Download Website Updated 10 Jun 2008 SCAN

Screenshot
Pop 61.02
Vit 2.62

SCAN is a personal information retrieval framework, combining search, text analysis, tagging, and metadata functions for document collections management. SCAN is a component-based software using a number of plugins for specific features. The basic SCAN platform can be easily extended with plugins for different document formats and document location types.

Download Website Updated 17 Jan 2007 ddc-concordance

Screenshot
Pop 36.55
Vit 2.31

ddc-concordance is a search engine for linguists. It lets you search for words or sequences of words together with morphological patterns. It was created to help linguists find a particular collocation or word in a given context.

Download Website Updated 06 Oct 2004 Dowser

Screenshot
Pop 62.23
Vit 2.24

Dowser is a Web research and archiving tool that clusters results from search engines, associates words that appear in previous searches, and keeps a local cache of all the results you click on in a searchable database along with summaries and links to related information. It helps you to keep track of what you find, with no advertising.

Screenshot

Project Spotlight

LDAP Account Manager

A web-based account manager for LDAP directories.

Screenshot

Project Spotlight

Debian GNU/Linux

The Universal Operating System.