26 projects tagged "Linguistic"

Download Website Updated 05 Oct 2013 Apache Lucene

Pop 258.49
Vit 21.25

Apache Lucene is a high-performance, full-featured text search engine library written entirely in Java. It is suitable for nearly any application that requires full-text search, especially cross-platform applications.

No download No website Updated 21 Feb 2005 Zoe Intertwingle

Pop 164.75
Vit 7.12

Zoe is a Web-based email client with a built-in SMTP and POP3 server and Google-like search functionality that lives on your desktop. It is written in Java and uses Lucene technology to provide instant searching and threading of your email messages.

No download Website Updated 05 Oct 2013 Apache Solr

Pop 164.48
Vit 13.41

Solr is an enterprise search platform from the Apache Lucene project. Its major features include powerful full-text search, hit highlighting, faceted search, dynamic clustering, database integration, and rich document (e.g. Word and PDF) handling. Solr is highly scalable, providing distributed search and index replication, and it powers the search and navigation features of many of the world's largest internet sites. Solr is written in Java and runs as a standalone full-text search server within a servlet container such as Tomcat. Solr uses the Lucene Java search library at its core for full-text indexing and search, and has REST-like HTTP/XML and JSON APIs that make it easy to use from virtually any programming language. Solr's powerful external configuration allows it to be tailored to almost any type of application without Java coding, and it has an extensive plugin architecture when more advanced customization is required.

Download Website Updated 06 Apr 2010 Glossword

Pop 134.55
Vit 7.24

Glossword is a system to publish dictionaries, glossaries, and encyclopedias. It features an installation wizard, support for multiple languages, visual themes, multi-domain installation, an administrative interface with multi-user support, built-in search and cache engines, the ability to export/import dictionaries in XML format, and W3C-validated code. Glossword is useful for any sort of dictionary-like content, including sites with game cheat codes, online translators, references, and various kinds of CMS solutions.

Download Website Updated 30 Jan 2001 Ciao Prolog

Pop 110.43
Vit 1.00

Ciao is a complete Prolog system subsuming ISO-Prolog with a novel modular design which allows both restricting and extending the language. Ciao extensions currently include feature terms (records), higher-order, functions, constraints, objects, persistent predicates, a good base for distributed execution (agents), and concurrency. Libraries also support WWW programming, sockets, and external interfaces (C, Java, TCL/Tk, relational databases, etc.). An Emacs-based environment, a stand-alone compiler, and a toplevel shell are also provided.

No download Website Updated 22 Dec 2008 Open Translation Engine

Pop 68.63
Vit 5.50

Open Translation Engine (OTE) is a Web-based system to enable community management of translation dictionaries.

Download Website Updated 04 Feb 2007 Japana

Pop 65.83
Vit 4.04

Japana is a small HTTP proxy written in Perl. It converts Japanese characters (Hiragana, Katakana, and Kanji) into ASCII (Romaji) on the fly. The conversion is done with the kakasi library; an older version that does not need kakasi still exists.

Download Website Updated 06 Oct 2004 Dowser

Pop 49.12
Vit 2.24

Dowser is a Web research and archiving tool that clusters results from search engines, associates words that appear in previous searches, and keeps a local cache of all the results you click on in a searchable database along with summaries and links to related information. It helps you to keep track of what you find, with no advertising.

Download Website Updated 22 Jan 2004 PyBabelPhish

Pop 48.81
Vit 1.80

PyBabelPhish is a GTK-based program providing fast translations from one natural language to another. Texts translated into Spanish can be read aloud in Spanish through optional text-to-speech support.

Download Website Updated 07 Oct 2011 OpenEphyra

Pop 48.58
Vit 2.70

OpenEphyra is a question answering (QA) system. It retrieves answers to natural language questions from the Web and other sources. OpenEphyra comes with implementations of algorithms that proved effective in Carnegie Mellon's Ephyra system, which participated in the TREC evaluations. It is platform independent and can be set up in just a few minutes. The goal of this project is to give researchers the opportunity to develop new QA techniques without worrying about the end-to-end system.
