RSS 185 projects tagged "Linguistic"

Download Website Updated 11 Feb 2013 ANTLR

Screenshot
Pop 298.06
Vit 5.53

ANTLR (ANother Tool for Language Recognition) is a language tool that provides a framework for constructing recognizers, compilers, and translators from grammatical descriptions containing C++, Java, or Sather actions. It is similar to the popular compiler generator YACC, however ANTLR is much more powerful and easy to use. ANTLR-produced parsers are not only highly efficient, but are both human-readable and human-debuggable (especially with the interactive ParseView debugging tool). ANTLR can generate parsers, lexers, and tree-parsers in either C++, Java, or Sather. ANTLR is currently written in Java.

Download Website Updated 05 Mar 2009 After the Deadline for WordPress

Screenshot
Pop 31.94
Vit 1.00

After the Deadline for WordPress is a plugin that interfaces with After the Deadline, a Web service that helps you improve your writing and spend less time editing. This plugin adds a button for checking spelling and writing style to the WordPress visual editor mode. An API key is required to access the After the Deadline service.

Download Website Updated 11 Mar 2008 Algraeph

Screenshot
Pop 44.36
Vit 1.00

Algraeph is a tool for manual alignment of linguistic graphs, such as phrase structure trees or dependency structures, where each node corresponds to a subsequence of the analyzed input sentence. It allows you to express the similarity between two graphs by aligning their nodes and attaching relation labels to these alignments. Graphs are read from one or more graphbanks (or treebanks) in the GraphML or Alpino formats. Alignment relations are user-defined and are stored in a simple XML format, which can be used for further processing. The resulting parallel graph corpus is a useful data set for many tasks in computational linguistics and natural language processing.

Download Website Updated 18 Aug 2008 An Gramadóir

Screenshot
Pop 50.38
Vit 2.68

An Gramadóir is a grammar checking engine that is designed for the rapid development of grammar checkers for minority languages and other languages with limited computational resources. Rule specifications are given according to a simple syntax combining XML and regular expressions. Part-of-speech tagging can be learned from text corpora using statistical methods. It is currently implemented for Irish (Gaeilge).

Download Website Updated 05 Oct 2013 Apache Lucene

Screenshot
Pop 258.07
Vit 21.35

Apache Lucene is a high-performance, full-featured text search engine library written entirely in Java. It is suitable for nearly any application that requires full-text search, especially cross-platform.

No download Website Updated 05 Oct 2013 Apache Solr

Screenshot
Pop 164.27
Vit 13.48

Solr is an enterprise search platform from the Apache Lucene project. Its major features include powerful full-text search, hit highlighting, faceted search, dynamic clustering, database integration, and rich document (e.g. Word and PDF) handling. Solr is highly scalable, providing distributed search and index replication, and it powers the search and navigation features of many of the world's largest internet sites. Solr is written in Java and runs as a standalone full-text search server within a servlet container such as Tomcat. Solr uses the Lucene Java search library at its core for full-text indexing and search, and has REST-like HTTP/XML and JSON APIs that make it easy to use from virtually any programming language. Solr's powerful external configuration allows it to be tailored to almost any type of application without Java coding, and it has an extensive plugin architecture when more advanced customization is required.

Download Website Updated 31 Aug 2009 Apertium

Screenshot
Pop 36.95
Vit 46.57

Apertium is a machine translation platform, initially aimed at related-language pairs, but recently expanded to deal with more divergent language pairs (such as English-Catalan). The platform provides a language-independent machine translation engine, tools to manage the linguistic data necessary to build a machine translation system for a given language pair, and linguistic data for a growing number of language pairs.

Download Website Updated 24 Aug 2004 Arabic Wordlist

Screenshot
Pop 49.57
Vit 1.82

Arabic Wordlist is a project to deliver an English to Arabic translated word list to be used in translations and/or dictionaries. The word list contains in excess of 83,500 words (and growing), and spans a variety of categories (i.e. it is general in nature). This word list is encoded in UTF-8, and is expected to be used in many online free dictionaries.

No download Website Updated 21 Sep 2004 Atlantida

Screenshot
Pop 12.96
Vit 59.11

Atlantida is a multilingual cross-platform dictionary. Currently it has 310,000 definitions, and knows how to pronounce 21,000 English words.

No download Website Updated 02 Sep 2004 Babel

Screenshot
Pop 16.97
Vit 59.28

Babel is a linguistics tool that uses dynamic data structuring of words and context spaces. Models can be mixed and references added to created terms. It includes the markov, ngram, and ispell tools.

Screenshot

Project Spotlight

Sculptor

A DSL and code generator for Java enterprise applications.

Screenshot

Project Spotlight

QCAD

A 2D CAD program.