RSS 16 projects tagged "NLP"

No download Website Updated 09 Dec 2013 WebAnno

Screenshot
Pop 36.63
Vit 1.29

WebAnno is a general purpose Web-based annotation tool for a wide range of linguistic annotations. It offers annotation project management, freely configurable tagsets, and the management of users in different roles. It uses technology from the brat rapid annotation tool for visualizing and editing annotations in a Web browser. It supports annotation and visualization of arbitrarily large documents, pluggable import/export filters, the curation of annotations across various users, and farming out annotations to a crowdsourcing platform.

No download Website Updated 14 Sep 2013 JOWKL

Screenshot
Pop 30.89
Vit 14.82

JOWKL (Java OmegaWiki Library) is a Java-based application programming interface which allows the user to access all information in the free, multilingual online dictionary OmegaWiki.

No download Website Updated 15 Sep 2013 JWKTL

Screenshot
Pop 37.28
Vit 1.01

JWKTL (Java-based Wiktionary Library) is an application programming interface for the free multilingual online dictionary Wiktionary. Wiktionary is collaboratively constructed by volunteers and continually growing. JWKTL enables efficient and structured access to the information encoded in the English, German, and Russian Wiktionary language editions, including sense definitions, part of speech tags, etymology, example sentences, translations, semantic relations, and many other lexical information types.

No download Website Updated 28 Nov 2013 JobimText

Screenshot
Pop 36.44
Vit 1.01

JobimText provides a software solution for automatic text expansion using contextualized distributional similarity.

No download No website Updated 30 Nov 2013 DKPro WSD

Screenshot
Pop 58.99
Vit 2.16

DKPro WSD provides UIMA components which encapsulate corpus readers, linguistic annotators, lexical semantic resources, WSD algorithms, and evaluation and reporting tools. You configure the components, or write new ones, and arrange them into a data processing pipeline. DKPro WSD is modular and flexible. Components which provide the same functionality can be freely swapped. You can easily run the same algorithm on different data sets, or test several different algorithms on the same data set.

No download No website Updated 28 Nov 2013 TWSI

Screenshot
Pop 35.78
Vit 1.76

TWSI is software that produces lexical substitutions in context for over 1000 frequent nouns. It processes English text. This functionality is realized by a supervised word sense disambiguation system, which is trained by sense-labeled occurrences of target words. A classification model is trained for each word, and used to decide which sense an unseen occurrence most likely belongs to. Associated with senses are lists of substitutions, which are injected into the text using inline annotation.

No download No website Updated 23 Dec 2013 DKPro Core

Screenshot
Pop 57.06
Vit 2.33

DKPro Core is a collection of software components for natural language processing (NLP) based on the Apache UIMA framework. Many powerful and state-of-the-art NLP components are already freely available in the NLP research community. New and improved components are being developed and released continuously. The components cover the whole range of NLP-related processing tasks. DKPro Core provides wrappers for such third-party tool as well as original NLP components. DKPro Core builds heavily on uimaFIT which allows for rapid and easy development of NLP processing pipelines.

Download No website Updated 14 Oct 2013 UBY

Screenshot
Pop 84.99
Vit 4.00

UBY is a large-scale unified lexical-semantic resource for natural language processing (NLP) based on the ISO standard Lexical Markup Framework (LMF).

No download Website Updated 16 Feb 2012 jWeb1T

Screenshot
Pop 36.82
Vit 1.00

jWeb1T is an Java tool for efficiently searching n-gram data in the Web 1T 5-gram corpus format. It is based on a binary search algorithm that finds the n-grams and returns their frequency counts in logarithmic time. As the corpus is stored in many files, a simple index is used to retrieve the files containing the n-grams.

No download No website Updated 14 Feb 2014 TreeTagger for Java

Screenshot
Pop 162.05
Vit 7.27

TreeTagger for Java (TT4J) is a Java wrapper around the popular TreeTagger package by Helmut Schmid, a language independent part-of-speech tagger and lemmatizer. It was written with a focus on platform-independence and easy integration into applications.

Screenshot

Project Spotlight

Profanity

A ncurses-based Jabber client.

Screenshot

Project Spotlight

ToPIA

A persistence and application distribution framework.