11 projects tagged "UIMA"

No download Website Updated 13 May 2014 Apache UIMA Java SDK

Screenshot
Pop 207.54
Vit 47.60

UIMA SDK is a software architecture and framework for supporting the development, integration, and deployment of search and analysis technologies. It can be used to analyze large volumes of unstructured information (text, audio, video, images, etc.) to discover, organize, and deliver relevant knowledge to the client or application end user.

No download No website Updated 29 Apr 2014 DKPro Core

Screenshot
Pop 100.20
Vit 7.56

DKPro Core is a collection of software components for natural language processing (NLP) based on the Apache UIMA framework. Many powerful and state-of-the-art NLP components are already freely available in the NLP research community. New and improved components are being developed and released continuously. The components cover the whole range of NLP-related processing tasks. DKPro Core provides wrappers for such third-party tool as well as original NLP components. DKPro Core builds heavily on uimaFIT which allows for rapid and easy development of NLP processing pipelines.

Download Website Updated 29 Nov 2011 Apache OpenNLP

Screenshot
Pop 85.51
Vit 1.49

Apache OpenNLP is a machine learning based toolkit for the processing of natural language text. It supports the most common NLP tasks, such as tokenization, sentence segmentation, part-of-speech tagging, named entity extraction, chunking, parsing, and coreference resolution. These tasks are usually required to build more advanced text processing services.

No download Website Updated 04 Sep 2013 uimaFIT

Screenshot
Pop 57.79
Vit 2.28

uimaFIT provides Java annotations for describing UIMA components which can be used to directly describe the UIMA components in Java code without the need for traditional UIMA XML descriptors. This greatly simplifies refactoring a component definition (e.g., changing a configuration parameter name). uimaFIT also makes it easy to instantiate UIMA components without using XML descriptor files by providing convenient factory methods. This makes uimaFIT an ideal library for testing UIMA components because the component can be easily instantiated and invoked without requiring a descriptor file to be created first. uimaFIT is very useful in research environments in which programmatic/dynamic instantiation of UIMA pipelines can simplify experimentation. For example, when performing 10-fold cross-validation across a number of experimental conditions, it can be quite laborious to create a different set of descriptor files for each run, or even a script which generates such descriptor files. uimaFIT is type system agnostic and does not depend on (or provide) a specific type system. This project has been superseded by the Apache uimaFIT project.

No download No website Updated 30 Nov 2013 DKPro WSD

Screenshot
Pop 56.25
Vit 2.08

DKPro WSD provides UIMA components which encapsulate corpus readers, linguistic annotators, lexical semantic resources, WSD algorithms, and evaluation and reporting tools. You configure the components, or write new ones, and arrange them into a data processing pipeline. DKPro WSD is modular and flexible. Components which provide the same functionality can be freely swapped. You can easily run the same algorithm on different data sets, or test several different algorithms on the same data set.

No download Website Updated 10 Oct 2013 DKPro Similarity

Screenshot
Pop 39.04
Vit 1.00

DKPro Similarity is a framework for text similarity. Its goal is to provide a comprehensive repository of text similarity measures which are implemented using standardized interfaces. The framework is designed to complement DKPro Core, a collection of software components for natural language processing (NLP) based on the Apache UIMA framework. DKPro Similarity comprises a wide variety of measures ranging from ones based on simple n-grams and common subsequences to high-dimensional vector comparisons and structural, stylistic, and phonetic measures. In order to promote the reproducibility of experimental results and to provide reliable, permanent experimental conditions for future studies, DKPro Similarity also comes with a set of full-featured experimental setups which can be run out-of-the-box and used for future systems to built upon.

No download Website Updated 15 Sep 2013 JWKTL

Screenshot
Pop 38.11
Vit 1.01

JWKTL (Java-based Wiktionary Library) is an application programming interface for the free multilingual online dictionary Wiktionary. Wiktionary is collaboratively constructed by volunteers and continually growing. JWKTL enables efficient and structured access to the information encoded in the English, German, and Russian Wiktionary language editions, including sense definitions, part of speech tags, etymology, example sentences, translations, semantic relations, and many other lexical information types.

No download Website Updated 28 Nov 2013 JobimText

Screenshot
Pop 35.94
Vit 1.01

JobimText provides a software solution for automatic text expansion using contextualized distributional similarity.

No download Website Updated 05 Sep 2013 Apache uimaFIT

Screenshot
Pop 35.72
Vit 1.00

Apache uimaFIT provides Java annotations for describing UIMA components which can be used to directly describe the UIMA components in Java code without the need for traditional UIMA XML descriptors. This greatly simplifies refactoring a component definition (e.g., changing a configuration parameter name). It also makes it easy to instantiate UIMA components without using XML descriptor files by providing convenient factory methods. It is ideal for testing UIMA components because the component can be easily instantiated and invoked without requiring a descriptor file to be created first.

No download Website Updated 15 Sep 2013 DKPro Lab

Screenshot
Pop 34.81
Vit 1.01

DKPro Lab is a lightweight framework for parameter sweeping experiments. It allows the user to set up experiments consisting of multiple interdependent tasks in a declarative manner with minimal overhead. Parameters are injected into tasks using via annotated class fields. Data produced by a task for any particular parameter configuration is stored and re-used whenever possible to avoid needless recalculation of results. Reports can be attached to each task to post-process the experimental results and present them in a convenient manner (e.g., as tables or charts).

Screenshot

Project Spotlight

email2trac

A utility that convert an email to a Trac ticket.

Screenshot

Project Spotlight

liblangtag

An interface library to access tags for identifying languages.