DKPro Core is a collection of software components for natural language processing (NLP) based on the Apache UIMA framework. Many powerful, state-of-the-art NLP components are already freely available in the NLP research community, and new and improved ones are developed and released continuously, covering the whole range of NLP tasks. DKPro Core provides wrappers for such third-party tools as well as original NLP components. It builds heavily on uimaFIT, which allows for rapid and easy development of NLP pipelines.
|Tags||Natural Language Processing, UIMA, NLP, POS, parsing, chunker, lemmatizer, tokenizer|
|Licenses||Apache 2.0, GPLv3|
|Operating Systems||Java Runtime Environment|
|Implementation||Apache UIMA, Java 6+|
Release Notes: This release added a bliki reader module for Wikipedia articles, a writer for TigerXML, and support for semantic annotations in TigerXML. Upgrades were made to third-party dependencies in several modules (ClearNLP, LanguageTool, StanfordNLP, TT4J). There were various bugfixes and minor improvements. Java 7 is required.
Release Notes: This release adds new analysis support (mate-tools, mstparser, sfst, etc.), new I/O support (conll, tcf, tgrep, tiger), new annotation types (semantic role labelling, phonetics), and many upgrades, enhancements, and bugfixes. It builds on Apache uimaFIT 2.0.0.
Release Notes: First release via Maven Central. New modules: OpenNLP (parser, POS tagger, tokenizer), MaltParser, MeCab, Berkeley parser, GATE (lemmatizer), binary CAS (de)serialization, and a generic JDBC reader. Many bugfixes and improvements.