DKPro WSD provides UIMA components which encapsulate corpus readers, linguistic annotators, lexical semantic resources, WSD algorithms, and evaluation and reporting tools. You configure the components, or write new ones, and arrange them into a data processing pipeline. DKPro WSD is modular and flexible. Components which provide the same functionality can be freely swapped. You can easily run the same algorithm on different data sets, or test several different algorithms on the same data set.
jTracer is a visualization tool for libcsdbg. When libcsdbg creates a stack trace for a caught exception or a thread, or a process-wide stack trace dump, it can be configured to broadcast the trace data over TCP/IP (UDP/IP, RS-232, USB, etc. are under development). jTracer receives that data and displays it to the user, sorted by TCP/UDP/IP address and port (or serial port), process ID, executable, thread, and timestamp. It's particularly useful when you're doing cross-development and your target platform has no resources to visualize output. The rationale behind jTracer is similar to that of gdb/gdbserver.
DKPro Lab is a lightweight framework for parameter sweeping experiments. It allows the user to set up experiments consisting of multiple interdependent tasks in a declarative manner with minimal overhead. Parameters are injected into tasks via annotated class fields. Data produced by a task for any particular parameter configuration is stored and reused whenever possible to avoid needless recalculation of results. Reports can be attached to each task to post-process the experimental results and present them in a convenient manner (e.g., as tables or charts).
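The idea of sweeping a parameter space while reusing stored results can be illustrated with a minimal Python sketch. This is not DKPro Lab's API (DKPro Lab is a Java framework); the names `sweep`, `parameter_space`, `task`, and `cache` are hypothetical, and an in-memory dictionary stands in for persistent storage.

```python
import itertools


def sweep(parameter_space, task, cache=None):
    """Run `task` once per parameter combination, reusing cached results.

    parameter_space: dict mapping parameter name -> list of hashable values.
    task: callable accepting the parameters as keyword arguments.
    cache: dict keyed by parameter configuration (hypothetical stand-in
           for the stored task output).
    """
    cache = {} if cache is None else cache
    names = sorted(parameter_space)
    results = {}
    for values in itertools.product(*(parameter_space[n] for n in names)):
        key = tuple(zip(names, values))  # one concrete configuration
        if key not in cache:
            # Only compute configurations that have not been seen before.
            cache[key] = task(**dict(key))
        results[key] = cache[key]
    return results
```

Passing the same `cache` to a second call with an overlapping parameter space runs the task only for the configurations that are new.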
JWKTL (Java-based Wiktionary Library) is an application programming interface for the free multilingual online dictionary Wiktionary. Wiktionary is collaboratively constructed by volunteers and continually growing. JWKTL enables efficient and structured access to the information encoded in the English, German, and Russian Wiktionary language editions, including sense definitions, part of speech tags, etymology, example sentences, translations, semantic relations, and many other lexical information types.
TWSI is software that produces lexical substitutions in context for over 1000 frequent nouns. It processes English text. This functionality is realized by a supervised word sense disambiguation system, which is trained on sense-labeled occurrences of target words. A classification model is trained for each word and used to decide which sense an unseen occurrence most likely belongs to. Each sense has an associated list of substitutions, which are injected into the text using inline annotation.
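The per-word classification and substitution steps can be sketched in Python as follows. This is a toy illustration only: the context-overlap scoring, the class `WordSenseModel`, the function `annotate`, the sense labels, and the `word[sub1, sub2]` annotation format are all assumptions made for the example, not TWSI's actual model or output format.

```python
from collections import Counter, defaultdict


class WordSenseModel:
    """Toy classifier for ONE target word: scores each sense by how often
    the context words co-occurred with that sense in training."""

    def __init__(self):
        self.context_counts = defaultdict(Counter)  # sense -> context word counts

    def train(self, labeled_occurrences):
        # labeled_occurrences: iterable of (context_words, sense) pairs
        for context_words, sense in labeled_occurrences:
            self.context_counts[sense].update(w.lower() for w in context_words)

    def predict(self, context_words):
        words = [w.lower() for w in context_words]
        return max(self.context_counts,
                   key=lambda s: sum(self.context_counts[s][w] for w in words))


def annotate(tokens, models, substitutions):
    """Append the substitutions of the predicted sense after each target word.

    models: dict target word -> WordSenseModel (one model per word).
    substitutions: dict (word, sense) -> list of substitute words.
    """
    out = []
    for i, tok in enumerate(tokens):
        model = models.get(tok.lower())
        if model is None:
            out.append(tok)  # not a target word, leave unchanged
            continue
        sense = model.predict(tokens[:i] + tokens[i + 1:])
        subs = ", ".join(substitutions[(tok.lower(), sense)])
        out.append(f"{tok}[{subs}]")  # hypothetical inline annotation format
    return " ".join(out)
```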
nioReactor is a framework for quickly building scalable event-driven network applications. It is built on the Reactor design pattern using NIO sockets. This provides high performance and allows for a complete separation from the application-specific code. The goal of the project is to provide a simple set of APIs that allow users to build scalable servers without needing to deal with the cumbersome and sometimes temperamental pieces of the NIO package.
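The Reactor pattern itself can be sketched in a few lines. The example below is not nioReactor's API: it uses Python's standard `selectors` module as a stand-in for Java NIO selectors, and the names `Reactor`, `run_once`, and `make_echo_handler` are hypothetical. It shows the core idea of one loop that waits for ready sockets and dispatches each to an application-supplied handler, keeping the I/O plumbing separate from application code.

```python
import selectors
import socket


class Reactor:
    """Minimal single-threaded reactor: demultiplexes ready sockets to handlers."""

    def __init__(self):
        self.selector = selectors.DefaultSelector()

    def register(self, sock, handler):
        # The handler is stored as the selector key's data and called on readiness.
        sock.setblocking(False)
        self.selector.register(sock, selectors.EVENT_READ, handler)

    def unregister(self, sock):
        self.selector.unregister(sock)

    def run_once(self, timeout=1.0):
        """Wait for readable sockets, dispatch each one, and return how many fired."""
        events = self.selector.select(timeout)
        for key, _mask in events:
            key.data(key.fileobj)
        return len(events)

    def close(self):
        self.selector.close()


def make_echo_handler(reactor):
    """Application-specific code: reply with the upper-cased input."""
    def on_readable(sock):
        data = sock.recv(4096)
        if data:
            sock.sendall(data.upper())
        else:
            # An empty read means the peer closed the connection.
            reactor.unregister(sock)
            sock.close()
    return on_readable
```

A real server would also register a listening socket whose handler accepts connections and registers them in turn, and would call `run_once` in a loop.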
QuickDic parses data from en.wiktionary.org and Beolingus to generate dictionary files that can be used offline. It currently has built-in dictionaries from English to: German (Deutsch - Englisch), Italian (inglese - italiano), Afrikaans, Albanian, Arabic, Armenian, Belarusian, Bengali, Bosnian, Bulgarian, Catalan, Croatian, Czech, Chinese|Mandarin|Cantonese, Danish, Dutch, Esperanto, Estonian, Finnish, French, Greek, Hawaiian, Hebrew, Hindi, Hungarian, Icelandic, Indonesian, Irish, Latin, Latvian, Lithuanian, Japanese, Korean, Kurdish, Malay, Maori, Mongolian, Nepali, Norwegian, Persian, Polish, Portuguese, Punjabi, Romanian, Russian, Sanskrit, Serbian, Slovak, Somali, Spanish, Swahili, Swedish, Tajik, Thai, Tibetan, Turkish, Ukrainian, Vietnamese, Welsh, Yiddish, and Zulu.