DKPro Similarity is a framework for text similarity. Its goal is to provide a comprehensive repository of text similarity measures which are implemented using standardized interfaces. The framework is designed to complement DKPro Core, a collection of software components for natural language processing (NLP) based on the Apache UIMA framework. DKPro Similarity comprises a wide variety of measures ranging from ones based on simple n-grams and common subsequences to high-dimensional vector comparisons and structural, stylistic, and phonetic measures. In order to promote the reproducibility of experimental results and to provide reliable, permanent experimental conditions for future studies, DKPro Similarity also comes with a set of full-featured experimental setups which can be run out-of-the-box and used for future systems to built upon.
PRUN is a cluster job scheduler and parallel task executor system for Linux/Unix systems. Like other full-featured batch systems, it provides a job queueing mechanism, job scheduling, a priority scheme, resource monitoring, and resource management. PRUN is a simple, easy-to-use, full-featured, stable solution to run short-term or long-term jobs across the cluster.
jdiagnostics is a library of routines for inspecting the state of a JVM and building a support bundle. It makes it easy to diagnose issues with classpaths, resources, serviceloaders, versioning, XML parsers, and so forth. It has no dependencies, and is an essential library to build into any infrastructure application.
gradle-sablecc-plugin is a gradle plugin which creates parsers using SableCC. SableCC supports automatic CST-to-AST transformation, emits all the visitor patterns and analysis helpers you will likely ever need, and is LR, not LL(k). Many example grammars are available for modern languages; the author of this plugin has written dozens.