UIMA SDK is a software architecture and framework for supporting the development, integration, and deployment of search and analysis technologies. It can be used to analyze large volumes of unstructured information (text, audio, video, images, etc.) to discover, organize, and deliver relevant knowledge to the client or application end user.
Oddjob is a Java job scheduler and task automation framework. A hierarchy of jobs is defined using either a GUI designer or XML. Sequential, parallel, and conditional execution (or combinations of these) allow nearly any business process to be modelled. Oddjob can run on the desktop or on a server and uses JMX to control remote instances via an Explorer-style GUI. Basic Web-based monitoring is also available. It can be embedded in client code and is easily extensible via a simple API.
Conary is a distributed software management system for Linux distributions. It replaces traditional package management solutions (such as RPM and dpkg) with one designed to enable loose collaboration across the Internet. It enables sets of distributed and loosely connected repositories to define the components which are installed on a Linux system. Rather than having a full distribution come from a single vendor, it allows administrators and developers to branch a distribution, keeping the pieces which fit their environment while grabbing components from other repositories across the Internet.
SableCC is a parser generator that generates fully featured object-oriented frameworks for building compilers, interpreters, and other text parsers. In particular, generated frameworks include intuitive, strictly typed abstract syntax trees and tree walkers. SableCC also keeps a clean separation between machine-generated code and user-written code, which leads to a shorter development cycle.
fastutil extends the Java Collections Framework by providing type-specific maps, sets, lists, and queues with a small memory footprint and fast access and insertion. It also provides big (64-bit) arrays, sets, and lists, and fast, practical I/O classes for binary and text files.
Raptor is a C library providing a set of parsers and serializers for Resource Description Framework (RDF) triples: the parsers turn a syntax into RDF triples, and the serializers turn triples into a syntax. The parsers support RDF/XML, N-Triples, GRDDL, and Turtle, and, via an RSS tag soup parser, XML RSS, Atom 0.3, and Atom 1.0. The serializers support RDF/XML (3 flavours), Turtle, DOT, N-Triples, RSS 1.0, and Atom 1.0. Raptor handles RDF/XML as used by RDF applications such as RSS 1.0, FOAF, DOAP, Dublin Core, and OWL. It can use either expat or libxml2 for XML parsing, uses libcurl for URI retrieval when available, and is portable to many POSIX systems.