Raptor is a C library that provides parsers and serializers for Resource Description Framework (RDF) triples: the parsers turn a syntax into RDF triples, and the serializers turn triples back into a syntax. The parsers support RDF/XML, N-Triples, GRDDL, and Turtle, plus XML RSS, Atom 0.3, and Atom 1.0 via an RSS tag soup parser. The serializers support RDF/XML (3 flavours), Turtle, DOT, N-Triples, RSS 1.0, and Atom 1.0. Raptor handles RDF/XML as used by RDF applications such as RSS 1.0, FOAF, DOAP, Dublin Core, and OWL. It can use either expat or libxml2 for XML parsing, uses libcurl for URI retrieval when available, and is portable to many POSIX systems.
DKPro WSD provides UIMA components that encapsulate corpus readers, linguistic annotators, lexical semantic resources, word sense disambiguation (WSD) algorithms, and evaluation and reporting tools. You configure the components, or write new ones, and arrange them into a data processing pipeline. Because DKPro WSD is modular, components that provide the same functionality can be freely swapped: you can run the same algorithm on different data sets, or test several different algorithms on the same data set.
MDIFramework provides a ready-to-use architecture to ease the creation of MDI-style applications in Java. It takes care of the overall layout of the application's main window, including a tabbed interface and a printable HTML message area. It provides a generic API for managing lengthy actions that keeps the interface responsive and ensures that actions are performed one at a time, without the application having to handle this itself. It can add metadata to already opened files, and it manages the loading and unloading of external plugins at runtime.
Duke is a fast and flexible record linkage engine. Instead of the traditional blocking (sort by key) approach, it relies on Lucene, which makes it fast (able to process 1,000,000 records in ~10 minutes). Duke can be run from the command line, and it also has an API that makes it easy to build incremental linking applications. It reads data from CSV, JDBC, SPARQL, and N-Triples sources, and provides a number of string comparators and string normalizers.
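To make "string normalizers" and "string comparators" concrete, here is a minimal sketch in plain Java. It is not Duke's API: the class and method names are hypothetical, and Levenshtein edit distance is used only as one well-known example of a comparator. The normalizer cleans up a value before comparison, and the comparator scores two values between 0 (no match) and 1 (identical).

```java
import java.util.Locale;

// Hypothetical illustration only; none of these names come from Duke.
class StringMatchSketch {

    // Normalizer: trim, lower-case, and collapse runs of whitespace.
    static String normalize(String s) {
        return s.trim().toLowerCase(Locale.ROOT).replaceAll("\\s+", " ");
    }

    // Levenshtein edit distance, computed with two rolling rows.
    static int levenshtein(String a, String b) {
        int[] prev = new int[b.length() + 1];
        int[] curr = new int[b.length() + 1];
        for (int j = 0; j <= b.length(); j++) {
            prev[j] = j; // distance from the empty prefix of a
        }
        for (int i = 1; i <= a.length(); i++) {
            curr[0] = i;
            for (int j = 1; j <= b.length(); j++) {
                int cost = a.charAt(i - 1) == b.charAt(j - 1) ? 0 : 1;
                curr[j] = Math.min(Math.min(curr[j - 1] + 1, prev[j] + 1),
                                   prev[j - 1] + cost);
            }
            int[] tmp = prev;
            prev = curr;
            curr = tmp;
        }
        return prev[b.length()];
    }

    // Comparator: similarity in [0, 1] based on normalized edit distance.
    static double similarity(String a, String b) {
        String x = normalize(a);
        String y = normalize(b);
        int longest = Math.max(x.length(), y.length());
        if (longest == 0) {
            return 1.0; // two empty strings are treated as identical
        }
        return 1.0 - (double) levenshtein(x, y) / longest;
    }

    public static void main(String[] args) {
        System.out.println(similarity("  ACME  Corp ", "acme corp"));
        System.out.println(similarity("kitten", "sitting"));
    }
}
```

A record linkage engine would typically combine several such per-field scores into an overall match decision; how Duke does that is not covered by this sketch.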
Apache Cayenne is a persistence framework that provides object-relational mapping (ORM) and remoting services. It offers a broad feature set and can address a wide range of persistence needs. Cayenne binds one or more database schemas directly to Java objects, managing atomic commits and rollbacks, SQL generation, joins, sequences, and more. With Cayenne's Remote Object Persistence, those Java objects can also be made available to clients via Web Services, and with native XML serialization they can be passed on to non-Java clients as well.
Chandler is a standards-based "Note-to-Self Organizer" designed for personal and small-group task management and calendaring. It consists of a desktop application and Chandler Hub, a free sharing service and Web application. You can also download and run your own Chandler Server.
fastutil extends the Java Collections Framework with type-specific maps, sets, lists, and queues that have a small memory footprint and provide fast access and insertion. It also provides big (64-bit) arrays, sets, and lists, and fast, practical I/O classes for binary and text files.
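The idea behind a "type-specific" collection can be sketched in plain Java. The class below is a hypothetical illustration, not fastutil's API or implementation: it stores values in an int[] rather than as boxed Integer objects, which is the general reason such collections use less memory and avoid boxing overhead on access and insertion.

```java
import java.util.Arrays;

// Hypothetical sketch of a type-specific list; not taken from fastutil.
class IntListSketch {
    private int[] items = new int[8]; // primitive backing array, no Integer objects
    private int size = 0;

    // Append a value, doubling the backing array when it is full.
    void add(int value) {
        if (size == items.length) {
            items = Arrays.copyOf(items, size * 2);
        }
        items[size++] = value;
    }

    // Read a value without unboxing.
    int getInt(int index) {
        if (index < 0 || index >= size) {
            throw new IndexOutOfBoundsException("index " + index);
        }
        return items[index];
    }

    int size() {
        return size;
    }

    // Sum all stored values, accumulating in a long to avoid overflow.
    long sum() {
        long total = 0;
        for (int i = 0; i < size; i++) {
            total += items[i];
        }
        return total;
    }

    public static void main(String[] args) {
        IntListSketch list = new IntListSketch();
        for (int i = 0; i < 100; i++) {
            list.add(i);
        }
        System.out.println(list.size() + " values, sum = " + list.sum());
    }
}
```

By contrast, a java.util.ArrayList<Integer> holds a reference to an Integer object for each element, so every add and get goes through boxing or unboxing.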
Oddjob is a Java job scheduler and task automation framework. A hierarchy of jobs is defined using a GUI designer or XML. Sequential, parallel, and conditional execution (or combinations of these) allow nearly any business process to be modelled. Oddjob can run on the desktop or on a server, and uses JMX to control remote instances via an Explorer-style GUI. Basic Web-based monitoring is also available. It can be embedded in client code and is extensible via a simple API.
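The way sequential, parallel, and conditional execution combine into a job hierarchy can be sketched with the standard java.util.concurrent classes. This is not Oddjob's API or XML format: the class and method names are hypothetical, and jobs are modelled as plain Runnable objects purely for illustration.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.function.BooleanSupplier;

// Hypothetical sketch of composable jobs; not taken from Oddjob.
class JobHierarchySketch {

    // Run child jobs one after another, in the order given.
    static Runnable sequential(Runnable... jobs) {
        return () -> {
            for (Runnable job : jobs) {
                job.run();
            }
        };
    }

    // Run child jobs concurrently and wait until all of them finish.
    static Runnable parallel(Runnable... jobs) {
        return () -> {
            ExecutorService pool = Executors.newFixedThreadPool(Math.max(1, jobs.length));
            try {
                List<Future<?>> pending = new ArrayList<>();
                for (Runnable job : jobs) {
                    pending.add(pool.submit(job));
                }
                for (Future<?> future : pending) {
                    future.get(); // blocks until this child job completes
                }
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
                throw new IllegalStateException(e);
            } catch (ExecutionException e) {
                throw new IllegalStateException(e.getCause());
            } finally {
                pool.shutdown();
            }
        };
    }

    // Run the child job only if the condition holds at execution time.
    static Runnable conditional(BooleanSupplier condition, Runnable job) {
        return () -> {
            if (condition.getAsBoolean()) {
                job.run();
            }
        };
    }

    public static void main(String[] args) {
        Runnable process = sequential(
            () -> System.out.println("fetch input"),
            parallel(
                () -> System.out.println("transform part 1"),
                () -> System.out.println("transform part 2")),
            conditional(() -> args.length > 0, () -> System.out.println("notify")),
            () -> System.out.println("done"));
        process.run();
    }
}
```

Because every combinator returns a Runnable, combinations nest freely, which mirrors the idea of a job hierarchy; scheduling, remote control, and monitoring are outside the scope of this sketch.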