Apache XML Graphics Commons is a library that consists of several reusable components used by Apache Batik and Apache FOP. Many of these components can easily be used separately outside the domains of SVG and XSL-FO. You will find components such as a PDF library, an RTF library, Graphics2D implementations that let you generate PDF and PostScript files, and much more.
Jaxup is a Java library that defines an interface to update XML documents, through which clients can work without knowledge of the exact object model that the document uses. The interface is called Updater, and the idea behind it is the same as with Jaxen's Navigator interface. In addition, an implementation of xmldb.org's proposed XUpdate specification is provided. Jaxup supports DOM, JDom, and DOM4J.
The Okapi project’s main purpose is to architect a set of building blocks for the creation of larger open source localization and translation tools. But many Okapi components are generic enough to be of interest to the text mining, natural language processing, and text retrieval communities. Okapi’s many text filters (HTML, Properties, XML (ITS XPath-based rules), OpenXML, ODF, Regex etc.) provide a straightforward way to access the text of multiple document formats. Its document events and pipeline can be made to integrate with other frameworks such as UIMA, LingPipe, OpenPipeline, OpenNLP, GATE, and Lucene. The advantage of Okapi’s text filters is that not only is text extracted, but all non-textual formatting is preserved. It is possible to decompose a document into events, process them via the pipeline, and then rebuild the input document without loss. Structural information can be added to Okapi document events so that tables, lists, links, titles etc. are grouped together and treated as a unit. This is useful when context based on a “universal” document structure is needed. The Okapi event model supports user configurable annotations, similar to UIMA, but simpler and more restricted in scope. User can annotate spans of text or add new resources such as translation memory matches, terminology, token types, or part of speech information.
The Styler Ant Task makes useful combinations of XSLT and other transformations easy to specify in an Ant build file. Styler can handle multiple transformations, in parallel or pipelined, split or merge files, process non-XML files such as HTML (based on JTidy), apply non-XSLT transformation, such as "regular fragmentations" and use any custom XMLReader or XMLFilter class to handle new file formats and transformation techniques.
WikklyText is a Python-based wiki and modular library for creating custom wikitext rendering applications. Its wikitext engine is compatible with the TiddlyWiki markup language. It functions equally well as a wiki-on-a-stick or as a multiuser wiki behind Apache. It includes a Drupal plugin as well.
The XEVM is an XML processing engine. It's a multi-threaded, Pub/Sub environment for dynamic programming on an event-driven state machine with TCP communications, tight fault free memory management, powerful set algebra, and a magical database. It is 100% C++ (25,000 LOC), with a thin porting layer; there are implementations for POSIX (Mac/Linux) and Win32. The XEVM is for processing XEPL (the Xepl Engine Programming Language).
SimplyBibTeX is an application to share and hold BibTeX bibliographies. One can add, edit and remove entries from bibliographies online or upload complete collections. The system uses a very simple but effective template mechanism. Users and coworkers can subscribe to RSS 2.0 or Atom feeds in order to get notified about changes.
Xineo XIL (XML Import Language) defines an XML language for transforming various record-based data sources into XML documents, and provides a fully functional XIL processing implementation. This implementation has built-in support for relational sources (via JDBC) and structured text sources (like CSV). It is also extensible through its public API, which permits dynamic integration of new data source implementations.