The cdcat is a graphical (Qt based) multi-platform catalog program that scans the directories/drives you want and memorizes the filesystem (including the tags of MP3s) and stores it in a small file. The database is stored in a gzipped XML format, so you can hack it, or use it if necessary. Cdcat can store the contents of some specified files up to a size limit if you want (for example: *.nfo).
NekoXNI is a collection of small, useful XML tools written for the Xerces Native Interface (XNI) that is the foundation of the Xerces2 implementation. The NekoXNI tools are written to illustrate the power and flexibility of the XNI framework, as well as provide useful tools for XML application developers. The current collection of XNI tools includes an HTML Parser, DTD Converter, a Style Processor, a RelaxNG Validator parser configuration, and an XML Pull Parser.
References for TeX and Friends is an ongoing project which provides a help file for LaTeX (and its friends like ConTeXt, Metapost, Metafont, etc.) using a state-of-the-art source format, DocBook/XML. Various output formats can be generated from the source file. Anyone can write a converter for any desired output format. Because the source file is XML, the easiest way to do this might be to use XSLT.
PDFreactor is a formatting processor to convert HTML and XML to PDF. It uses Cascading Style Sheets (CSS) to define page layout and styles. It allows you to dynamically generate PDF documents such as invoices, delivery notes, shipping documents, or print versions of Web content on-the-fly. Vector graphics (SVG), barcodes, MathML, XSLT, and CMYK colors are supported. All common J2EE application servers are supported. Complete .NET, PHP, Perl, Python, and Ruby APIs are included. Direct integration into automatic build processes using Apache Ant is also possible.
Babeldoc is a framework and set of applications to process documents for business-to-business and other Internet/integration applications. It is primarily intended for text documents, especially XML, but supports a wide range of operations and data types. It has a sophisticated journaling system that supports replaying and reprocessing. Babeldoc is pipeline based and supports numerous ways to combine the pipeline stages in a dynamically reconfigurable fashion. It has a GUI and a Web-based console for document processing and monitoring, and comes with tools for the tranformation of flatfile data to XML, archival, and cryptography. Additionally it is able to scan various data sources based on sophisticated constraints.
The POI project contains several components for dealing with popular OLE 2 formats in Java. POIFS is a pure Java implementation of the OLE 2 Compound document format. HSSF is a pure Java implementation of Excel 97-2003 XLS file format based on POIFS. HSSF Serializer is a pure Java serializer for Cocoon 2 that uses the Gnumeric XML format to output XLS. Full documentation of the POIFS file format is included. It is useful if you wish to output reports in the Excel file format, or if you have existing XML documents that you need to get into Excel. HSLF provides initial support for PowerPoint 97-2003 and HWPF provides limited support for Word 97-2003. POIFS can be used to read any OLE2 stream.
XCC is a tool for building XML format parsers. One way to describe what XCC does is by analogy with a generic parser generator, e.g. yacc or bison. Yacc needs a lexical analyzer to function properly, and that lexical analyzer is usually built with (f)lex. In the XML world, there are a few packages which fill in the role of lex (expat and libxml are the most known), but the high-level grammar parsing is usually done by a hand-written code; writing such a parser is a tedious and error-prone task. XCC was created to help developers in writing reliable easy-to-understood parsers for handling complex XML-based grammars.