Absimpa is a Java class library that provides a recursive descent parser. It is an experiment in abstraction and Java generics. It tries to include only the essentials of what makes up a parser and to leave out, or rather leave free for specification, what is not relevant to the parsing process.
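To illustrate the general idea of a recursive descent parser that leaves the result type open through generics, here is a minimal sketch in plain Java. It is not Absimpa's API: the class ExprParser, its methods, and the tiny grammar are hypothetical and chosen only for illustration. Each grammar rule becomes one method, and the caller supplies the functions that build results.

```java
import java.util.function.BinaryOperator;
import java.util.function.Function;

// Hypothetical sketch, NOT Absimpa's API: a recursive descent parser for
//   expr   := term ('+' term)*
//   term   := factor ('*' factor)*
//   factor := NUMBER | '(' expr ')'
// The result type N is left to the caller, who supplies the builder functions.
final class ExprParser<N> {
    private final String input;
    private final Function<Integer, N> leaf; // builds a result from a number
    private final BinaryOperator<N> add;     // combines the operands of '+'
    private final BinaryOperator<N> mul;     // combines the operands of '*'
    private int pos = 0;

    ExprParser(String input, Function<Integer, N> leaf,
               BinaryOperator<N> add, BinaryOperator<N> mul) {
        this.input = input;
        this.leaf = leaf;
        this.add = add;
        this.mul = mul;
    }

    // Parses the whole input and rejects trailing characters.
    N parse() {
        N result = parseExpr();
        if (peek() != '\0') {
            throw new IllegalArgumentException("unexpected input at " + pos);
        }
        return result;
    }

    private N parseExpr() {
        N left = parseTerm();
        while (peek() == '+') {
            pos++;
            left = add.apply(left, parseTerm());
        }
        return left;
    }

    private N parseTerm() {
        N left = parseFactor();
        while (peek() == '*') {
            pos++;
            left = mul.apply(left, parseFactor());
        }
        return left;
    }

    private N parseFactor() {
        if (peek() == '(') {
            pos++;
            N inner = parseExpr();
            if (peek() != ')') {
                throw new IllegalArgumentException("')' expected at " + pos);
            }
            pos++;
            return inner;
        }
        int start = pos;
        while (pos < input.length() && Character.isDigit(input.charAt(pos))) {
            pos++;
        }
        if (start == pos) {
            throw new IllegalArgumentException("number expected at " + pos);
        }
        return leaf.apply(Integer.parseInt(input.substring(start, pos)));
    }

    // Skips spaces, then returns the next character, or '\0' at end of input.
    private char peek() {
        while (pos < input.length() && input.charAt(pos) == ' ') {
            pos++;
        }
        return pos < input.length() ? input.charAt(pos) : '\0';
    }

    // Same parser, two result types: an evaluated int and a prefix string.
    static int evaluate(String s) {
        return new ExprParser<Integer>(s, n -> n, Integer::sum, (a, b) -> a * b).parse();
    }

    static String toPrefix(String s) {
        return new ExprParser<String>(s, n -> Integer.toString(n),
                (a, b) -> "(+ " + a + " " + b + ")",
                (a, b) -> "(* " + a + " " + b + ")").parse();
    }
}
```

The parsing logic never mentions what a result is, so the same class can evaluate an expression or build a textual tree depending on the functions passed in.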
Babeldoc is a framework and set of applications to process documents for business-to-business and other Internet/integration applications. It is primarily intended for text documents, especially XML, but supports a wide range of operations and data types. It has a sophisticated journaling system that supports replaying and reprocessing. Babeldoc is pipeline based and supports numerous ways to combine the pipeline stages in a dynamically reconfigurable fashion. It has a GUI and a Web-based console for document processing and monitoring, and comes with tools for the transformation of flat-file data to XML, archival, and cryptography. Additionally, it is able to scan various data sources based on sophisticated constraints.
FreeMarker is a template engine that was originally designed so that servlet-based applications could keep graphical design separate from application logic. The templates provide an easy and highly flexible way to generate any kind of text output (HTML, PostScript, TeX, source code, etc.) from a variety of data sources such as Java objects, Jython objects, XML object models, and more.
GNU Source-highlight produces a document with syntax highlighting when given a source file. As input it handles many languages (e.g., Java, C/C++, Prolog, Perl, PHP3, Python, Flex, HTML) and other formats (e.g., ChangeLog and log files); as output it supports HTML, XHTML, DocBook, ANSI color escapes, LaTeX, and Texinfo. Input and output formats can be specified with a regular expression-oriented syntax.
GXPARSE is not a new XML parser, but an additional processing layer that makes it much easier to use event-based parsers such as SAX parsers. It supports both direct sequential output and random access output (via the Resequencer interface). The random access mode delays output until all input has been processed, but makes it much easier to handle ID/IDREF attributes. GXPARSE retains most advantages of an event-based parser. Application development and maintenance are considerably easier, but processing is a little slower.
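The trade-off described above, delaying output until all input has been seen so that ID/IDREF-style references can be resolved, can be sketched with the standard Java SAX API alone. This is not GXPARSE code and does not use its Resequencer interface: the class DeferredRefs and the element and attribute names (section, link, id, ref, title) are hypothetical.

```java
import java.io.StringReader;
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.Attributes;
import org.xml.sax.InputSource;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical sketch using plain SAX, NOT the GXPARSE API.
// A <link ref="..."/> may appear before the <section id="..."/> it points to,
// so references are buffered and only resolved once the document has ended.
final class DeferredRefs extends DefaultHandler {
    private final Map<String, String> titlesById = new HashMap<>();
    private final List<String> pendingRefs = new ArrayList<>();
    final List<String> output = new ArrayList<>();

    @Override
    public void startElement(String uri, String localName, String qName, Attributes atts) {
        if (qName.equals("section")) {
            titlesById.put(atts.getValue("id"), atts.getValue("title"));
        } else if (qName.equals("link")) {
            pendingRefs.add(atts.getValue("ref")); // target may not be known yet
        }
    }

    @Override
    public void endDocument() {
        // All ids are known now, so every buffered reference can be resolved.
        for (String ref : pendingRefs) {
            output.add(ref + " -> " + titlesById.getOrDefault(ref, "?"));
        }
    }

    // Parses the XML text and returns one resolved line per link, in document order.
    static List<String> resolve(String xml) {
        DeferredRefs handler = new DeferredRefs();
        try {
            SAXParserFactory.newInstance().newSAXParser()
                    .parse(new InputSource(new StringReader(xml)), handler);
        } catch (Exception e) {
            throw new IllegalStateException("could not parse input", e);
        }
        return handler.output;
    }

    public static void main(String[] args) {
        String xml = "<doc><link ref=\"s2\"/><section id=\"s2\" title=\"Later\"/></doc>";
        resolve(xml).forEach(System.out::println);
    }
}
```

Buffering costs memory and delays the first line of output, which is the price of being able to resolve forward references in a single pass.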
Grammatica is a parser generator (compiler compiler) for C# and Java. It improves upon similar tools (like yacc and ANTLR) by creating well-commented and readable source code, by having automatic error recovery and detailed error messages, and by supporting testing and debugging of grammars without generating source code. Grammatica supports LL(k) grammars with an unlimited number of look-ahead tokens.