EXIficient is an implementation of the W3C Efficient XML Interchange (EXI) format specification written in the Java programming language. The EXI format is a very compact representation for the Extensible Markup Language (XML) Information Set that is intended to simultaneously optimize performance and the utilization of computational resources.
Restless is yet another lightweight markup processor; it generates HTML from plain text with non-intrusive markup of headings, bullet lists, and so on. But unlike other such tools, restless aims to do less: it does not use cryptic punctuation to control advanced formatting, and instead uses heuristics to classify, for example, code samples versus text. It can be used from the command line or on demand as an Apache filter.
Steev's HTML Parser is an HTML parsing library that builds a complete hierarchy for each element and attribute in the supplied HTML file. Each element is its own C++ class, replete with child nodes, allowing for full control and processing. An 'HTML beautifier' example is included.
Java Text Table Formatter is a set of classes to help print text in tabulated form. You can organize 'cells' of data into rows and columns. Each cell can contain multiple lines of text, and may specify a horizontal alignment (left, center, right) and a vertical alignment (top, center, bottom).
xhtml2pdf converts HTML/XHTML/XHML to PDF using the ReportLab Toolkit, the HTML5lib, and pyPdf. It supports HTML 5 and CSS 2.1 (and some of CSS 3). The main benefit of this tool is that a user with Web skills like HTML and CSS is able to generate PDF templates very quickly without learning new technologies.