Libxml2 is the XML C library developed for the Gnome project. The library code is portable (to Linux, Unix, Windows, embedded systems, etc.) and modular; most of the extensions can be compiled out. Libxml2 implements a number of existing standards related to markup languages, including the XML standard, Namespaces in XML, XML Base, Relax NG, RFC 2396, XPath, XPointer, HTML4, XInclude, SGML Catalogs, and XML Catalogs. In most cases, libxml tries to implement the specifications in a relatively strict way. To some extent, it provides support for the following specifications, but doesn't claim to implement them: DOM, FTP client, HTTP client, and SAX2. Support for W3C XML Schemas is in progress. It includes xmllint, a command line XML validator.
Libxslt is a C library for GNOME which allows developers to work with XSLT. It is based on libxml for XML parsing, tree manipulation, and XPath support. Also included is 'xsltproc', a command line XSLT processor. The library is written in plain C, making as few assumptions as possible, and sticking closely to ANSI C/POSIX for easy embedding. It should work on Linux, Unix, and Windows. Though not designed primarily with performances in mind, libxslt seems to be a relatively fast processor. It also include full support for the EXSLT set of extension functions as well as some common extensions present in other XSLT engines.
Ganglia is a scalable distributed monitoring system for high-performance computing systems such as clusters and grids. It is based on a hierarchical design targeted at federations of clusters. Ganglia is currently in use on over 500 clusters around the world and has scaled to handle clusters with 2000 nodes.
uni2ascii and ascii2uni provide conversion in both directions between UTF-8 Unicode and more than thirty 7-bit ASCII equivalents, including RFC 2396 URI format and RFC 2045 Quoted Printable format, the representations used in HTML, SGML, XML, OOXML, the Unicode standard, Rich Text Format, POSIX portable charmaps, POSIX locale specifications, and Apache log files. It can also convert between the escapes used for Unicode in languages such as Ada, C, Common Lisp, Java, Pascal, Perl, Postscript, Python, Scheme, and Tcl.
RefDB is a reference database and bibliography tool for SGML, XML, and LaTeX documents. Command-line tools allow interactive or scriptable access to the data which are stored in a SQL database. RefDB can also be accessed through a Web interface, a SRU interface, or via editor extensions (Emacs/vim). Libraries for Perl and PHP are available for programmers. RefDB provides sophisticated character encoding handling, using Unicode by default.
Vilistextum is a small and fast HTML to text converter. It is quite fault-tolerant and deals well with badly-formed or otherwise quirky HTML. It has full support for different character sets (e.g. Unicode). It is able to optimize for ebook reading, collapse multiple blank lines, and create footnotes out of links. A GUI frontend using kaptain is included.
The Project Gutenberg Markup Tool is a command-line tool with a GUI front-end that automatically creates an HTML or LaTeX file from a Project Gutenberg etext. The aim is to provide publication-quality formatted etexts, without manual markup, in conjunction with post-processing by other pre-existing tools. It is tailored specifically to Project Gutenberg etexts, but can in some cases be used for other plain-ASCII etexts.
HSC (HTML Sucks Completely) is a developer tool for HTML projects. It is mainly intended for creating and maintaining larger Web sites, with a focus on compact and standards-conforming code. Its source texts are HTML files that use an extended syntax allowing for macros, conditionals, variables, expressions, etc. These sources are transformed into static (X)HTML pages. HSC checks the result for many common errors such as missing end tags and nesting mistakes, both in the markup and, to a limited extent, in CSS. As a commandline tool, it can be used together with "make", and a dependency generator is included. The macro library features things like navigation menus, LaTeX-style sections, footnotes and bibliographies, semiautomatic table-based layout, and database/scripting interfaces.