Libxml2 is the XML C library developed for the Gnome project. The library code is portable (to Linux, Unix, Windows, embedded systems, etc.) and modular; most of the extensions can be compiled out. Libxml2 implements a number of existing standards related to markup languages, including the XML standard, Namespaces in XML, XML Base, Relax NG, RFC 2396, XPath, XPointer, HTML4, XInclude, SGML Catalogs, and XML Catalogs. In most cases, libxml tries to implement the specifications in a relatively strict way. To some extent, it provides support for the following specifications, but doesn't claim to implement them: DOM, FTP client, HTTP client, and SAX2. Support for W3C XML Schemas is in progress. It includes xmllint, a command line XML validator.
Libxslt is a C library for GNOME which allows developers to work with XSLT. It is based on libxml for XML parsing, tree manipulation, and XPath support. Also included is 'xsltproc', a command line XSLT processor. The library is written in plain C, making as few assumptions as possible, and sticking closely to ANSI C/POSIX for easy embedding. It should work on Linux, Unix, and Windows. Though not designed primarily with performances in mind, libxslt seems to be a relatively fast processor. It also include full support for the EXSLT set of extension functions as well as some common extensions present in other XSLT engines.
NetCrawler is the frontend to a Web crawling system. This command line application will download all of the pages within a domain, and then parse and process all of the relative content (Images, Text, Audio, Video), saving this content within an XML document for later processing. It is definitely alpha quality, but has been used quite extensively.
(g)WebcppUI is a Gtk+ port of WebcppUI, an interface for Webcpp and MkScs2, providing a unified user environment for creating webcpp commands and designing webcpp syntax color schemes. Webcpp is a utility which converts programmer's source code into syntax highlighted HTML code. It supports over 30 programming languages.
xpath2rss is an XPath to RSS scraper. XPath makes a better HTML scraper than regex (the typical solution) because it understands the structure of the document, rather than just treating it as a big string. As a result, xpath2rss is a more reliable scraper, and much easier to use, once you get the hang of XPath.
pylize is a Python script that makes the creation of on-screen presentations a matter of a few minutes. It generates a template master document, which can be edited with a text or HTML editor. The master document is then processed by pylize to generate HTML files for every slide plus a file for the table of contents. The presentation can be viewed with any CSS-capable Web browser. pylize is inspired by the Perl program PLies.
Integratis is a Web development framework that supports quick scripting and selective optimization of interactive Web sites. It is implemented in C using Pthreads and SysV shared memory. It features a multi-threaded application server, CGI and commandline clients that directly access objects in shared memory, an intelligent, designer-friendly HTML parser that automatically pre-populates forms, rewrites query strings, and executes server-side scripts in multiple scripting languages, an object-relational mapping layer, a built-in scripting language and a single OO framework that allows classes to be implemented in multiple scripting languages or as shared libraries.
Silva is a CMS for organizations that manage multiple or complex Web sites. Content is stored in clean XML, independent of layout and presentation. Features include versioning, a workflow system, an integral visual editor, content reuse, sophisticated access control, multi-site management, extensive import/export facilities, fine-grained templating, and hi-res image storage and manipulation. Silva is built on top of the Zope Web application platform.