NetCrawler is the frontend to a Web crawling system. This command-line application downloads all of the pages within a domain, then parses and processes all of the relevant content (images, text, audio, video), saving this content in an XML document for later processing. It is definitely alpha quality, but has been used quite extensively.
c_xml is a class that can convert between XML files and arrays or strings. It can parse an XML file into a DOM document object, which can then be returned as an array or displayed using an XSL stylesheet. The parsed XML document can also be manipulated to add, change, or remove nodes. Nodes may be searched using XPath. The resulting document can be regenerated in XML format.
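The parse, search, modify, and regenerate cycle described above can be sketched as follows. This is a minimal illustration using Python's standard xml.etree.ElementTree module, not c_xml's own API; the sample document and the to_dict helper are hypothetical, and ElementTree supports only a limited subset of XPath.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample document, used only for illustration.
SAMPLE = (
    "<catalog>"
    "<book id='1'><title>A</title></book>"
    "<book id='2'><title>B</title></book>"
    "</catalog>"
)

def to_dict(elem):
    """Convert an element into a nested dict (the 'array' view of the document)."""
    return {
        "tag": elem.tag,
        "attrib": dict(elem.attrib),
        "text": (elem.text or "").strip(),
        "children": [to_dict(child) for child in elem],
    }

root = ET.fromstring(SAMPLE)

# Array-style view of the parsed document, taken before any modification.
as_array = to_dict(root)

# Search with an XPath-style expression (ElementTree's limited subset).
second = root.find(".//book[@id='2']")

# Change a node.
second.find("title").text = "B, revised"

# Add a node.
new_book = ET.SubElement(root, "book", {"id": "3"})
ET.SubElement(new_book, "title").text = "C"

# Remove a node (the element must be a direct child of root).
root.remove(root.find("./book[@id='1']"))

# Regenerate the document as an XML string.
regenerated = ET.tostring(root, encoding="unicode")
print(regenerated)
```

The nested dict is only one possible array representation; a real converter would also have to decide how to handle mixed content and namespaces.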
Silva is a CMS for organizations that manage multiple or complex Web sites. Content is stored in clean XML, independent of layout and presentation. Features include versioning, a workflow system, an integral visual editor, content reuse, sophisticated access control, multi-site management, extensive import/export facilities, fine-grained templating, and hi-res image storage and manipulation. Silva is built on top of the Zope Web application platform.
Sirix is a versioned tree-based storage system. It provides snapshot isolation through MVCC and is best suited for flash-based storage. The tree encoding is suited for heavy update operations. It provides diffing algorithms that either import the differences between already shredded (imported) XML documents in order to update a tree structure, or compute the differences between versions of a tree structure. A GUI provides several visualizations for comparing imported tree structures.
JaxMe is a Java/XML binding framework based on SAX2. It consists of a set of code generators that read an XML schema and generate code for parsing conformant XML documents into corresponding Java objects and saving those objects into a database or, vice versa, reading such Java objects from a database and converting them into XML. JaxMe supports namespaces, relational databases, and Tamino. JaxMe comes with an integrated application framework and a generator for EJB entity beans with BMP (bean-managed persistence).
XML parser class is a PHP class that parses arbitrary XML input and builds an array with the structure of all tag and data elements. Optionally, it can keep track of the position of each element in order to locate elements that may be contextually in error. It supports a cache of parsed files to minimize the overhead of parsing the same file repeatedly. It also offers optimized parsing of simplified XML (SML) formats, which ignores tag attributes.
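The idea of building a nested structure of tag and data elements while recording each element's position can be sketched as follows. This is a minimal illustration using Python's standard xml.parsers.expat module, not the PHP class itself; the function name parse_with_positions and the layout of the resulting structure are assumptions made for this example.

```python
import xml.parsers.expat

def parse_with_positions(xml_text):
    """Parse XML into nested dicts, recording where each start tag begins."""
    parser = xml.parsers.expat.ParserCreate()
    holder = {"tag": None, "children": []}  # artificial container for the root
    stack = [holder]

    def start(name, attrs):
        node = {
            "tag": name,
            "attrs": attrs,
            # Position of the start tag as reported by expat.
            "line": parser.CurrentLineNumber,
            "column": parser.CurrentColumnNumber,
            "children": [],
        }
        stack[-1]["children"].append(node)
        stack.append(node)

    def end(name):
        stack.pop()

    def data(text):
        # Keep non-blank character data as plain strings among the children.
        if text.strip():
            stack[-1]["children"].append(text.strip())

    parser.StartElementHandler = start
    parser.EndElementHandler = end
    parser.CharacterDataHandler = data
    parser.Parse(xml_text, True)
    return holder["children"][0]

doc = "<a>\n  <b>hi</b>\n</a>"
tree = parse_with_positions(doc)
print(tree["tag"], tree["line"])
print(tree["children"][0]["tag"], tree["children"][0]["line"])
```

With positions stored on every node, a later validation pass can report the line of an element that appears in the wrong context without reparsing the input.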
Ganglia is a scalable distributed monitoring system for high-performance computing systems such as clusters and grids. It is based on a hierarchical design targeted at federations of clusters. Ganglia is currently in use on over 500 clusters around the world and has scaled to handle clusters with 2000 nodes.