XSD is a W3C XML Schema to C++ translator. Provided with an XML instance specification (XML Schema), it generates C++ classes that represent the given vocabulary as well as parsing and serialization code. You can then access the data stored in XML using types and functions that semantically correspond to your application domain rather than dealing with elements, attributes, and text in a direct representation of XML such as DOM or SAX. XSD features support for in-memory and stream-oriented processing models, comprehensive XML Schema feature coverage, easy integration, and more.
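The data-binding idea described above can be illustrated with a small sketch. This is not XSD output or its API: the `Person` class, its fields, and the sample XML are hypothetical, and the class is hand-written here in Python only to stand in for the kind of typed class a binding generator would produce from a schema.

```python
import xml.etree.ElementTree as ET
from dataclasses import dataclass


@dataclass
class Person:
    """Hypothetical vocabulary type; a generator would derive this from a schema."""
    id: int
    name: str
    age: int

    @classmethod
    def from_xml(cls, text):
        # Parsing code: map elements and attributes onto typed fields.
        root = ET.fromstring(text)
        return cls(
            id=int(root.get("id")),
            name=root.findtext("name"),
            age=int(root.findtext("age")),
        )

    def to_xml(self):
        # Serialization code: turn the typed object back into XML text.
        root = ET.Element("person", id=str(self.id))
        ET.SubElement(root, "name").text = self.name
        ET.SubElement(root, "age").text = str(self.age)
        return ET.tostring(root, encoding="unicode")


person = Person.from_xml('<person id="7"><name>Ada</name><age>36</age></person>')
person.age += 1          # work with domain types, not elements and text nodes
xml_out = person.to_xml()
```

Application code touches only `person.name` and `person.age`; the element and attribute handling stays inside the binding class.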
The Okapi project’s main purpose is to provide a set of building blocks for creating larger open source localization and translation tools. Many Okapi components, however, are generic enough to be of interest to the text mining, natural language processing, and text retrieval communities. Okapi’s many text filters (HTML, Properties, XML (ITS XPath-based rules), OpenXML, ODF, Regex, etc.) provide a straightforward way to access the text of multiple document formats. Its document events and pipeline can be integrated with other frameworks such as UIMA, LingPipe, OpenPipeline, OpenNLP, GATE, and Lucene. The advantage of Okapi’s text filters is that they not only extract text but also preserve all non-textual formatting. It is possible to decompose a document into events, process them via the pipeline, and then rebuild the input document without loss. Structural information can be added to Okapi document events so that tables, lists, links, titles, etc. are grouped together and treated as a unit. This is useful when context based on a “universal” document structure is needed. The Okapi event model supports user-configurable annotations, similar to UIMA's but simpler and more restricted in scope. Users can annotate spans of text or add new resources such as translation memory matches, terminology, token types, or part-of-speech information.
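The decompose, process, and rebuild cycle can be sketched in a few lines. The following is a simplified illustration in Python and does not use Okapi's actual classes: `Event`, `to_events`, `uppercase_step`, and `rebuild` are hypothetical names, and a naive regular expression stands in for a real format filter.

```python
import re
from dataclasses import dataclass


@dataclass
class Event:
    kind: str  # "text" for extractable content, "skeleton" for preserved markup
    data: str


TAG = re.compile(r"(<[^>]+>)")  # naive tag matcher, adequate only for this sketch


def to_events(doc):
    """Filter: split a document into text and skeleton events, dropping nothing."""
    for piece in TAG.split(doc):
        if piece:
            yield Event("skeleton" if TAG.fullmatch(piece) else "text", piece)


def uppercase_step(events):
    """Pipeline step: transform text events and pass skeleton events through."""
    for ev in events:
        yield Event(ev.kind, ev.data.upper()) if ev.kind == "text" else ev


def rebuild(events):
    """Writer: concatenate events back into a document."""
    return "".join(ev.data for ev in events)


doc = "<p>Hello <b>world</b>!</p>"
round_trip = rebuild(to_events(doc))               # identical to doc
processed = rebuild(uppercase_step(to_events(doc)))
```

Because the skeleton events carry the markup verbatim, an untouched event stream rebuilds the input exactly, and a processing step only needs to look at the text events.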
jSmaTeP assists in the use of Java for processing import and export data by configuring a data structure rather than by programming it. The structure of the import data is specified in an XML file. jSmaTeP then generates a value object representing exactly one row or record in the import file based on a given XML data configuration. This means that if the import or export format changes, only the XML data configuration needs to be changed to match it.
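As a rough illustration of configuration-driven record parsing, the sketch below reads a record layout from XML and turns one line of import data into a typed record. It is written in Python rather than Java, and the configuration format (`<record>`, `<field>`, `delimiter`, `type`) is invented for this example; it is not jSmaTeP's actual schema.

```python
import xml.etree.ElementTree as ET

# Hypothetical configuration format, assumed for this sketch only.
CONFIG_XML = """
<record delimiter=";">
  <field name="id" type="int"/>
  <field name="name" type="str"/>
  <field name="price" type="float"/>
</record>
"""

CONVERTERS = {"int": int, "str": str, "float": float}


def load_config(xml_text):
    """Read the delimiter and the ordered (name, converter) pairs from the XML."""
    root = ET.fromstring(xml_text)
    fields = [(f.get("name"), CONVERTERS[f.get("type")]) for f in root.findall("field")]
    return root.get("delimiter"), fields


def parse_row(line, config):
    """Build one record (a dict here) from one line of the import file."""
    delimiter, fields = config
    parts = line.rstrip("\n").split(delimiter)
    if len(parts) != len(fields):
        raise ValueError(f"expected {len(fields)} fields, got {len(parts)}")
    return {name: convert(raw) for (name, convert), raw in zip(fields, parts)}


config = load_config(CONFIG_XML)
row = parse_row("42;Widget;9.5", config)
```

If the import format gains a column or changes its delimiter, only `CONFIG_XML` changes; `parse_row` stays the same.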
GPlates offers a novel combination of interactive plate-tectonic reconstructions, geographic information system (GIS) functionality, and raster data visualization. GPlates enables both the visualization and the manipulation of plate-tectonic reconstructions and associated geological, geophysical, and paleo-geographic data through geological time.
CairoForPHP is an application that demonstrates several functions of the Cairo implementation for PHP. The same source runs both as a desktop application with PHP-GTK and as a Web application, showing how PHP-GTK can serve as "just one" method of output within multi-platform, multi-environment applications. The samples in CairoForPHP are fully adjustable, and code can easily be dropped into the non-Phar versions; all required additional data interfaces are XML-based. The integrated samples come both pre-rendered and rendered live, demonstrating both Cairo's capabilities and how PHP-GTK uses Cairo to realize them. CairoForPHP relies on several technologies to achieve this, including XML-based data interfaces (with XML Schema validation), PHP-GTK, Phar compatibility, and Gettext integration, among others.