DSML is the Directory Services Markup Language, an XML dialect for working with directory information. The DSML Tools provide for the querying of any LDAP directory (with search results output as DSML), the importing of DSML data into any LDAP directory, the directory-context validation of DSML (checking for illegal attributes in the entries, etc.), and the calculation of the differences (for a directory) between two DSML documents to provide an XML Diff algorithm for DSML data. This software makes all LDAP-supporting directories DSML-enabled. It can also check the integrity of DSML data, and show at a glance how two data sets, represented as DSML, differ.
flex.rb is a regexp matching and scanning library for the Ruby language. It is more than 3 times faster than Ruby regexps, and supports matching text arriving in multiple parts (via async, non-blocking I/O). flex.rb embeds the GNU Flex 2.5.4 (fast lexical analyzer generator) as an engine, and all of Flex's functionality is accessible from Ruby scripts.
FreeMarker is a template engine that was originally designed so that servlet-based applications could keep graphical design separate from application logic. The templates provide an easy and highly flexible way to generate any kind of text output (HTML, PostScript, TeX, source code, etc.) from a variety of data sources such as Java objects, Jython objects, XML object models, and more.
GPP is a general-purpose preprocessor with customizable syntax, suitable for a wide range of preprocessing tasks. Its independence from any programming language makes it much more versatile than cpp, while its syntax is lighter and more flexible than that of m4. The syntax is fully customizable, which makes it possible to process text files, HTML, or source code equally efficiently in a variety of languages.
Grok is a library of Java components for performing various natural language tasks. These include several preprocessing tasks, chart parsing, a large categorial grammar for English (induced from the Penn treebank), and some knowledge representation components (basic coreference, salience tracking, etc.). The library also has a companion kit which provides a GUI interface to the components, several of which are implementations of interfaces in the Quipu OpenNLP API.
Hodie prints the current date and time to stdout in Roman numerals, with grammatically correct Latin. Complete with Id., Kal., Non., pridie, postridie, bis, and all the other nice annoyances. As an option, it even provides you with current date according to Roman calendar -- that is 'ab urbe condita'; after Rome was built.