The Okapi project’s main purpose is to architect a set of building blocks for the creation of larger open source localization and translation tools. But many Okapi components are generic enough to be of interest to the text mining, natural language processing, and text retrieval communities. Okapi’s many text filters (HTML, Properties, XML (ITS XPath-based rules), OpenXML, ODF, Regex etc.) provide a straightforward way to access the text of multiple document formats. Its document events and pipeline can be made to integrate with other frameworks such as UIMA, LingPipe, OpenPipeline, OpenNLP, GATE, and Lucene. The advantage of Okapi’s text filters is that not only is text extracted, but all non-textual formatting is preserved. It is possible to decompose a document into events, process them via the pipeline, and then rebuild the input document without loss. Structural information can be added to Okapi document events so that tables, lists, links, titles etc. are grouped together and treated as a unit. This is useful when context based on a “universal” document structure is needed. The Okapi event model supports user configurable annotations, similar to UIMA, but simpler and more restricted in scope. User can annotate spans of text or add new resources such as translation memory matches, terminology, token types, or part of speech information.
CMS Master is an easy-to-use tool for Web content management. It is completely Web-based with a WYSIWYG editor and an intuitive user interface. It allows you to create and update site content, manage the site structure, menu, page order, images, and files on your site, install new modules, and more.
Molinos CMS is a modular CMS. It supports multiple Web sites per installation, multiple domains per Web site, and different subdomains for different Web site sections. It uses XSLT for templating; templates can be organized in themes, which can be applied to either all pages or particular pages. It can use MySQL or SQLite for data stoarge; all data is available in XML format, ready to use by templates and scripts. User groups (roles) and advanced ACL can be used to control access to that data. It supports many caching and code optimization techniques. Data storage is optimized to minimize queries required to display pages. There are modules for integrating with various third-party services (over 70 modules). Modules are simple to write using the provided documentation (currently only available in Russian). Web sites can use local and system modules simultaneously. It works with most Web servers, and has a built-in Web server (for localhost development). It also has a CLI for most tasks.
The Epeios MIDI to XML converter makes it possible to convert binary MIDI files to a format that allows them to be handled effectively with tools like CVS. If you put, using your favorite MIDI sequencer, the CVS '$Id$' tag (or other CVS tags) in the comment tag of the MIDI file, the changes made to this tag by CVS are propagated into the MIDI file.