The Cainteoir Engine is a library for reading aloud and recording documents in different formats (ePub, HTML, MHT, RTF, email, and others) to various audio outputs (such as PulseAudio, WAV, and Ogg/Vorbis). It also provides the following command-line tools: cainteoir, a front-end to the Cainteoir text-to-speech library; metadata, which extracts metadata from documents to RDF tuples; and tagcloud, which generates tag clouds and tag cloud data.
CWIS (Collection Workflow Integration System) is a software package designed to help assemble, organize, and share information about resources online. The software conforms to international and academic standards for metadata while providing turnkey setup and a user-friendly interface, allowing resource collection developers (and end users) to focus on what they want to share (or find) without worrying about the technical details. It provides extensive support for cataloging resources with standardized metadata, which can then be shared automatically with the wider Internet community through an array of channels, helping others find your work.
Mayan EDMS is a document manager Web application with custom metadata indexing, file serving integration, and OCR capabilities. It features user-defined metadata fields, dynamic default values for metadata, lookup support for metadata, filesystem integration by means of metadata indexing directories, user-defined document UUID generation, local file or server-side staging file uploads, batch uploading of many documents with the same metadata, user-defined document checksum algorithms, previews for many image formats as well as PDF, document OCR and searching, automatic grouping of documents by metadata, permissions and roles support, multi-page document support, page transformations, distributed OCR processing, and support for multiple languages.