LogicalDOC is a Web-based document management system that is easy to use and learn. Its architecture leverages best-of-breed Java technology to achieve a powerful and flexible solution. It supports its users with a powerful search engine (Lucene), Web service interface (JAX-WS via CXF) compatible with .NET and PHP, versioning, annotation on documents, a WebDAV interface, importing and exporting from .zip files. Documents can be organized into hierarchical folders, searched using the integrated search engine, or browsed by Tag. The system is extensible thanks to the technologies used (Spring-Hibernate) and its plugin architecture.
Nuxeo Platform provides a framework and set of components to address document management and collaboration needs, including metadata/taxonomies, versioning, lifecyle management, workflow, relations, searching, reporting, transformation, auditing, and retention. Its flexible extension system, based on OSGi, allows developers to quickly configure and extend the platform by creating new components. Its default Web user interface, based on the JSF standard, uses AJAX to create a pleasant user experience. It can also be accessed by a rich client interface through the use of Web services, for instance using the Eclipse-based Nuxeo RCP rich client platform.
Stunnix Advanced Web Server is a Web server for putting Web sites onto CD-ROMs, DVDs, FlashDisks, and other readonly media to be used on any typical system. Sites can be scripted with PHP, Python, Java, or Perl and can be driven by MySQL or SQLite databases. All required software programs are designed to reside on the CD-ROM to avoid the need to install software on the user's computer. The Web serving core is based on Apache 1.3.x, and has additional functionality to detect and use free ports and automatically start a browser on each platform. Desktop applications can also be created with it.
PDFTextStream is a PDF text and metadata extraction library available for Java and .NET. It supports all versions of the PDF document specification (including v1.7, used by Acrobat 8, 9, and X), extraction of text encoded using double-byte character sets (including Chinese, Japanese, and Korean), decryption of documents encrypted using 40-bit, 128-bit, 256-bit, and variable bit length ciphers, and extraction of all document metadata provided by PDF documents (including form data, bookmarks, and annotations). Easy integration with Jakarta Lucene is included, as well as interactive form update capability.
The metric implemented in the Image::Density Perl Module estimates the density of data where there is data, and has a reasonable correlation with goodness as judged by humans. That is, if you let a human look at a set of images and judge quality, the density values for those images as calculated here tend to correlate well with the human judgement (densities that are too high or too low represent "bad'' images). It is intended for use on bitonal TIFF images, such as those from scanning paper documents.
EditLive! is a cross-platform, browser-based Web content editor with a Word-like WYSIWYG interface. Key features include a live spell checker and advanced table and nested list support. It produces content that complies with Section 508 and W3C accessibility, key XHTML and CSS standards. It is designed for use with Web content management, knowledge management, and e-learning applications. Integrations for major CMS platforms and business solutions including IBM Workplace Web Content Management, Vignette, EMC (Documentum), Percussion, FileNet, Open Text, Ektron, Ingeniux, Stellent, vCampus, and Schoolwires are also available.
Berkeley DB XML is a native XML database engine for use within your product. Made available as a C++ library with language bindings for Java, Perl, Python, PHP, and Tcl, it integrates directly into your application (it is not a standalone database server). It provides XQuery access into a database of document containers. XML documents are stored and indexed in their native format using Berkeley DB as the transactional database engine.
Werc is a minimalistic RESTful Web application framework and content management system. It follows the Unix "tool philosophy" and it is designed to be fast, simple, convenient, and easily extensible. It handles both small and big sites and has a flexible system for user and group permissions. All data is stored in plain text files that can be easily manipulated with standard tools, without using any databases or other external dependencies. Existing applications include a blogging engine with RSS/Atom feeds, a wiki system that can easily integrate pre-existing documents (can be enabled for any directory tree), and others.