OrientDB is a NoSQL DBMS which can store 150,000 documents per second on common hardware. Even with a document-based database, the relationships are managed as in graph databases, with direct connections among records. You can traverse entire or parts of trees and graphs of records in a few milliseconds. It supports schema-less, schema-full, and schema-mixed modes, has a strong security profiling system based on users and roles, and supports SQL between the query languages. Thanks to the SQL layer, it's straightforward to use for people skilled in the relational world.
Dove is an application that facilitates the distribution of documents to a variety of destination types such as email, local files, FTP, FTPS, SFTP, TFTP, Samba servers, Windows network shared drives, and WebDAV servers. Being an abstraction layer over previously enumerated protocols, it allows sending of documents to email or to a WebDAV server with equal ease.
LuSql is a command line Java application for the construction of a Lucene index from an arbitrary SQL query of a JDBC-accessible SQL database. It allows a user to control a number of parameters, including the SQL query to use, individual indexing/storage/term-vector nature of fields, analyzer, stop word list, and other tuning parameters. In its default mode, it uses threading to take advantage of multiple cores. LuSql can handle complex queries, allows for additional per record sub-queries, and has a plug-in architecture for arbitrary Lucene document manipulation.
SCAN is a personal information retrieval framework, combining search, text analysis, tagging, and metadata functions for document collections management. SCAN is a component-based software using a number of plugins for specific features. The basic SCAN platform can be easily extended with plugins for different document formats and document location types.
Apache Jackrabbit is a fully conforming implementation of the Content Repository for Java Technology API (JCR). A content repository is a hierarchical content store with support for structured and unstructured content, full text searching, versioning, transactions, observation, and more. Typical applications that use content repositories include content management, document management, and records management systems.
Graffito is a framework used to build content-based applications like CMSs, document management systems, forums, blogs, etc. It offers a complete platform for creating, managing, and publishing content in your portal or in any other kind of Java application. It integrates content repositories, workflow, collaboration, and personalization via existing open source projects and standards like WEBDAV.