Dowser is a Web research and archiving tool that clusters results from search engines, associates words that appear in previous searches, and keeps a local cache of all the results you click on in a searchable database along with summaries and links to related information. It helps you to keep track of what you find, with no advertising.
DocMGR is a document management system that incorporates automatic indexing of uploaded files, automatic OCR and content indexing of images, group-level permissions, LDAP authentication, email notifications, WebDAV, and a discussion board for stored files. Beyond its stock indexing subsystem, DocMGR can also use Tsearch2 (a full-text indexing add-on for PostgreSQL) to provide responsive full-text file indexing.
tag-not-ed is a system that allows you to create and manage text documents by attaching tags to them. Later, documents can be retrieved by running queries on those tags (e.g., "show me all docs that deal with 'dogs' and 'cats'"). It is composed of a front-end (currently a mode for the jed text editor) and an indexer. The latter can be used to implement a rudimentary "tagging file system".
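The tag query described above ("all docs that deal with 'dogs' and 'cats'") amounts to intersecting the sets of documents attached to each tag. The following is a minimal sketch of that idea, not tag-not-ed's actual indexer: the `TagIndex` class, its method names, and the file names are hypothetical and chosen only for illustration.

```python
from collections import defaultdict


class TagIndex:
    """Hypothetical in-memory index mapping each tag to the documents that carry it."""

    def __init__(self):
        self._docs_by_tag = defaultdict(set)

    def add(self, doc, tags):
        """Attach every tag in `tags` to the document name `doc`."""
        for tag in tags:
            self._docs_by_tag[tag.lower()].add(doc)

    def query_all(self, *tags):
        """Return the documents that carry every one of the given tags."""
        if not tags:
            return set()
        # .get() avoids creating empty entries for unknown tags.
        sets = [self._docs_by_tag.get(tag.lower(), set()) for tag in tags]
        return set.intersection(*sets)


# Illustrative data: file names and tags are made up.
index = TagIndex()
index.add("pets.txt", ["dogs", "cats"])
index.add("kennel.txt", ["dogs"])
index.add("litter.txt", ["cats"])

print(sorted(index.query_all("dogs", "cats")))  # ['pets.txt']
```

An "or" query would use a set union in place of the intersection.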
WAscii is a Web frontend for displaying an AsciiDoc documentation repository. It allows you to search and browse your documentation files and automatically converts AsciiDoc to HTML, PDF, and ODF documents. It is intended to work directly from a Subversion repository containing your AsciiDoc files.
SCAN is a personal information retrieval framework that combines search, text analysis, tagging, and metadata functions for managing document collections. It is component-based software in which plugins provide specific features. The basic SCAN platform can be extended with plugins for different document formats and document location types.
LuSql is a command-line Java application for constructing a Lucene index from an arbitrary SQL query against a JDBC-accessible SQL database. It allows a user to control a number of parameters, including the SQL query to use, the indexing, storage, and term-vector settings of individual fields, the analyzer, the stop word list, and other tuning parameters. In its default mode, it uses threading to take advantage of multiple cores. LuSql can handle complex queries, allows additional per-record subqueries, and has a plug-in architecture for arbitrary Lucene document manipulation.
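The general idea of turning the rows of an SQL query into a searchable index can be sketched in a few lines. The example below is not LuSql and does not use Lucene or JDBC: it substitutes Python's built-in `sqlite3` module and a plain in-memory inverted index, and the table, the query, the stop word list, and the `build_index` function are all assumptions made for illustration.

```python
import re
import sqlite3
from collections import defaultdict

# Illustrative stop word list; a real deployment would choose its own.
STOP_WORDS = {"a", "an", "the", "of", "and"}


def build_index(conn, sql, stop_words=STOP_WORDS):
    """Map each term to the set of record ids whose text contains it.

    Assumes the query returns (id, text) rows.
    """
    index = defaultdict(set)
    for record_id, text in conn.execute(sql):
        # Very simple "analyzer": lowercase, then split into alphanumeric runs.
        for term in re.findall(r"[a-z0-9]+", text.lower()):
            if term not in stop_words:
                index[term].add(record_id)
    return index


# Hypothetical sample database held in memory.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE articles (id INTEGER PRIMARY KEY, title TEXT)")
conn.executemany(
    "INSERT INTO articles (id, title) VALUES (?, ?)",
    [
        (1, "The History of Search"),
        (2, "Search and Indexing"),
        (3, "A Guide to Lucene"),
    ],
)

index = build_index(conn, "SELECT id, title FROM articles")
print(sorted(index["search"]))  # [1, 2]
```

Changing the SQL string changes which records and columns are indexed, which is the kind of control the description above attributes to the query parameter.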