PDFTextStream is a PDF text and metadata extraction library available for Java and .NET. It supports all versions of the PDF document specification (including v1.7, used by Acrobat 8, 9, and X), extraction of text encoded using double-byte character sets (including Chinese, Japanese, and Korean), decryption of documents encrypted using 40-bit, 128-bit, 256-bit, and variable bit length ciphers, and extraction of all document metadata provided by PDF documents (including form data, bookmarks, and annotations). Easy integration with Jakarta Lucene is included, as well as interactive form update capability.
Epiware is an AJAX-enabled Project and Document Management application that provides access to a complete set of file management functions, including document check-in, check-out, version control, approval, change notification, and access history. It enables the creation of virtual teams by providing an online workspace for users to collaborate and exchange information in a secure, protected setting.
nexB OpenAssets is a tool for inventorying, managing, and monitoring applications, software, hardware, networks, and generally any IT asset. It is designed so that system administrators, IT, and finance can determine what they have, how it is configured, what it is used for, and how much it is being used, so that informed decisions can be made. It complements existing network management software, integrates with a growing number of protocols and tools, and features no-agent discovery and inventory, configuration management including dependencies and correlation, monitoring, and reporting. It makes extensive and innovative use of XML, Xpath, and Xquery.
DoXFS (pron. docs-eff-ess) is a document management system that uses the XFS filesystem to store both content (files) and meta-data (file attributes), bypassing the traditional architecture of filesystem and database, and leveraging the built-in advantages of XFS. Currently, it offers a Web frontend based on PHP and standard management functionality (create, delete, attributes add/delete, etc.) as well as extras such as versioning and Full Text Indexing (using Namazu).
eContent is a Web-based content management system for creating information systems, intranets, B2B, B2C, catalogs, and vertical portals. Written in Java for scalability, and based on open standards Struts and Expresso for stability, eContent integrates content management, scalable content and application delivery, resource management, workflow and personalization. It supports executables, all documents types, OLAP reports, Java programs, and legacy integration.
Contineo is a Web-based document management system. It assists its users by managing documents in most popular formats. Contineo aims to fulfill all phases of the document lifecycle. You can create and develop documents by using office software. With contineo itself, you can publish, search, and manage the versions of documents. Furthermore, you can communicate with other users directly or via email.
Werc is a minimalistic RESTful Web application framework and content management system. It follows the Unix "tool philosophy" and it is designed to be fast, simple, convenient, and easily extensible. It handles both small and big sites and has a flexible system for user and group permissions. All data is stored in plain text files that can be easily manipulated with standard tools, without using any databases or other external dependencies. Existing applications include a blogging engine with RSS/Atom feeds, a wiki system that can easily integrate pre-existing documents (can be enabled for any directory tree), and others.
Glossword is a system to publish dictionaries, glossaries, and encyclopedias. It features an installation wizard, support for multiple languages, visual themes, multi-domain installation, an administrative interface with multi-user support, built-in search and cache engines, the ability to export/import dictionaries in XML format, and W3C-validated code. Glossword is useful for any sort of dictionary-like content, including sites with game cheat codes, online translators, references, and various kinds of CMS solutions.