SILVERCODERS DocStorage is a utility to improve document management. You can have one database for all invoices, guarantees, protocols, and other documents. DocStorage can extract plain text from documents in doc, XLS, PPT, PDF, RTF, ODT, ODS, ODP, docx, XLSX, PPTX, and many other formats. It can use an OCR engine to extract plain text even from scanned documents. It can perform global fulltext search in all documents regardless of format. It supports document versioning, document duplicate detection, document notes, and document signing. It provides full integration with software suites like Microsoft Office and OpenOffice.
SILVERCODERS OCR Server is a server-based optical character recognition (OCR) and PDF conversion solution for enterprises. It is able to perform conversion of printed documents to editable and searchable formats like plain text, RTF, PDF, and HTML, providing highly accurate recognition in 189 languages. It is available as a Linux application or a stand-alone machine, with a fully documented API, very good performance, and flexible licensing rules. It has been designed specifically for the purpose of cooperation with document management systems such as SILVERCODERS DocStorage.
smupcheck, which stands for Smart Update Checker, checks Web sites for updates automatically, even if they don't offer an RSS feed. It is a very basic tool, and does not offer advanced features such as checking password-protected Web sites, highlighting changes, or filtering results.
LingNUX is a dictionary for French students of the Russian language. Both languages can be easily entered without affecting the AZERTY French keyboard settings at the system level. It reads dictionary files with the DSL extension, which is one of the formats used by the famous Russian dictionary "ABBYY Lingvo" under Windows.
SILVERCODERS DocToText is a powerful utility which can convert documents in many formats to plain text. It includes a console application and C/C++ library, which allows embedding text extraction mechanisms into other applications. It supports MS Office binary formats (MS Word (DOC), MS Excel (XLS), MS PowerPoint (PPT), and Rich Text Format (RTF)), OpenDocument formats (text documents (ODT), spreadsheets (ODS), and presentations (ODP)), Office Open XML formats (MS Word (DOCX), MS Excel (XLSX), and MS PowerPoint (PPTX)), and HyperText Markup Language (HTML). DocToText can extract text not only from the document body but also from annotations (comments) embedded in odt, doc, docx, or rtf files and read metadata like author, last modification date, or number of pages. It can be used as a fast console viewer, and is able to convert corrupted OpenDocument and Office Open XML documents. It can be used to recover text even if other recovery methods failed.
Foxtrot is a full text indexing software for PDF, OpenOffice.org 1 and 2, MS Word, and XLS files. The packge provides two different frontends: a Google-like searching tool implemented with Perl-Gtk and a PHP-based Web interface. The backend scans directories asynchronously, converts files to text, and indexes them in a MySQL database.
DOCMGR is a document management system that incorporates automatic indexing of uploaded files, automatic OCR and content indexing of pictures, group-level permissions, LDAP authentication, email notifications, WebDAV, and a discussion board for stored files. Beyond its stock indexing subsystem, DocMGR also has the ability to incorporate Tsearch2 (a full-text indexing add-on for PostgreSQL) for a responsive, full-text file indexing system.