fakedbfs is a suite of tools and a library for cataloguing files. Anything that is a file can be catalogued along with metadata. The metadata is then searchable through various querying facilities. It runs entirely in userland and was inspired by BFS, which indexes only metadata, not file contents.
File Selection Language is a descriptive language for file selection. It is used to selectively pick files from a directory structure. FSL uses glob patterns as the basic building block. For fine-tuning the selection, inclusion and exclusion rule combinations and conditional expressions are available. File size and modification date can be used in expressions.
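The selection model described above (glob patterns, combined inclusion and exclusion rules, and conditions on file size and modification date) can be illustrated with a minimal Python sketch. This is not FSL syntax and not part of the FSL project; the names `select_files` and `larger_and_newer_than` are hypothetical and exist only for this illustration.

```python
import fnmatch
import os
import time


def larger_and_newer_than(min_bytes, max_age_days):
    """Build a condition on file size and modification date (hypothetical helper)."""
    def condition(st):
        age_seconds = time.time() - st.st_mtime
        return st.st_size > min_bytes and age_seconds < max_age_days * 86400
    return condition


def select_files(root, include, exclude=(), condition=None):
    """Return the sorted relative paths under root that pass all rules.

    A file is selected when it matches at least one include glob,
    matches no exclude glob, and satisfies the optional condition.
    Note that fnmatch's "*" also matches "/" in a path.
    """
    selected = []
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            # Use "/" as the separator so patterns behave the same everywhere.
            rel = os.path.relpath(path, root).replace(os.sep, "/")
            if not any(fnmatch.fnmatchcase(rel, pat) for pat in include):
                continue
            if any(fnmatch.fnmatchcase(rel, pat) for pat in exclude):
                continue
            if condition is not None and not condition(os.stat(path)):
                continue
            selected.append(rel)
    return sorted(selected)
```

For example, `select_files(".", ["*.txt"], exclude=["tmp/*"], condition=larger_and_newer_than(1000, 7))` would pick text files outside `tmp/` that are larger than 1000 bytes and were modified within the last week.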
ECMerge compares and merges files and folders, side by side or 3-way. It provides syntax coloring for C/C++, Java, HTML, ASP, PHP, and many other languages. It can save your work at an intermediate state and resume it later. It can generate Unix patches and XML/HTML reports of the differences between files. A shell extension is provided on Windows. PDF, Word, and ZIP documents can be compared using converters.
Areca is a file backup system that supports data compression (zip / zip64 format) and encryption, incremental backups, FTP/SFTP file transfer, delta storage mode, and many other features. It includes a transaction mechanism, which guarantees the integrity of your backups. Two user interfaces are available: a command-line interface (useful for backup automation) and a graphical user interface (useful for backup administration).
SILVERCODERS DocToText is a utility that converts documents in many formats to plain text. It includes a console application and a C/C++ library, which allows text extraction to be embedded in other applications. It supports MS Office binary formats (MS Word (DOC), MS Excel (XLS, XLSB), MS PowerPoint (PPT), and Rich Text Format (RTF)), OpenDocument formats (text documents (ODT), spreadsheets (ODS), presentations (ODP), and graphics (ODG)), Office Open XML formats (MS Word (DOCX), MS Excel (XLSX), and MS PowerPoint (PPTX)), iWork formats (PAGES, NUMBERS, KEYNOTE), OpenDocument Flat XML formats (FODP, FODS, FODT), Portable Document Format (PDF), email files (EML), and HyperText Markup Language (HTML). DocToText can extract text not only from the document body but also from annotations (comments) embedded in ODT, DOC, DOCX, or RTF files, and it can read metadata such as the author, last modification date, or number of pages. It can be used as a fast console viewer and is able to convert corrupted OpenDocument and Office Open XML documents, so it can recover text even when other recovery methods have failed.