Solr-Connector-Files crawls and indexes directories and files from your filesystem (anything that can be mounted on Linux) into Apache Solr. It extracts file contents with Tika, which pulls metadata and text from many document and file formats. It also integrates automatic text recognition (OCR) for images, photos, and PDFs using Tesseract OCR.
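The crawl step described above can be pictured with a short Python sketch. It is not taken from Solr-Connector-Files: the function name `crawl` and the field names `path`, `size_bytes`, and `extension` are assumptions made for illustration, and the sketch stops before content extraction with Tika, OCR, or any upload to Solr.

```python
import os


def crawl(root):
    """Yield one dict per file under root, shaped like a document to index.

    The field names are hypothetical; a real index schema would define its own.
    """
    for dirpath, dirnames, filenames in os.walk(root):
        for name in sorted(filenames):
            path = os.path.join(dirpath, name)
            yield {
                "path": path,
                "size_bytes": os.path.getsize(path),
                "extension": os.path.splitext(name)[1].lower(),
            }
```

Each yielded dict stands in for one document; a real connector would add the extracted text and metadata before sending it to the index.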
Tomb is a system that makes strong encryption easy for everyday use. A tomb is like a locked folder that can be safely transported and hidden in a filesystem. Its keys can be kept separate; for example, you can keep the tomb on your computer and its key on a USB stick. Tomb is written in code that is easy to review and builds on shared components: it consists of a Z shell script and desktop integration apps, and it uses standard GNU tools and the cryptographic API of the Linux kernel (dm-crypt) via cryptsetup.
The pylib library features two tools: py.test, the popular command-line tool for rapidly writing and running tests in Python, and py.execnet, a new way to do ad-hoc distribution of Python code to remote processes. py.test runs well on Linux, Windows, and OS X using Python 2.4 through 2.6. It can distribute a single test run to multiple machines. It is used in many projects, ranging from runs of tens of thousands of tests integrated with buildbot to a few inlined tests in a command-line script.
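As a rough illustration of what "rapidly writing tests" looks like, here is a minimal sketch of a py.test-style test file. The helper `slugify` and the file name used below are hypothetical and exist only for this example; the tests are ordinary functions whose names start with `test_` and that use plain `assert` statements.

```python
# Hypothetical helper under test; not part of pylib.
def slugify(title):
    """Lowercase a title and join its words with hyphens."""
    return "-".join(title.lower().split())


# py.test collects functions whose names start with "test_".
def test_slugify_joins_words_with_hyphens():
    assert slugify("Hello World") == "hello-world"


def test_slugify_collapses_extra_whitespace():
    assert slugify("  Many   spaces here ") == "many-spaces-here"
```

Assuming the file is saved as test_slugify.py, running `py.test test_slugify.py` should collect and run both tests.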