S3QL is a file system that stores all its data online. It supports Amazon S3, Google Storage, and OpenStack and effectively provides you with a hard disk of dynamic, infinite capacity that can be accessed from any computer with Internet access. S3QL provides a standard, full featured Unix file system that is conceptually indistinguishable from any local file system. Additional features include compression, encryption, data de-duplication, immutable trees, and snapshotting, which make it especially suitable for online backup and archiving. The design favors simplicity and elegance over performance and feature-creep. Care has been taken to make the source code as readable and serviceable as possible. Solid error detection, error handling, and extensive automated test cases are provided.
Shoelacer attempts to generate a pair of small C functions that compress or decompress a set of data (primarily short strings) presented to it, and possibly other data of that kind. The data it is given may consist of entries of just a few bytes or longer. Its goal is to be reasonably fast with a tiny memory overhead. Compression performance is not so much the goal, although it certainly isn't ignored.
storeBackup is a backup utility that stores files on other disks. It's able to compress data, and recognize copying and moving of files and directories (deduplication), and unifies the advantages of traditional full and incremental backups. It can handle big image files with block-wise changes efficiently. Depending on its contents, every file is stored only once on disk. Tools for analyzing backup data and restoring are provided. Once archived, files are accessible by mounting file systems (locally, or via Samba or NFS). It is easy to install and configure. Additional features are backup consistency checking, offline backups, and replication of backups.
Various archive formats can be created, extracted, tested, listed, searched, compared, and repacked by patool. The advantage of patool is its simplicity in handling archive files without having to remember myriad programs and options. The archive format is determined by the file(1) program and as a fallback by the archive file extension. patool supports 7z (.7z), ACE (.ace), ADF (.adf), ALZIP (.alz), APE (.ape), AR (.a), ARC (.arc), ARJ (.arj), bzip2 (.bz2), CAB (.cab), COMPRESS (.Z), CPIO (.cpio), deb (.deb), DMS (.dms), FLAC (.flac), gzip (.gz), ISO (.iso), LRZIP (.lrz), LZH (.lha, .lzh), LZIP (.lz), LZMA (.lzma), LZOP (.lzo), RPM (.rpm), RAR (.rar), RZIP (.rz), SHN (.shn), tar (.tar), XZ (.xz), zip (.zip, .jar), and ZOO (.zoo) formats. It relies on helper applications to handle those archive formats (for example bzip2 for BZIP2 archives). The archive formats tar, zip, bzip2, and gzip are supported natively and do not require helper applications to be installed.
lzo-java is a Java port of liblzo2, a portable lossless data compression library. It provides block and stream interfaces to the lzo1x, lzo1y, and lzo1z algorithms, and it can handle raw and lzop file formats. The implementation is designed for high performance reading and writing, and simplicity of integration. It is integrated with Hadoop for high performance, and transparent compression is included.
Pipa reads data from stdin and does various things with it: rotates by time interval / output file size; compresses it (gz, bz2, lzo, xz); names the output file with strftime(); splits it intelligently when given info about the file structure (newlines / PCAP); calls a user command on the finished output file; passes it to stdout (like tee); and more.
GNU `tar' saves many files together into a single tape or disk archive, and can restore individual files from the archive. It includes multivolume support, the ability to archive sparse files, automatic archive compression/decompression, remote archives and special features that allow `tar' to be used for incremental and full backups. It also includes `rmt', the remote tape server (the `mt' tape drive control program is in GNU `cpio').
Lessfs is a high performance inline data deduplicating file system for Linux. Lessfs complies to the POSIX standard and is very useful for backup purposes as well as providing storage for virtual machine images. Although lessfs is a file system that is implemented in user space with FUSE, it offers decent performance. Lessfs is capable of handling data rates up to 350MB/sec. It supports filesystem encryption.
Multicrush is a wrapper around pngcrush that distributes the work of brute-force compressing a single image over several pngcrush processes. This gets you a slightly less than a linear speedup, caused by different compression methods having slightly different durations. For example, the author has measured a speedup of 1.92 on a 2-core Intel T5300, and 3.86 on a 4-core Intel Q6600. The only requirements are a copy of pngcrush and at least Python 2.4. If you have version 2.6 or higher, multicrush can automatically detect how many CPUs to use.