FVWM-min is a minimal standard configuration for FVWM which provides a sane and functional set of window manager defaults. It configures FVWM to a very clean and functional state, then gets out of your way so you can do your work. The basic idea is that after installation, it should be usable enough for you to start productive work in it immediately. Styles are included for customizing the look, and are applied with a small shell script.
The Courier Unicode Library implements several algorithms related to the Unicode Standard: look up of uppercase, lowercase, and titlecase equivalents of a Unicode character, grapheme and work breaking rules, line breaking rules, and several other related functions. The library also implements C++ bindings for these algorithms.
Harry is a small tool for comparing strings and measuring their similarity. It implements several common distance and kernel functions for strings, as well as some exotic similarity measures. For example, Harry supports the Levenshtein (edit) distance, the Jaro-Winkler distance, and the compression distance. Harry is implemented using OpenMP, so its runtime scales linearly with the number of available CPU cores. Efficient implementations and effective caching speed comparison of strings.
php_mt_seed finds possible seeds given the very first PHP mt_rand() output after possible seeding with mt_srand(). With advanced invocation modes, php_mt_seed is also able to match multiple, non-first, and/or inexact mt_rand() outputs to possible seed values. php_mt_seed is written in C with optional SIMD intrinsics (SSE4.1/AVX, XOP, AVX2, MIC) and OpenMP. On a modern quad-core CPU, it is able to search the full 32-bit seed space in one minute (or in just seven seconds on Xeon Phi).
Salad (short for Letter Salad) is an efficient and flexible implementation of the well-known anomaly detection method Anagram by Wang et al. (RAID 2006). Salad is based on n-gram models, that is, data is represented as all of its substrings of length n. During training these n-grams are stored in a Bloom filter. This enables the detector to represent a large number of n-grams in little memory and still being able to efficiently access the data. Salad extends Anagram by allowing various n-gram types, a 2-class version of the detector for classification, and various model analysis modes.
Shasplit takes a large data block, splits it into smaller parts, and puts those parts into an SHA-based content-addressed store. Reassembling those parts is a trivial "cat" invocation. Repeating parts (e.g., from previous split operations) are stored only once, which allows efficient incremental backups of whole LVM snapshots via Rsync. Shasplit shows its strengths on encrypted block devices, but might be useful for non-encrypted data, too.
iCan is an intelligent, community-driven command line that helps you to automate any kind of task arising in your daily Linux use. It is extensible through simple Lua modules, which run in a sandbox environment. It is designed to learn what you do and make things easier for you, every time you do something.