Sanzang is a compact and simple cross-platform machine translation system. It is especially useful for translating from the CJK languages (Chinese, Japanese, and Korean), and it is very suitable for working with ancient and otherwise difficult texts. Unlike most other machine translation systems, Sanzang is small and approachable. Any user can develop his or her own translation rules, and these rules are simply stored in a text file and applied at runtime.
papaya CMS is a Web Content Management System based on open standards (including XML, XSLT, PHP, and MySQL/PostgreSQL). It is compatible with almost every operating system, is platform-independent, is multi-lingual, offers great usability, and is easy to extend via its plugin system. It is scalable and perfect for business websites.
cronbot is a wrapper script for cron jobs. Rather than running your job/script directly, you run it through cronbot, and it will allow a little more control to help keep things from getting out of hand. The default behavior is to only allow one instance of your job to run. This can be changed with the --duplicate option. You can also pass in a maximum time of execution for a job or add a random amount time to delay the start of your job. By default, jobs are run with a default "nice" value, which is usually 10.
Weed-FS is a simple and highly scalable distributed file system. There are two objectives: to store billions of files, and to serve the files fast! Instead of supporting full POSIX file system semantics, it implements only a key-file mapping. Instead of managing all file metadata in a central master, it manages file volumes in the central master and lets volume servers manage files and the metadata. This relieves concurrency pressure from the central master and spreads file metadata into volume servers' memories, allowing faster file access with just one disk read operation. It is modelled on Facebook's Haystack design paper. Only 40 bytes of disk storage are required for each file's metadata, and disk reads are O(1).
dhcpy6d delivers IPv6 addresses for DHCPv6 clients, which can be identified by DUID, hostname, or MAC address, as in the good old IPv4 days. Addresses may be generated randomly, by range, or by arbitrary ID or MAC address. Clients can get more than one address, leases and client configuration can be stored in databases, and DNS can be updated dynamically.
Jaro Mail is an integrated suite of interoperable tools for GNU/Linux and Apple/OS X to manage email communication in a private way, without relying on online services (in fact, encouraging users to store email locally). It mainly features a system for mailinglist filtering, address-book integrated whitelisting, and long-term email storage and archiving with functions for fast search and usage statistics. It makes internal use of customized versions of popular Unix tools like Mutt, Fetchmail, Procmail, Mairix, and Msmtp.
txtorcon is a Twisted-based asynchronous Tor control protocol implementation. Twisted is an event-driven networking engine written in Python, and Tor is an onion-routing network designed to improve people’s privacy and security on the Internet. It includes unit-tests with 96%+ coverage, multiple examples, and documentation. There are abstractions to track Tor configuration and state (circuits, streams), launch private instances, support Hidden Service, and more.