Bayesian Spam Filter is a class that detects spam in text messages using Bayesian techniques. It analyzes text as n-grams, which makes it language-independent. It can be trained incrementally to distinguish spam from legitimate messages by detecting patterns in training samples. Training data is stored in a MySQL database.
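The n-gram approach described above can be illustrated with a minimal sketch. This is not the class's actual code or API: the names `char_ngrams` and `NGramBayes` are hypothetical, the n-grams are assumed to be character trigrams, the smoothing is plain Laplace smoothing, and counts are kept in memory rather than in MySQL.

```python
import math
from collections import Counter


def char_ngrams(text, n=3):
    """Return the overlapping character n-grams of the normalized text."""
    text = " ".join(text.lower().split())  # lowercase, collapse whitespace
    return [text[i:i + n] for i in range(len(text) - n + 1)]


class NGramBayes:
    """Toy naive Bayes classifier over character n-grams (hypothetical)."""

    def __init__(self, n=3):
        self.n = n
        self.counts = {"spam": Counter(), "ham": Counter()}
        self.messages = {"spam": 0, "ham": 0}

    def train(self, text, label):
        """Add one labeled sample; label is 'spam' or 'ham'."""
        self.counts[label].update(char_ngrams(text, self.n))
        self.messages[label] += 1

    def _log_score(self, grams, label):
        # Laplace-smoothed log prior for the class.
        total_messages = sum(self.messages.values())
        score = math.log((self.messages[label] + 1) / (total_messages + 2))
        # Laplace-smoothed log likelihood of each n-gram given the class.
        vocab = len(set(self.counts["spam"]) | set(self.counts["ham"]))
        total = sum(self.counts[label].values())
        for gram in grams:
            score += math.log((self.counts[label][gram] + 1) / (total + vocab + 1))
        return score

    def is_spam(self, text):
        grams = char_ngrams(text, self.n)
        return self._log_score(grams, "spam") > self._log_score(grams, "ham")


clf = NGramBayes()
clf.train("win free money now", "spam")
clf.train("free money prize claim now", "spam")
clf.train("meeting at noon tomorrow", "ham")
clf.train("see you at lunch tomorrow", "ham")
print(clf.is_spam("claim your free money"))
print(clf.is_spam("lunch meeting tomorrow"))
```

Because the features are raw character sequences rather than dictionary words, the same code works on any language without a tokenizer or word list.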
UDUNITS, the Unidata units C library, supports conversion of unit specifications between formatted and binary forms, arithmetic manipulation of unit specifications, and conversion of values between compatible scales of measurement. As of version 2, the unit database is encoded using human-readable XML.
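As a rough illustration of converting values between compatible scales, the sketch below models each unit as a scale and offset relative to a base unit. It does not use the UDUNITS API or its XML database: the `UNITS` table and the `convert` function are hypothetical and written only for this example.

```python
# Hypothetical unit table: name -> (dimension, scale, offset), where
# value_in_base_unit = value * scale + offset.
UNITS = {
    "meter": ("length", 1.0, 0.0),
    "kilometer": ("length", 1000.0, 0.0),
    "kelvin": ("temperature", 1.0, 0.0),
    "celsius": ("temperature", 1.0, 273.15),
    "fahrenheit": ("temperature", 5.0 / 9.0, 459.67 * 5.0 / 9.0),
}


def convert(value, from_unit, to_unit):
    """Convert value between two units that share a dimension."""
    from_dim, from_scale, from_offset = UNITS[from_unit]
    to_dim, to_scale, to_offset = UNITS[to_unit]
    if from_dim != to_dim:
        raise ValueError(f"cannot convert {from_unit} to {to_unit}")
    base = value * from_scale + from_offset  # go to the base unit first
    return (base - to_offset) / to_scale     # then to the target unit


print(convert(2.5, "kilometer", "meter"))
print(convert(100.0, "celsius", "fahrenheit"))
```

Units in different dimensions (for example `meter` and `kelvin`) are rejected, which mirrors the "compatible scales" restriction in the description.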
Cybercluster is a multi-master replication solution for PostgreSQL. It has no single point of failure and includes an integrated load balancer, an integrated replication server, a monitoring server (to track replication), a consistency checker, an optional two-phase commit for cluster-wide writes, and support for all PostgreSQL features. Cybercluster is easy to set up. In contrast to many other solutions, Cybercluster is built on top of the PostgreSQL core and has full access to PostgreSQL internals. Cybercluster is heavily based on the concepts of PGCluster.
Hoc, the High Order Calculator, is an interpreted language for floating-point calculations. Its most basic use is as a powerful and convenient calculator, interactively evaluating expressions such as 1+2*sin(0.7). But hoc is no ordinary calculator: it also lets you assign values to variables, define your own functions, and use loops, conditionals, and everything else you'd expect in a programming language.
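To give a feel for what evaluating an expression like 1+2*sin(0.7) with variables involves, here is a small sketch of a calculator. It is not hoc and does not follow hoc's grammar: it borrows Python's own expression syntax through the standard `ast` module, and the `evaluate` function and its supported operations are assumptions made for this example.

```python
import ast
import math
import operator

# Supported binary operators and built-in functions (an arbitrary subset).
_BIN_OPS = {
    ast.Add: operator.add,
    ast.Sub: operator.sub,
    ast.Mult: operator.mul,
    ast.Div: operator.truediv,
    ast.Pow: operator.pow,
}
_FUNCS = {"sin": math.sin, "cos": math.cos, "sqrt": math.sqrt,
          "exp": math.exp, "log": math.log}


def _eval(node, variables):
    """Recursively evaluate one expression node to a float."""
    if isinstance(node, ast.Constant) and isinstance(node.value, (int, float)):
        return float(node.value)
    if isinstance(node, ast.Name):
        return variables[node.id]
    if isinstance(node, ast.BinOp) and type(node.op) in _BIN_OPS:
        left = _eval(node.left, variables)
        right = _eval(node.right, variables)
        return _BIN_OPS[type(node.op)](left, right)
    if isinstance(node, ast.UnaryOp) and isinstance(node.op, ast.USub):
        return -_eval(node.operand, variables)
    if (isinstance(node, ast.Call) and isinstance(node.func, ast.Name)
            and node.func.id in _FUNCS and not node.keywords):
        args = [_eval(arg, variables) for arg in node.args]
        return _FUNCS[node.func.id](*args)
    raise ValueError("unsupported expression")


def evaluate(source, variables):
    """Evaluate an expression or a `name = expression` assignment."""
    result = None
    for stmt in ast.parse(source).body:
        if (isinstance(stmt, ast.Assign) and len(stmt.targets) == 1
                and isinstance(stmt.targets[0], ast.Name)):
            result = _eval(stmt.value, variables)
            variables[stmt.targets[0].id] = result  # remember the variable
        elif isinstance(stmt, ast.Expr):
            result = _eval(stmt.value, variables)
        else:
            raise ValueError("unsupported statement")
    return result


session = {}
print(evaluate("1+2*sin(0.7)", session))
evaluate("x = 3", session)
print(evaluate("x*2 + sqrt(16)", session))
```

Walking the syntax tree instead of calling `eval` keeps the calculator restricted to arithmetic, named variables, and the listed functions.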
MPIBZIP2 is a parallel implementation of the bzip2 block-sorting file compressor that uses MPI and achieves significant speedup on cluster machines. The output of this version is fully compatible with bzip2 1.0.2 or newer (i.e. anything compressed with MPIBZIP2 can be decompressed with bzip2).
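The compatibility claim can be illustrated with a small sketch, under the assumption that the input is split into blocks, each block is compressed independently into its own bzip2 stream, and the streams are concatenated. This is not MPIBZIP2's code: Python threads stand in for MPI processes, and `parallel_bzip2` and its parameters are hypothetical names.

```python
import bz2
from concurrent.futures import ThreadPoolExecutor


def parallel_bzip2(data, block_size=900_000, workers=4):
    """Compress fixed-size blocks independently and concatenate the streams."""
    blocks = [data[i:i + block_size] for i in range(0, len(data), block_size)]
    if not blocks:
        return bz2.compress(b"")  # still emit a valid (empty) stream
    with ThreadPoolExecutor(max_workers=workers) as pool:
        # pool.map preserves input order, so blocks are joined in sequence.
        streams = list(pool.map(bz2.compress, blocks))
    return b"".join(streams)


original = b"hello world " * 200_000
compressed = parallel_bzip2(original)
# A standard decompressor reads the concatenated streams back to back.
restored = bz2.decompress(compressed)
print(restored == original)
```

Since each block is a complete bzip2 stream, no special decompressor is needed; the cost is a slightly larger output than compressing the data as one stream.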