Release Notes: This release fixes a few minor problems in 1.3x1 and improves the source code structure a bit more. It adds a new experimental feature. If configure detects a working libungif and gif_lib.h on the system, it will add support for extracting useful terms from GIF images in email messages. These terms can provide much needed data for email messages that contain nothing but headers and an image attachment.
Release Notes: This release adds the final missing pieces to the new parser code. MBX files and Content-Length headers are now supported. Database cleanup when signals are caught has also been improved.
Release Notes: This release adds exec and exec-shared commands to allow manipulation of the database using external programs. Other improvements include a compilation option to force big-endian data storage, Debian compatibility fixes, and minor bugfixes.
Release Notes: This release adds some minor changes, including performance tuning, an improved dump command that allows limiting the terms printed to those matching a regular expression, and support for using an external tokenizer instead of spamprobe's own.
Release Notes: This release improves the scoring algorithm to produce more evenly-distributed scores. It includes support for the X-Status: header, reduces I/O in train mode, and allows users to set their own spam score threshold.
Release Notes: A new tokenize command is provided. Several useful command line options were added, including the ability to specify specific headers for processing. Some minor bug fixes were included.
Release Notes: This release changes from using PBL's ISAM file format to using just its key-file format. The change reduces the size of PBL databases by more than half. This release also includes some changes to the way that top terms are selected when scoring emails. The change reduces the probability of getting false positives. The output of the -T command line option has changed to include the overall database good and spam counts for each term. It is important to note that the PBL file format has changed; see the README.txt file for details.
Release Notes: Not in time for Christmas but before the New Year, SpamProbe 0.8 is finally out the door. This release includes improved recall (correctly scoring a manually-classified email), improved phrase handling (including a -p option to set phrase lengths manually), and improved accuracy through special treatment of terms taken from the To, Cc, and Subject headers.
Release Notes: Several improvements were made, including faster HTML processing, more flexible URL detection, output of message digests with scores to allow later recognition of email even if it has been altered, a new summarize command, conversion from "mark and sweep" cleanup to date stamp based cleanup, and improved performance when processing many small mboxes.
Release Notes: This release fixes some bugs related to reading mbox files. New features include the use of MD5 digests to identify messages instead of relying on message-ids, a -T command line option to print out list of terms used to score each email, and break down of domain names into smaller parts for better precision.