Release Notes: This release is focused on the classification engine. A new, slightly faster default tokenizer and a reference measure for spam filtering were added (the old ones are still available). Support for token classes was implemented. Some miscellaneous bugs were also fixed.
Release Notes: This is a hodge-podge of fixes and improvements. A new hypex command, the TREC 2005 options files, and an essay on chess are now in the tarball. Several improvements to the parsing engine were made, including a new -e char option and bugfixes. Compilation problems on various architectures were fixed, and libslang2 support was added.
Release Notes: This release fixed some bugs, cleaned up the behaviour of the -w switch, changeed the "complexity" accounting algorithm, and improved the organization of the man page and tutorials.
Release Notes: This release includes various bugfixes and small usability improvements in the documentation and default switch handling. The major addition is support for the TREC spamjig and improved memory mapping for faster online learning.
Release Notes: This release added a new MAP confidence score (-U, to complement the -X switch), some new scoring types in mailinspect, and a new parsing switch for trace headers in email (-T email:theaders). Category learning now accepts directory names as well as file names, and preliminary work on a new header mining tool (hmine) was performed. Category files are now written in 'portable' format by default.
Release Notes: Many bugs were discovered and fixed. A test suite was added to prevent future regressions. It can be called using make check. Memory management was improved, giving a large speedup in classification speed, and a putative confidence score is now available via an -X switch. Some documentation changes were made.