similarity-utils is a set of two programs to give a quantitative measure of how similar two files are, on a scale 0 to 1. similarity_by_diff measures the number of difference lines reported by diff(1), while similarity_by_zlib tries compressing the two files, both separately and together, and comparing the results.
The Multivalent PDF Tools is a suite of tools for manipulating PDF documents. It includes tools for compressing, uncompressing (for hand editing), obtaining metadata, splitting and merging, encrypting and decrypting, validating, imposition (aka n-up), making page images, extracting text, and full-text indexing (with Lucene). The compress tool shrinks the PDF 1.5 Reference from 13.5MB to 8MB in PDF 1.5/Acrobat 6 format and down to 5.1MB in a new proposed "Compact" format.
QDPF was written to solve the problem of exposing internal services to the Internet. It runs on machines which bridge two networks and forwards TCP/IP packets from one to the other. Its main use is to expose corporate or Intranet services to the Internet, by running it on an intervening machine. TCP sessions can also be followed in detail using the 'trace' option. QDPF is a Java console application.
Absolut Engine is a PHP/MySQL framework built on the basis of a former news publishing system. It features 3 layer access, article posting, editing, and deleting. Included are a basic image and file manager as well as a related articles manager. Rich text editing is provided via a built-in WYSIWYG editor that produces XHTML 1.0 Strict compliant code. It is easy to extend via the module functionality. It is fully configurable and easy-to-use. Modules include discussions, surveys, RSS feed export and import, search engine optimization, a messaging system, weather reports, and Google Maps.
SMX is a cross-platform dynamic HTML generation language. It is a high level language that is easier to learn than PHP or Perl, but is just as powerful for Web page creation. It was originally designed for developing financial reporting software, and is still uniquely suited for that sort of customer-driven Web development. It features builtin support for ODBC queries, preformatted HTML, linking to Perl or PHP, and much more. It supports Basic authentication and Digest authentication.
Aephea is a text-based authoring tool for HTML. It enforces well-formedness with a simpler and stricter TeX-like syntax and provides useful extensions and abstractions with facilities for adding new ones. It emphasizes a single unified approach that stays close to HTML itself and promotes and utilizes CSS extensively. Abstractions such as dictionary stacks, arithmetic, and iteration are part of Aephea.
regexp-engine is a regexp engine written entirely in Ruby that makes it possible to search in text with advanced search patterns. It supports Perl 5 and some Perl 6 syntax and is fairly compatible with Ruby's native regexp engine, passing almost 98% of the Rubicon tests. The implementation is simple and contains no optimizations.
Absimpa is a Java class library that provides a recursive decent parser. It is an experiment in abstraction and Java generics. It tries to include only the essentials of what makes out a parser and to leave out, or rather leave free for specification, what is not relevant for the parsing process.