Mini-XML is a small XML parsing library that you can use to read XML and XML-like data files in your application without requiring large non-standard libraries. It requires only an ANSI C-compatible compiler (GCC works, as do most vendors' ANSI C compilers) and a "make" program. It reads UTF-8 and UTF-16 and writes UTF-8 encoded XML strings and files, and it provides a hierarchical view of the file via a linked-list tree structure of typed nodes, with functions for managing, traversing, indexing, and searching the tree.
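As a rough illustration of what a "linked-list tree of typed nodes" can look like, here is a minimal sketch in C. It is not Mini-XML's actual API or data layout: the names node, node_type, node_new, node_find, and node_free are hypothetical and chosen only for this example.

```c
#include <assert.h>
#include <stddef.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical node types for illustration; not Mini-XML's definitions. */
typedef enum { NODE_ELEMENT, NODE_TEXT } node_type;

/* Each node links to its parent, its first and last child, and its next
 * sibling, so the children of a node form a singly linked list. */
typedef struct node {
    node_type    type;
    const char  *value;        /* element name or text; not copied */
    struct node *parent;
    struct node *first_child;
    struct node *last_child;
    struct node *next;         /* next sibling, or NULL */
} node;

/* Allocate a node and append it to parent's child list (parent may be NULL). */
node *node_new(node *parent, node_type type, const char *value)
{
    node *n;

    assert(value != NULL);
    n = calloc(1, sizeof *n);
    if (n == NULL)
        return NULL;

    n->type = type;
    n->value = value;
    n->parent = parent;

    if (parent != NULL) {
        if (parent->last_child != NULL)
            parent->last_child->next = n;
        else
            parent->first_child = n;
        parent->last_child = n;
    }
    return n;
}

/* Depth-first search for the first element called name, starting at top. */
node *node_find(node *top, const char *name)
{
    node *child;
    node *hit;

    if (top == NULL)
        return NULL;
    if (top->type == NODE_ELEMENT && strcmp(top->value, name) == 0)
        return top;

    for (child = top->first_child; child != NULL; child = child->next) {
        hit = node_find(child, name);
        if (hit != NULL)
            return hit;
    }
    return NULL;
}

/* Free a whole tree. Call this on a root node only: it does not unlink
 * top from a parent's child list. */
void node_free(node *top)
{
    node *child;
    node *next;

    if (top == NULL)
        return;
    for (child = top->first_child; child != NULL; child = next) {
        next = child->next;
        node_free(child);
    }
    free(top);
}
```

In this sketch a caller would build a tree with repeated node_new calls, look up an element by name with node_find, and release everything with a single node_free on the root.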
Multi-byte Keyword Generator extracts meta keywords from multi-byte text. It is an enhanced version of the "Automatic Keyword Generator" class originally written by Ver Pangonilo. This version provides better word segmentation, the ability to handle multi-byte strings, and support for text in multiple languages.
Multivalent PDF Tools is a suite of tools for manipulating PDF documents. It includes tools for compressing, uncompressing (for hand editing), obtaining metadata, splitting and merging, encrypting and decrypting, validating, imposing (aka n-up), making page images, extracting text, and full-text indexing (with Lucene). The compress tool shrinks the PDF 1.5 Reference from 13.5MB to 8MB in PDF 1.5/Acrobat 6 format and down to 5.1MB in a new proposed "Compact" format.
The NCBI C++ Toolkit provides portable libraries and applications for assisting genetic science. These include libraries for networking, SQL and BerkeleyDB access, CGI and HTML handling, ASN.1 and XML handling, sequence alignment engines, sequence retrieval engines, BLAST database engines, FLTK and OpenGL graphics toolkits, and basic system utilities.
Namazu is a full-text search system intended for easy use. It works not only as a small- or medium-scale Web search engine but also as a personal search system for email or other files. Supported document types are HTML, Mail/News, MHonArc, RFC, TeX (with detex), man (with groff), Word (with wvWare), PDF (with pdftotext), and plain text.
Net::Z3950::SimpleServer is a Perl module that implements the server side of the Z39.50 (information retrieval) protocol. It hides the complexity of network exchanges, packet serialization, and session handling, so you only need to implement simple callbacks to support searching and record retrieval. It is the basis of the "Zoogle" project, which is a Z39.50 gateway to the Google web index.
Ninive is an XML socket server. It accepts TCP connections from clients capable of building, sending, and interpreting XML buffers coded according to the internal XML Ninive Protocol (XNP). Its main purpose is to provide file operations to software applications that cannot directly read or write a file at a given location, e.g. because of security policies. Through the XML Ninive Protocol, a generic client can ask to read a file, read a directory's contents, or write a file in a specific location; Ninive performs the request and sends its response to the client. It can be useful as a filesystem interface for applications written in Macromedia Flash.
No-install Unix Tools for Windows is a collection of 18 of the most useful Cygwin tools packaged into a single 1.1MB (floppy sized) EXE file that runs without any installation process. Simply running the executable instantly creates a Unix environment for Windows, without making any persistent changes to the underlying system. It includes bash, less, gzip, grep, find, df, du, ls, cat, cp, mv, ps, rmdir, mkdir, rm, pwd, and tar.