dbacl is a digramic Bayesian text classifier. Given some text, it calculates the posterior probabilities that the input resembles one of any number of previously learned document collections. It can be used to sort incoming email into arbitrary categories such as spam, work, and play, or simply to distinguish an English text from a French text. It fully supports international character sets, and uses sophisticated statistical models based on the Maximum Entropy Principle.
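As a rough illustration of what "posterior probabilities" over learned collections means, the following is a minimal sketch of a Bayesian text classifier. It is not dbacl's code or its model: it assumes a much simpler unigram model with add-one smoothing in place of dbacl's digramic maximum entropy models, and the names `train` and `posteriors` are hypothetical.

```python
import math
from collections import Counter

def train(docs):
    """Count word frequencies for one category from a list of strings."""
    counts = Counter()
    for doc in docs:
        counts.update(doc.lower().split())
    return counts

def posteriors(text, models, priors=None):
    """Return {category: P(category | text)}.

    Assumption: unigram model with add-one smoothing, uniform priors
    unless given. This is a stand-in for illustration, not dbacl's model.
    """
    words = text.lower().split()
    vocab = set(words)
    for counts in models.values():
        vocab.update(counts)
    v = len(vocab)

    log_scores = {}
    for name, counts in models.items():
        total = sum(counts.values())
        prior = priors[name] if priors else 1.0 / len(models)
        score = math.log(prior)
        for w in words:
            # Smoothed likelihood of each word under this category.
            score += math.log((counts[w] + 1) / (total + v))
        log_scores[name] = score

    # Normalize in log space (log-sum-exp) to avoid underflow.
    m = max(log_scores.values())
    norm = m + math.log(sum(math.exp(s - m) for s in log_scores.values()))
    return {name: math.exp(s - norm) for name, s in log_scores.items()}

models = {
    "english": train(["the cat sat on the mat", "the dog is in the house"]),
    "french": train(["le chat est sur le tapis", "le chien est dans la maison"]),
}
result = posteriors("le chat est dans la maison", models)
```

Here `result` maps each category name to a probability, and the values sum to one; the category with the largest value is the best match for the input text.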
mod_musicindex is an Apache module aimed at being a C alternative to the Perl module Apache::MP3. It presents directories containing MP3, Ogg Vorbis, FLAC, or MP4/AAC files in a clean listing, and supports sorting them on various fields, streaming or downloading them, constructing playlists, and searching. It also provides features such as RSS and Podcast feeds, multiple CSS support, and archive downloads.
Doodle is a desktop search engine for Linux. It searches your hard drive for files using pattern matching on meta-data. It extracts file-format specific meta-data using libextractor and builds a suffix tree to index the files. The index can then be searched rapidly. It is similar to locate, but can take advantage of information such as ID3 tags. Full-text indexing is possible with the appropriate libextractor plugins. It also supports using FAM to keep the database up to date.
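The idea of indexing meta-data by suffix so that any substring can be found quickly can be sketched as follows. This is not Doodle's implementation: it assumes a sorted list of keyword suffixes (closer to a suffix array) as a simplified stand-in for a suffix tree, and `build_index`, `search`, and the sample meta-data are hypothetical.

```python
import bisect

def build_index(metadata):
    """Build a sorted list of (suffix, filename) pairs.

    metadata: {filename: [keyword, ...]}, e.g. as extracted from tags.
    """
    entries = set()
    for filename, keywords in metadata.items():
        for kw in keywords:
            kw = kw.lower()
            for i in range(len(kw)):
                entries.add((kw[i:], filename))
    return sorted(entries)

def search(index, pattern):
    """Return the files whose meta-data contains pattern as a substring."""
    pattern = pattern.lower()
    # Every substring of a keyword is a prefix of one of its suffixes,
    # so all matches form one contiguous run in the sorted index.
    start = bisect.bisect_left(index, (pattern, ""))
    hits = set()
    for suffix, filename in index[start:]:
        if not suffix.startswith(pattern):
            break
        hits.add(filename)
    return sorted(hits)

# Hypothetical meta-data for three files.
index = build_index({
    "song.mp3": ["Beethoven", "Symphony"],
    "notes.txt": ["symphonic notes"],
    "photo.jpg": ["Vienna"],
})
matches = search(index, "phon")
```

A query such as `search(index, "phon")` finds both files whose keywords contain that substring, without scanning every keyword. A real suffix tree stores the same information more compactly than this list of explicit suffixes.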
pHoToMoLo is a toolset for managing JPEG images in JFIF or Exif format. Photomolo generates a static HTML 4.01- and CSS 1-based multi-resolution image gallery for a directory tree containing images. Exifiron losslessly compresses and corrects the orientation of Exif JPEG images. Jpegnail removes or embeds thumbnails. Jpegcom lets you edit embedded image comments using any text editor and image viewer. A shell script frontend for Exifiron is included for copying images from removable storage, such as USB or Compact Flash.
ConceptBase is a multi-user meta-database system based on a deductive object manager. It is mainly intended for conceptual modeling and coordination in design environments. The system implements O-Telos, a dialect of Telos that amalgamates properties of deductive and object-oriented languages.
libiptcdata is a C library for manipulating the International Press Telecommunications Council (IPTC) metadata stored within multimedia files such as images. This metadata can include captions and keywords, often used by popular photo management applications. The library provides routines for parsing, viewing, modifying, and saving this metadata. The libiptcdata package also includes Python bindings and a command-line utility, iptc, for viewing and editing IPTC data in JPEG files.
Generating new objects for the Query Object Framework is repetitive, tedious, and time-consuming. Qof Generator automates this process in PHP to build a working test program linked against QOF. Objects are created from an HTML form using a temporary MySQL cache and exported, together with a Makefile, ./autogen.sh, ChangeLog, README, C source code, and doxygen mark-up comments, in a tarball built by the PHP code.
Morla is an editor of RDF documents that allows you to manage multiple RDF documents simultaneously, visualize graphs, and use templates for quick writing. You can import RDFS documents and use their content to write new RDF triples. Templates are also RDF documents, and they make Morla easy to personalize and extend. You can also use Morla as an RDF navigator, browsing the RDF documents available on the Internet just as you would browse Web pages with an ordinary browser.
Evergreen is an integrated library system originally developed by the Georgia PINES consortium for use as their automation system. It now includes contributions from around the world. It was designed from scratch for large-scale deployment in very large public library and state-wide consortium environments with tens of millions of records and hundreds of libraries, but can also scale down to the smallest of single-branch libraries.