Haystack is a powerful tool designed to enable each and every individual manage all of her information in the way that makes the most sense. By removing the arbitrary barriers created by applications that only handle certain information "types", and recording only a fixed set of relationships defined by the developer, users can define whichever arrangements of, connections between, and views of information they find most effective. Such personalization of information management will dramatically improve your ability to find what you need when you need it.
Doodle is a desktop search engine for Linux. It searches your hard drive for files using pattern matching on meta-data. It extracts file-format specific meta-data using libextractor and builds a suffix tree to index the files. The index can then be searched rapidly. It is similar to locate, but can take advantage of information such as ID3 tags. It is possible to do full-text indexing using the appropriate libextractor plugins. It also supports using FAM to keep the database up-to-date.
K.E.T.T.L.E (Kettle ETTL Environment) is a meta- data driven ETL (Extraction, Transformation, Transportation, and Loading) tool. This means that no code has to be written to perform complex data transformations. It is possible to create plugins to do custom transformations or access proprietary data sources. Kettle supports most databases on the market, and has native support for slowly changing dimensions on most database platforms.
Autodia is a command line Perl application that generates UML class diagrams from source code, SQL, and database connections. It supports multiple programming languages including SQL, and can output images (using GraphViz/VCG/SpringGraph), Dia XML and Umbrello XML, or custom formats using templates.
Kraken is an application for managing knowledge objects, which can be documents, remote or locally cached Web pages, personal information, todo list items, appointments, and so on. It is especially useful for researchers or students to manage their information. Users can annotate these knowledge objects with metadata, perform complex queries, and present the results as HTML pages. Kraken uses RDF as its native format, allowing its data to be easily read by external applications.
XAware provides a solution for building real time data integrations and data services. It uses an Eclipse-based designer and a run-time engine implemented using the Spring Framework. XAware has extensive built-in support for database transactions, messaging systems, structured and unstructured text, XML schemas, and more.
Red-Piranha is a search system that can actually learn what you are looking for. It can be used as a Web page, command line, or XML-WebService, so it will work with most languages, including Java, Perl, C#/.NET, and PHP. It includes learning abilities for the Desktop/Internet search functionality. All feedback from the user is stored in (editable) XML and RDF, and is used by the system to improve the quality of searches.
The OMCSNet-WordNet project aims to improve the quality of the OMCSNet dataset by using automated processes to map WordNet synonym sets to OMCSNet concepts and import additional semantic linkage data from WordNet. It is based on OMCSNet 1.2, a semantic network and inference toolkit written in Python/Java. OMCSNet currently contains over 280,000 separate pieces of common sense information extracted from the raw OMCS dataset. This project is also based on WordNet, an online lexical reference system that in recent years has become a popular tool for AI researchers.
pHoToMoLo is a toolset for managing JPEG images in JFIF or Exif format. Photomolo generates a static HTML 4.01- and CSS 1-based multi-resolution image gallery for a directory tree containing images. Exifiron losslessly compresses and corrects the orientation of Exif JPEG images. Jpegnail removes or embeds thumbnails. Jpegcom lets you edit embedded image comments using any text editor and image viewer. A shell script frontend for Exifiron is included for copying images from removable storage, such as USB or Compact Flash.
The Video Performance Evaluation Resource (ViPER) is a toolkit of Java programs and scripts that enables evaluation of video processing algorithms. This includes a markup tool for video, a performance evaluation tool that compares how well result data matches ground truth video metadata, and scripts and systems for running sets of experiments.