RSS 6 projects tagged "Indexing/Search"

Download Website Updated 22 Dec 2001 XMLDB

Screenshot
Pop 53.94
Vit 1.77

XMLDB uses an RDBMS to persist arbitrary XML documents. Due to its storage mechanism, searching for and recalling documents is extremely quick. You can also perform XSL translation on documents with surprising speed. The library can be used in any program to store libxml2 documents. A PHP module is also included, making XMLDB into a complete three-tier Web application development suite.

Download Website Updated 15 May 2004 StringSearch

Screenshot
Pop 60.98
Vit 1.75

The StringSearch library provides implementations of algorithms of the Boyer-Moore family and the Shift-Or (bit-parallel) family, for use in Java programs that need fast string searching algorithms.

Download No website Updated 20 Feb 2004 GoldSeeker

Screenshot
Pop 22.00
Vit 1.00

GoldSeeker is a small formatted data extraction application. It can parse information from a text, HTML, or other file, and export it to a database.

Download Website Updated 12 Oct 2006 PDFBox

Screenshot
Pop 114.38
Vit 2.77

PDFBox is a Java library for manipulating PDF documents and extracting contents from existing PDF documents.

No download Website Updated 29 Nov 2005 isobel

Screenshot
Pop 24.41
Vit 1.00

Isobel is a framework to build complex information retrieval and analysis systems. Isobel can be functionally divided in two subsytems, Isobel Gatherer (the crawling and filtering subsystem) and Isobel Analyzer (the analysis subsystem). The two subsytems can also be used separately. Isobel Gatherer offers ready-to-use services like content fetching, scheduling, document format conversion, Hyperlink graph storage and analysis, content storage and indexing. A programmer may easily add new services. Isobel Analyzer uses the IBM UIMA architecture to reuse the analysis components developed for this architecture.

Download Website Updated 06 Oct 2007 Pagex

Screenshot
Pop 14.35
Vit 1.00

Pagex is designed to be a barebones content management system. It was originally built as a time saving precursor to new web development projects. It is a completely functioning "bolt-on" solution aimed at SEO workers, Web developers, and the like. It works purely from the URL used to request a page, simplifying any number of dynamic Web address issues caused when using mod rewrite and allowing you to set a range of variables depending on the URL.

Screenshot

Project Spotlight

Aspose.Slides for Java

A Java component for manipulating PowerPoint presentations.

Screenshot

Project Spotlight

NetStats Baseball

A simulation of major league baseball.