8 projects tagged "Indexing"

Download Website Updated 22 Dec 2001 XMLDB

Pop 54.41
Vit 1.78

XMLDB uses an RDBMS to persist arbitrary XML documents. Because of its storage mechanism, searching for and retrieving documents is fast, and XSL transformations on stored documents are quick as well. The library can be used in any program to store libxml2 documents. A PHP module is also included, which makes XMLDB a complete three-tier Web application development suite.
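As a rough illustration of the general idea of keeping XML in a relational store, the sketch below shreds a document into one row per element so that lookups by tag become plain SQL queries. It is not XMLDB's actual schema or API: the `nodes` table, its columns, and the `store_document` and `find_by_tag` helpers are hypothetical names chosen for this example, and SQLite with Python's standard library stands in for the RDBMS and libxml2.

```python
import sqlite3
import xml.etree.ElementTree as ET


def store_document(conn, doc_name, xml_text):
    """Store each element of an XML document as one row (hypothetical schema)."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS nodes ("
        "id INTEGER PRIMARY KEY, doc TEXT, parent INTEGER, tag TEXT, text TEXT)"
    )
    root = ET.fromstring(xml_text)

    def insert(elem, parent_id):
        # Insert the element, then recurse into its children with its row id.
        cur = conn.execute(
            "INSERT INTO nodes (doc, parent, tag, text) VALUES (?, ?, ?, ?)",
            (doc_name, parent_id, elem.tag, (elem.text or "").strip()),
        )
        node_id = cur.lastrowid
        for child in elem:
            insert(child, node_id)

    insert(root, None)
    conn.commit()


def find_by_tag(conn, tag):
    """Return (document name, text) pairs for every element with the given tag."""
    rows = conn.execute(
        "SELECT doc, text FROM nodes WHERE tag = ? ORDER BY id", (tag,)
    )
    return rows.fetchall()


conn = sqlite3.connect(":memory:")
store_document(
    conn,
    "books",
    "<lib><book><title>A</title></book><book><title>B</title></book></lib>",
)
titles = find_by_tag(conn, "title")
```

Adding an index on the `tag` column would be the natural next step if lookups by element name dominate, though whether XMLDB does anything similar is not stated in its description.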

Download Website Updated 04 Jul 2011 Emdros

Pop 158.09
Vit 16.82

Emdros is a corpus query system for storing and searching linguistically annotated text. It is very generic, supporting almost any kind of annotation from almost any linguistic theory. All linguistic levels of analysis are supported, including phonology, morphology, the lexical level, syntax, and discourse. The core libraries act as a middleware layer between a client and an underlying SQL database. MySQL, PostgreSQL, and SQLite are supported.

No download Website Updated 19 May 2002 SODA Native XML Database System

Pop 31.46
Vit 1.42

The SODA Native XML Database System is a native XML database that provides efficient management of large amounts of XML data. It is based on a multi-user, client-server architecture with a generic query processing layer that can easily support different query languages. In this lightweight version, user-defined indexes and query optimizations have been removed; however, full transaction support (commits and rollbacks) and crash recovery are available.

Download Website Updated 19 Mar 2009 Multivalent PDF Tools

Pop 172.97
Vit 2.41

Multivalent PDF Tools is a suite of tools for manipulating PDF documents. It includes tools for compressing, uncompressing (for hand editing), obtaining metadata, splitting and merging, encrypting and decrypting, validating, imposition (also known as n-up), making page images, extracting text, and full-text indexing (with Lucene). The compress tool shrinks the PDF 1.5 Reference from 13.5MB to 8MB in PDF 1.5/Acrobat 6 format and down to 5.1MB in a new proposed "Compact" format.

Download Website Updated 15 May 2004 StringSearch

Pop 62.40
Vit 1.75

The StringSearch library provides implementations of algorithms of the Boyer-Moore family and the Shift-Or (bit-parallel) family, for use in Java programs that need fast string searching algorithms.
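To give a feel for the bit-parallel family mentioned above, here is a minimal sketch of the textbook Shift-Or algorithm. It is written in Python for brevity and is not StringSearch's Java API; the function name `shift_or_search` is an assumption made for this example. Each bit of the state word tracks whether a prefix of the pattern matches the text ending at the current position, so one shift and one OR per character advance all candidate matches at once.

```python
def shift_or_search(text: str, pattern: str) -> int:
    """Return the index of the first occurrence of pattern in text, or -1."""
    m = len(pattern)
    if m == 0:
        return 0
    all_ones = (1 << m) - 1

    # masks[c] has bit i cleared exactly when pattern[i] == c.
    masks = {}
    for i, ch in enumerate(pattern):
        masks[ch] = masks.get(ch, all_ones) & ~(1 << i)

    # Bit i of state is 0 when pattern[0..i] matches the text ending here.
    state = all_ones
    match_bit = 1 << (m - 1)
    for pos, ch in enumerate(text):
        state = ((state << 1) | masks.get(ch, all_ones)) & all_ones
        if (state & match_bit) == 0:
            return pos - m + 1
    return -1


first_hit = shift_or_search("hello world", "world")
```

Python integers are unbounded, so this sketch has no pattern-length limit; implementations in languages with fixed-width integers typically restrict the pattern to the machine word size or fall back to another algorithm.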

Download Website Updated 12 Oct 2006 PDFBox

Pop 116.29
Vit 2.77

PDFBox is a Java library for manipulating PDF documents and extracting contents from existing PDF documents.

No download Website Updated 29 Nov 2005 isobel

Pop 25.22
Vit 1.00

Isobel is a framework for building complex information retrieval and analysis systems. Isobel can be functionally divided into two subsystems, Isobel Gatherer (the crawling and filtering subsystem) and Isobel Analyzer (the analysis subsystem). The two subsystems can also be used separately. Isobel Gatherer offers ready-to-use services such as content fetching, scheduling, document format conversion, hyperlink graph storage and analysis, and content storage and indexing. A programmer may easily add new services. Isobel Analyzer uses the IBM UIMA architecture to reuse the analysis components developed for this architecture.

No download Website Updated 20 Mar 2007 jSlovo

Pop 46.04
Vit 1.00

jSlovo is a fast database engine with a GUI that was designed for free dictionaries. It can create a file-based database from a text file and then search it for particular words. It can scan any number of file-based databases, and the size of the databases is not limited. HTML tags can be used in the text files, including for cross-references.
