RSS 72 projects tagged "Indexing"

Download Website Updated 30 Nov 2003 harvest

Screenshot
Pop 186.89
Vit 8.65

Harvest is a system to collect information and make it searchable using a Web interface. It can collect information using HTTP, FTP, NNTP, and local files. Supported formats include HTML, DVI, PS, fulltext, mail, man pages, news, troff, WordPerfect, C sources, and many more. Adding support for new formats is easy due to Harvest's modular design.

Download Website Updated 14 Jun 2004 Net::Z3950::SimpleServer

Screenshot
Pop 50.20
Vit 2.63

Net::Z3950::SimpleServer is a Perl module which implements the server side of the Z39.50 (information retrieval) protocol. It hides the complexity of network exchanges, packet serialization, and session handling. You are required only to implement simple callbacks to support searching and record retrieval. It is the basis of the "Zoogle" project, which is a Z39.50 gateway to the Google web index.

Download Website Updated 30 Jan 2001 Sary

Screenshot
Pop 25.10
Vit 2.66

Sary is a suffix array library and tools. It provides fast full-text search facilities for text files on the order of 10 to 100 MB using a data structure called a suffix array. It can also search specific fields in a text file by assigning index points to those fields.

Download Website Updated 29 Oct 2008 WebGlimpse

Screenshot
Pop 173.20
Vit 11.33

WebGlimpse is a scalable, feature-rich search engine for indexing your Web site or any collection of local and remote sites you choose. Features include customizable output formats, custom ranking/ordering of hits, fuzzy matching, boolean queries, a Web administration interface for multiple archives, logging of queries, caching of results, and more. Localized search interfaces are provided in multiple languages including Spanish, German, French, Italian, Norwegian, Finnish, Russian, Hebrew, and others. It supports 3rd party filters for indexing PDF, Word, and Excel files. It is free for academic and most nonprofit users.

Download Website Updated 03 Feb 2001 XM Tool

Screenshot
Pop 25.57
Vit 1.00

XM Tool is a series of Perl snippets than can be called separately or combined into more complex Perl scripts. It uses XMLish (plain) text as the representation between stages, and a sample processor to read C/JavaDoc sources and generate HTML or even docbook is provided.

Download Website Updated 22 Dec 2001 XMLDB

Screenshot
Pop 53.94
Vit 1.77

XMLDB uses an RDBMS to persist arbitrary XML documents. Due to its storage mechanism, searching for and recalling documents is extremely quick. You can also perform XSL translation on documents with surprising speed. The library can be used in any program to store libxml2 documents. A PHP module is also included, making XMLDB into a complete three-tier Web application development suite.

Download Website Updated 16 Mar 2005 Radsearch

Screenshot
Pop 26.68
Vit 2.06

Radsearch is a text utility used to retrieve records from a text file or list of text files, given a keyword and delimiter. This utility was written with the specific purpose of allowing quick retrieval of all login and logout records for a particular user in Radiusd log files.

No download Website Updated 28 Nov 2005 X-Hive/DB

Screenshot
Pop 82.36
Vit 3.71

X-Hive/DB is a powerful native XML database designed for software developers who require advanced XML data processing and storage functionality within their applications. The comprehensive X-Hive/DB Java API contains methods for storing, querying, retrieving, transforming, and publishing XML data. X-Hive/DB supports all major W3C standards, such as XQuery, XPath, DOM, XPointer, XML Schemas, and more.

Download Website Updated 24 Oct 2001 gelapas

Screenshot
Pop 19.52
Vit 1.00

gelapas crawls the file tree and extracts information from files. The default settings (and the shorthand options) are useful to extract information such as the title or meta tags from HTML files, but it could also be used for other kind of documents.

No download Website Updated 04 Feb 2002 Java Search Engine

Screenshot
Pop 76.93
Vit 66.80

Java Search Engine is a server-side search engine program for Web sites written completely in Java. It features HTML and PDF indexing, a built-in Web crawler, international encodings support, words and phrases search, and returning results as quotations with highlighted words (like Google). It is available as EJB, JSP, servlet, or Java API library. For non-Java enviroments, it is available as an XML server with XSLT support.

Screenshot

Project Spotlight

JustSort

A simple application for sorting data in your browser.

Screenshot

Project Spotlight

Goggles Music Manager

A music collection manager and player.