89 projects tagged "Indexing"

harvest (updated 30 Nov 2003)

Pop 188.75
Vit 8.65

Harvest is a system to collect information and make it searchable using a Web interface. It can collect information using HTTP, FTP, NNTP, and local files. Supported formats include HTML, DVI, PS, fulltext, mail, man pages, news, troff, WordPerfect, C sources, and many more. Adding support for new formats is easy due to Harvest's modular design.

Sary (updated 30 Jan 2001)

Pop 25.38
Vit 2.66

Sary is a suffix array library with accompanying tools. It provides fast full-text search facilities for text files on the order of 10 to 100 MB using a data structure called a suffix array. It can also search specific fields in a text file by assigning index points to those fields.
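
To illustrate the data structure mentioned above, here is a minimal sketch of a suffix array and a binary search over it. It is not Sary's implementation or API: the function names `build_suffix_array` and `find_all` are hypothetical, and the construction shown is the naive sort-all-suffixes approach, which is far too slow and memory-hungry for files in the 10 to 100 MB range.

```python
def build_suffix_array(text: str) -> list[int]:
    """Return the start offsets of all suffixes of text, sorted lexicographically.

    Naive construction (sorts full suffix strings); for illustration only.
    """
    return sorted(range(len(text)), key=lambda i: text[i:])


def find_all(text: str, sa: list[int], pattern: str) -> list[int]:
    """Binary-search the suffix array for every offset where pattern occurs."""
    m = len(pattern)

    # Lower bound: first suffix whose m-character prefix is >= pattern.
    lo, hi = 0, len(sa)
    while lo < hi:
        mid = (lo + hi) // 2
        if text[sa[mid]:sa[mid] + m] < pattern:
            lo = mid + 1
        else:
            hi = mid
    start = lo

    # Upper bound: first suffix whose m-character prefix is > pattern.
    hi = len(sa)
    while lo < hi:
        mid = (lo + hi) // 2
        if text[sa[mid]:sa[mid] + m] <= pattern:
            lo = mid + 1
        else:
            hi = mid

    # Every suffix in sa[start:lo] begins with pattern.
    return sorted(sa[start:lo])
```

Because all suffixes sharing a prefix sit next to each other in the sorted array, each lookup needs only a logarithmic number of string comparisons, which is what makes this structure attractive for full-text search.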

WebGlimpse (updated 29 Oct 2008)

Pop 173.05
Vit 11.33

WebGlimpse is a scalable, feature-rich search engine for indexing your Web site or any collection of local and remote sites you choose. Features include customizable output formats, custom ranking and ordering of hits, fuzzy matching, boolean queries, a Web administration interface for multiple archives, logging of queries, caching of results, and more. Localized search interfaces are provided in multiple languages, including Spanish, German, French, Italian, Norwegian, Finnish, Russian, Hebrew, and others. It supports third-party filters for indexing PDF, Word, and Excel files. It is free for academic and most nonprofit users.

wf (updated 17 Jan 2008)

Pop 41.35
Vit 2.94

wf scans a text file or standard input, counts how often each word occurs across the whole text, and writes each word with its frequency to stdout.
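
The behavior described above can be sketched in a few lines. This is not wf's source code; the names `word_frequencies` and `print_frequencies`, the case folding, the word pattern, and the tab-separated output format are all assumptions made for the example.

```python
import re
import sys
from collections import Counter


def word_frequencies(text: str) -> Counter:
    """Count case-insensitive occurrences of each word in text."""
    # Assumption: a "word" is a run of ASCII letters, digits, or apostrophes.
    words = re.findall(r"[a-z0-9']+", text.lower())
    return Counter(words)


def print_frequencies(source, out=sys.stdout) -> None:
    """Read all text from a file-like object and write 'word<TAB>count' lines."""
    for word, count in word_frequencies(source.read()).most_common():
        out.write(f"{word}\t{count}\n")
```

Calling `print_frequencies(sys.stdin)` covers the standard-input case, and passing an opened file object covers the text-file case.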

YASE (updated 13 Apr 2009)

Pop 79.94
Vit 3.30

YASE is a text indexing and retrieval system. It allows you to index your document collection very easily. All words are indexed and can be optionally stemmed. The query tool supports searching all/any terms and can rank query results by relevance using the cosine measure.
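
As a rough illustration of ranking by the cosine measure, the sketch below scores documents against a query using raw term counts. It is not YASE's code: the helper names (`tokenize`, `cosine`, `rank`) are hypothetical, there is no stemming, and the term weighting YASE actually applies is not stated here, so plain counts are an assumption.

```python
import math
from collections import Counter


def tokenize(text: str) -> list[str]:
    """Split on whitespace and lowercase (assumed, simplistic tokenizer)."""
    return text.lower().split()


def cosine(a: Counter, b: Counter) -> float:
    """Cosine of the angle between two term-count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank(query: str, docs: dict[str, str]) -> list[tuple[str, float]]:
    """Return (name, score) pairs for matching documents, best first."""
    q = Counter(tokenize(query))
    scored = [(name, cosine(q, Counter(tokenize(body))))
              for name, body in docs.items()]
    # Drop documents that share no terms with the query.
    return sorted((s for s in scored if s[1] > 0),
                  key=lambda s: s[1], reverse=True)
```

The score depends on the direction of the term vectors rather than their length, so a long document is not favored merely for containing more words.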

XM Tool (updated 03 Feb 2001)

Pop 25.61
Vit 1.00

XM Tool is a series of Perl snippets that can be called separately or combined into more complex Perl scripts. It uses XMLish (plain) text as the representation between stages, and a sample processor that reads C/JavaDoc sources and generates HTML or even DocBook is provided.

Doli (updated 15 Jul 2002)

Pop 18.76
Vit 2.10

Doli (Documentation Libre Indexée) is a portable system to index and search documentation. The system consists of an indexer, and a Tcl-based Web server which provides the search interface. It was designed to provide a platform-independent method for searching HTML documentation. A PHP and MySQL interface is also included.

XMLDB (updated 22 Dec 2001)

Pop 53.67
Vit 1.77

XMLDB uses an RDBMS to persist arbitrary XML documents. Due to its storage mechanism, searching for and recalling documents is extremely quick. You can also perform XSL transformations on documents with surprising speed. The library can be used in any program to store libxml2 documents. A PHP module is also included, making XMLDB a complete three-tier Web application development suite.

LinkMaster (updated 24 Jun 2001)

Pop 31.08
Vit 68.46

LinkMaster is a method of linking data between different applications on Palm devices. There are not many applications that support this method, but the list is growing. Even without special application support, it tracks recently-used programs and bookmarks for quick access.

Radsearch (updated 16 Mar 2005)

Pop 26.61
Vit 2.06

Radsearch is a text utility used to retrieve records from a text file or list of text files, given a keyword and delimiter. This utility was written with the specific purpose of allowing quick retrieval of all login and logout records for a particular user in Radiusd log files.
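
A keyword-and-delimiter lookup of this kind might look like the sketch below. It is not Radsearch's code or command-line interface. The function name `find_records` is hypothetical, and the sketch assumes that the delimiter separates records and that a record matches when it contains the keyword as a substring; the description does not confirm either point.

```python
def find_records(text: str, keyword: str, delimiter: str) -> list[str]:
    """Return every delimiter-separated record in text that contains keyword.

    Assumptions: delimiter is a non-empty string that separates records,
    and matching is a plain, case-sensitive substring test.
    """
    records = (r.strip() for r in text.split(delimiter))
    return [r for r in records if r and keyword in r]
```

For example, with a made-up log whose records are separated by `--` lines, `find_records(log_text, "alice", "--")` would return only the records that mention `alice`.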
