RSS 22 projects tagged "Indexing/Search"

Download Website Updated 21 Dec 2000 bew

Screenshot
Pop 56.42
Vit 67.54

bew is a Web mirroring tool that will recursively get a whole set of Web pages using the HEAD mechanism so that it only downloads pages that have changed. It mostly works, but there are still a lot of features to be added. Support for checking external links is also implemented.

Download Website Updated 14 Jun 2004 Net::Z3950::SimpleServer

Screenshot
Pop 62.11
Vit 2.66

Net::Z3950::SimpleServer is a Perl module which implements the server side of the Z39.50 (information retrieval) protocol. It hides the complexity of network exchanges, packet serialization, and session handling. You are required only to implement simple callbacks to support searching and record retrieval. It is the basis of the "Zoogle" project, which is a Z39.50 gateway to the Google web index.

Download Website Updated 05 Mar 2001 PicBook

Screenshot
Pop 44.45
Vit 1.49

PicBook automatically produces a photo album in HTML format of your scanned images or photographs. It come with automatic image processing, slideshow, transition effects, and other nifty features. It is easy to customise with its configuration file and HTML-templates. PicBook is a Bourne shell script and therefore should run on any Unix or Linux system. It requires the standard grep and awk commands, which should be available on most systems. Additionally, it needs the "convert" and "identify" commands from the ImageMagick package to handle images.

Download Website Updated 12 Feb 2004 Site Index

Screenshot
Pop 31.61
Vit 1.16

Site Index is a simple script which generates HTML pages showing a site index for all of your local domains. It uses the natural hierarchy of your filesystem, so if you've organized your pages well, the job is fully automated. It breaks the index into multiple pages to respect a links-per-page limit, and can do some sorting/management of the results according to domain/page "importance".

Download Website Updated 04 Aug 2002 Java WebSuck

Screenshot
Pop 48.18
Vit 3.36

WebSuck goes through a Web page, following links and making a list of the datafiles encountered along the way. It is useful for such tasks as downloading large image galleries without clicking all the links yourself. It can output a file list in a format appropriate for wget, and another for GetRight. It can be used either via a Swing GUI or in console mode.

No download Website Updated 30 Sep 2001 @1 Links Submission and Approval System

Screenshot
Pop 14.87
Vit 65.42

@1 Links Submission and Approval System I lets visitors submit free links to your site. Their links will only be added to the main datafile once you have viewed and approved them. A password- protected admin interface for searching, adding, marking, approving, holding, editing, and deleting is included. The user may also upload an optional site logo.

No download Website Updated 04 Feb 2002 Java Search Engine

Screenshot
Pop 86.61
Vit 64.44

Java Search Engine is a server-side search engine program for Web sites written completely in Java. It features HTML and PDF indexing, a built-in Web crawler, international encodings support, words and phrases search, and returning results as quotations with highlighted words (like Google). It is available as EJB, JSP, servlet, or Java API library. For non-Java enviroments, it is available as an XML server with XSLT support.

Download Website Updated 22 Nov 2006 Metis

Screenshot
Pop 34.73
Vit 1.84

This is a tool to collect information from web servers and to spider the web sites. This was written for the Open Source Security Testing Methodology (OSSTM) located on http://www.ideahamster.org/osstmm- description.htm. The spider is a multi-threaded resusable module that can be used in other projects.

Download Website Updated 02 Aug 2004 DocTaur

Screenshot
Pop 124.27
Vit 3.32

DocTaur is a Web-based searchable directory of reference manuals. You can freely download, install, and administrate it on your local Linux intranet server. It is powered by the ht://Dig search engine and contains reference manuals for developers.

Download Website Updated 18 Sep 2004 Dirlister

Screenshot
Pop 50.10
Vit 1.52

Dirlister provides an alternative to the standard Apache directory listings and directory indexer . It creates Web pages that are displayed when looking at folders without index files. There is icon theme support and complete style setup options to make it more personalized.

Screenshot

Project Spotlight

synctool

A cluster administration tool.

Screenshot

Project Spotlight

Snow

A program to conceal messages in ASCII text by appending whitespace to the end of lines.