RSS 30 projects tagged "Indexing/Search"

Download Website Updated 22 Jul 2002 ASPseek

Screenshot
Pop 175.67
Vit 5.10

ASPseek is an Internet search engine, written in C++ using the STL library. It consists of an indexing robot, a search daemon, and a search frontend (CGI or Apache module). It can index as many as a few million URLs and search for words and phrases, use wildcards, and do a Boolean search. Search results can be limited to time period given, site, or Web space (set of sites) and sorted by relevance (PageRanks are used) or date. It is optimized for multiple sites (threaded index, async DNS lookups, grouping results by site, and Web spaces), but can be used for searching one site as well. It can work with multiple languages/encodings at once (including multi-byte encodings such as Chinese) due to optional Unicode storage mode. Other features include stopwords and ispell support, a charset and language guesser, HTML templates for search results, excerpts, and query words highlighting.

Download Website Updated 09 Aug 2002 Alkaline UNIX/NT Search Engine

Screenshot
Pop 124.38
Vit 3.74

Alkaline is a full-featured standalone search and index server. The spider is a fully remote indexing daemon which includes support for all standards like robots.txt and "skip" meta tags, and allows multiple distinct configurations and search groups (searching many different sites from your server), including complex regexp indexing paths, authentification, filters for various document formats, XML-based online management and statistics, mrtg-compatible perf numbers, and more.

No download Website Updated 12 Sep 2004 C-Cramp

Screenshot
Pop 25.38
Vit 59.18

C-Cramp (the C-Cramp College Radio Audio Management Program) is a Web-based frontend to MySQL for managing the types of things that small radio stations might need: audio files, data, and "metadata"; DJ and staff information, schedules, live music and program logs, and all sorts of other data. Currently, a cross-platform PHP application is the focus of the project, but more features and types of programs are planned that will hopefully enable easier playback, storage, loading, and entering for all types of applicable data.

Download Website Updated 16 Apr 2009 CLucene

Screenshot
Pop 75.78
Vit 2.54

CLucene is a C++ port of Lucene, a high-performance, full-featured text search engine. It is, however, faster than Lucene as it is written in C++.

Download Website Updated 01 Jul 2003 CMS loa2003

Screenshot
Pop 18.97
Vit 1.00

CMS loa2003 is an attempt to produce a cross-platform CMS using C# and SSCLI on Windows .NET and Mono. It provides an object database, security features, and much more.

Download Website Updated 28 Mar 2004 DirList

Screenshot
Pop 39.55
Vit 2.01

DirList is a user directory system that runs as a CGI to serve up user lists, search for various user attributes, view their web sites, define personalised user attributes, and keep it all synchronized automatically with the underlying operating system's user database on periodic intervals with cron.

Download Website Updated 26 Aug 2008 Douglas Thrift's Search Engine

Screenshot
Pop 42.83
Vit 2.44

Douglas Thrift's Search Engine is an indexing search engine for use on small Web sites such as personal or small business sites. It is designed to be very similar to Google for end users and its output is customizable. For indexing, it supports both the Robots Exclusion Protocol and the Robots META Tag.

Download Website Updated 14 Sep 2004 FreyaSX

Screenshot
Pop 23.41
Vit 1.00

FreyaSX is a simple and portable full-text search engine which runs stand-alone on diverse operating systems. It runs without background database or external command, and its data files are portable across platforms. It provides a search interface via HTTP by the bundled HTTP server (DeleGate).

Download Website Updated 02 Aug 2007 Greenstone

Screenshot
Pop 68.61
Vit 4.51

Greenstone is a complete digital library creation, management, and distribution package for Unix, Windows, and Mac OS X. Users create collections by gathering a set of input documents, specifying a configuration file, and running the build script. It provides full-text and fielded searching, browsable indexes, customised formatting, metadata extraction (acronyms, languages, etc), a Z39.50 client, and many other features. It supports many input formats, the interface is configurable and multi-lingual, and collections can be distributed on the Web or on CD-ROM.

Download Website Updated 21 Jan 2008 Java Mozilla Html Parser

Screenshot
Pop 43.13
Vit 2.41

Mozilla Java Html Parser is a Java package that enables you to parse HTML pages into a Java Document object. The parser is a wrapper around Mozilla's HTML parser, thus giving the user a browser-quality HTML parser. This parser was developed as a part of Dapper.

Screenshot

Project Spotlight

Sudokuki

A Sudoku game.

Screenshot

Project Spotlight

Novius OS

A CMS that takes up the challenge of managing Web content in today’s multi-channel environment.