ASPseek is an Internet search engine written in C++ using the STL. It consists of an indexing robot, a search daemon, and a search frontend (CGI or Apache module). It can index as many as a few million URLs and supports word and phrase searches, wildcards, and Boolean queries. Search results can be limited to a given time period, site, or Web space (set of sites) and sorted by relevance (PageRanks are used) or date. It is optimized for multiple sites (threaded index, async DNS lookups, grouping results by site, and Web spaces), but can be used for searching one site as well. It can work with multiple languages/encodings at once (including multi-byte encodings such as Chinese) thanks to an optional Unicode storage mode. Other features include stopwords and ispell support, a charset and language guesser, HTML templates for search results, excerpts, and query word highlighting.
cthumb allows you to create Web albums of digital pictures with thumbnails, captions, and several different views of the collection, including (optionally) several languages and resolutions. An album is composed of a series of pages, each composed of a collection of pictures. You can have several annotations per picture, and can customize almost everything in the way the albums look on the screen.
DirList is a user directory system that runs as a CGI program. It serves up user lists, searches by various user attributes, links to users' Web sites, supports personalised user attributes, and keeps everything synchronized automatically with the underlying operating system's user database at periodic intervals via cron.
Enzyme is an architecture for creating user-searching-user applications. Right now, it is primarily useful as a resume creation/search and HR management system. However, its potential uses are wide-ranging, and it offers PHP developers an alternative to relying on simple "keyword" searching systems.
mp3riot (formerly known as f2html.pl) is a command-line utility that searches recursively through directories, builds a file list (with additional file information), and generates HTML files, playlists, etc. The output can be controlled, links can be corrected, and more. The script is mainly designed to create Web pages, playlists, and databases for MP3 and Ogg files, but can also be used for other purposes.
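The general technique described here (walk a directory tree, collect a file list, emit HTML and a playlist) can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not mp3riot's own code or options: the function names `collect_files`, `render_html`, and `render_m3u` and the chosen extension set are hypothetical.

```python
import html
import os
from urllib.parse import quote

# Assumption: only these extensions count as audio files.
AUDIO_EXTENSIONS = {".mp3", ".ogg"}


def collect_files(root):
    """Walk root recursively; return sorted (relative_path, size_bytes) pairs."""
    found = []
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            if os.path.splitext(name)[1].lower() in AUDIO_EXTENSIONS:
                full = os.path.join(dirpath, name)
                # Use forward slashes so paths work as relative links.
                rel = os.path.relpath(full, root).replace(os.sep, "/")
                found.append((rel, os.path.getsize(full)))
    return sorted(found)


def render_html(entries):
    """Render the file list as an HTML unordered list of links."""
    items = [
        '<li><a href="{}">{}</a> ({} bytes)</li>'.format(
            html.escape(quote(rel), quote=True), html.escape(rel), size
        )
        for rel, size in entries
    ]
    return "<ul>\n" + "\n".join(items) + "\n</ul>"


def render_m3u(entries):
    """Render the file list as a plain playlist, one relative path per line."""
    return "".join(rel + "\n" for rel, _size in entries)
```

Calling `collect_files("/path/to/music")` and passing the result to either renderer yields the page or playlist text, which the caller can then write to disk.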
FemFind is a crawler and search engine for SMB shares (provided by Samba/Unix or Windows) and FTP servers. The crawler maps the filesystem structure of your shares to a MySQL database. Then, the Web interface or a Windows client can be used to quickly locate any file on the network.
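The crawl-then-query idea can be illustrated with a minimal Python sketch. It makes several assumptions that differ from FemFind itself: it walks a local directory standing in for a mounted share, it uses SQLite from the standard library in place of MySQL, and the table layout and the names `index_tree` and `find_files` are hypothetical.

```python
import os
import sqlite3


def index_tree(conn, share_name, root):
    """Record every file under root in the 'files' table; return the row count."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS files "
        "(share TEXT, path TEXT, name TEXT, size INTEGER)"
    )
    rows = []
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            full = os.path.join(dirpath, name)
            rel = os.path.relpath(full, root).replace(os.sep, "/")
            rows.append((share_name, rel, name, os.path.getsize(full)))
    conn.executemany("INSERT INTO files VALUES (?, ?, ?, ?)", rows)
    conn.commit()
    return len(rows)


def find_files(conn, fragment):
    """Return (share, path) pairs whose file name contains fragment.

    SQLite's LIKE is case-insensitive for ASCII; '%' and '_' inside
    fragment act as wildcards because they are not escaped here.
    """
    cursor = conn.execute(
        "SELECT share, path FROM files WHERE name LIKE ? ORDER BY share, path",
        ("%" + fragment + "%",),
    )
    return cursor.fetchall()
```

Because the lookup runs against the database rather than the shares themselves, a search does not need to touch the network at query time.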
Greenstone is a complete digital library creation, management, and distribution package for Unix, Windows, and Mac OS X. Users create collections by gathering a set of input documents, specifying a configuration file, and running the build script. It provides full-text and fielded searching, browsable indexes, customised formatting, metadata extraction (acronyms, languages, etc.), a Z39.50 client, and many other features. It supports many input formats, the interface is configurable and multilingual, and collections can be distributed on the Web or on CD-ROM.
HPFind searches your users' directories for homepages and displays them in a table on an HTML page. The administrator can specify all output text, background colours, and file locations. It can exclude a given list of users. The program can also be used as a small search tool for filenames.
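A homepage scan of this kind might look like the following Python sketch. It is not HPFind's implementation: the `public_html/index.html` location, the `/~user/` URL form, and the names `find_homepages` and `render_table` are all assumptions made for illustration.

```python
import html
import os


def find_homepages(home_root, excluded=(), page="public_html/index.html"):
    """Return sorted (user, url) pairs for users under home_root with a homepage.

    Assumption: each subdirectory of home_root is named after its user.
    """
    skip = set(excluded)
    found = []
    for user in sorted(os.listdir(home_root)):
        if user in skip:
            continue
        candidate = os.path.join(home_root, user, *page.split("/"))
        if os.path.isfile(candidate):
            found.append((user, "/~" + user + "/"))
    return found


def render_table(homepages):
    """Render the (user, url) pairs as a simple HTML table."""
    lines = ["<table>", "<tr><th>User</th><th>Homepage</th></tr>"]
    for user, url in homepages:
        lines.append(
            '<tr><td>{}</td><td><a href="{}">{}</a></td></tr>'.format(
                html.escape(user), html.escape(url, quote=True), html.escape(url)
            )
        )
    lines.append("</table>")
    return "\n".join(lines)
```

Passing a different `page` value turns the same scan into a simple per-user filename check, and `excluded` plays the role of the exclusion list.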