Alkaline is a full-featured standalone search and index server. The spider is a fully remote indexing daemon which includes support for all standards like robots.txt and "skip" meta tags, and allows multiple distinct configurations and search groups (searching many different sites from your server), including complex regexp indexing paths, authentification, filters for various document formats, XML-based online management and statistics, mrtg-compatible perf numbers, and more.
PHP Content Management System (phpCMS) makes it possible to need only one template for your whole Web site. It allows you to provide dynamic menus with unlimited levels, and use templates and sub-templates without a database. It is search engine-friendly and proxy-friendly, as the pages it generates can not be distinguished from static HTML pages. PHP code can be added to any template and content file with an optional module. It supports the caching of parsed pages and gzip compression.
SWISH stands for Simple Web Indexing System for Humans. With it you can index directories of files and search the generated indexes. Based on EIT's swish, this is a cleaner-compiling, enhanced version with better docs, better installation and configuration tools, more sample configurations, and a Web page to help manage index rebuilding. SWISH is used with w4ais as a Web-based front end.
w4ais is a gateway between a forms-capable Web browser and an indexing/search program, based on the old EIT wwwwais program. This new version is tested completely with swish 1.2 and above, and should still work with freeWAIS. This version allows easy and thorough customization of the search page look and feel, and includes help and various utilities to install and configure multiple interfaces for various users or sites on a single system.
Turbo Seek provides the capability to create and run a directory and search engine with ease. It comes with a visually friendly admin control panel that provides all of the aspects to create, customize, and run a fully functional search engine. It supports unlimited sub-categories, and includes a crawler, link checker, site rankings, site reviews, the ability to customize all text/layouts, relevance to search keywords, and more.
PowerSeek SQL allows you to create, manage, and run your own search engine and directory portal with total control and ease. it is user friendly in every aspect and built for the most demanding uses and customization needs. It comes with an extensive admin panel, the ability to sell link listings, SEO friendly URLs, link reviews/ratings, content sensitive banner rotator, spam filter, broken link checker, custom data fields, mailers, crawlers, pre-designed template sets, reciprocal link checker, image/video/file uploading, RRS feeds, optional PPC functionality, and much more. It can be used for Yellow Pages, real estate, and travel directories, complex product catalogs, image galleries, and more.
PDFTextStream is a PDF text and metadata extraction library available for Java and .NET. It supports all versions of the PDF document specification (including v1.7, used by Acrobat 8, 9, and X), extraction of text encoded using double-byte character sets (including Chinese, Japanese, and Korean), decryption of documents encrypted using 40-bit, 128-bit, 256-bit, and variable bit length ciphers, and extraction of all document metadata provided by PDF documents (including form data, bookmarks, and annotations). Easy integration with Jakarta Lucene is included, as well as interactive form update capability.