PDFTextStream is a PDF text and metadata extraction library available for Java and .NET. It supports all versions of the PDF document specification (including v1.7, used by Acrobat 8, 9, and X), extraction of text encoded using double-byte character sets (including Chinese, Japanese, and Korean), decryption of documents encrypted using 40-bit, 128-bit, 256-bit, and variable bit length ciphers, and extraction of all document metadata provided by PDF documents (including form data, bookmarks, and annotations). Easy integration with Jakarta Lucene is included, as well as interactive form update capability.
PHP Content Management System (phpCMS) makes it possible to need only one template for your whole Web site. It allows you to provide dynamic menus with unlimited levels, and use templates and sub-templates without a database. It is search engine-friendly and proxy-friendly, as the pages it generates can not be distinguished from static HTML pages. PHP code can be added to any template and content file with an optional module. It supports the caching of parsed pages and gzip compression.
PowerSeek SQL allows you to create, manage, and run your own search engine and directory portal with total control and ease. it is user friendly in every aspect and built for the most demanding uses and customization needs. It comes with an extensive admin panel, the ability to sell link listings, SEO friendly URLs, link reviews/ratings, content sensitive banner rotator, spam filter, broken link checker, custom data fields, mailers, crawlers, pre-designed template sets, reciprocal link checker, image/video/file uploading, RRS feeds, optional PPC functionality, and much more. It can be used for Yellow Pages, real estate, and travel directories, complex product catalogs, image galleries, and more.
Turbo Seek provides the capability to create and run a directory and search engine with ease. It comes with a visually friendly admin control panel that provides all of the aspects to create, customize, and run a fully functional search engine. It supports unlimited sub-categories, and includes a crawler, link checker, site rankings, site reviews, the ability to customize all text/layouts, relevance to search keywords, and more.
Twibright Twig is a static HTML photo gallery software that supports organization of JPEG and PNG images into a directory structure and EXIF and JPEG comments. It is meant for more experienced users rather than newbies. Three levels of downscaled image and three levels of thumbnails are generated. Each image is assigned a unique identifier to faciliate easy random linking from a master Web site. It handles reasonable large galleries (and is currently used for a 3GB one). Automatic regeneration of added, changed, and deleted images can be done with one script.
WebGlimpse is a scalable, feature-rich search engine for indexing your Web site or any collection of local and remote sites you choose. Features include customizable output formats, custom ranking/ordering of hits, fuzzy matching, boolean queries, a Web administration interface for multiple archives, logging of queries, caching of results, and more. Localized search interfaces are provided in multiple languages including Spanish, German, French, Italian, Norwegian, Finnish, Russian, Hebrew, and others. It supports 3rd party filters for indexing PDF, Word, and Excel files. It is free for academic and most nonprofit users.
SWISH stands for Simple Web Indexing System for Humans. With it you can index directories of files and search the generated indexes. Based on EIT's swish, this is a cleaner-compiling, enhanced version with better docs, better installation and configuration tools, more sample configurations, and a Web page to help manage index rebuilding. SWISH is used with w4ais as a Web-based front end.
w4ais is a gateway between a forms-capable Web browser and an indexing/search program, based on the old EIT wwwwais program. This new version is tested completely with swish 1.2 and above, and should still work with freeWAIS. This version allows easy and thorough customization of the search page look and feel, and includes help and various utilities to install and configure multiple interfaces for various users or sites on a single system.