Yioop! is a PHP search engine. Yioop! can be configured as either a general purpose search engine for the whole Web or it can be configured to provide search results for a set of URLs or domains. Yioop can crawl pages or can directly index archives such as ARC and WARC. It supports indexing several file formats such as HTML, Atom, PDF, DOC, PPT, RTF, RSS, XML, SVG, PNG, JPG, BMP, GIF, and sitemaps. The Yioop! crawler can be deployed on one or many machines. It supports having one or more to crawl scheduler processes, as well as multiple fetchers and mirrors. Crawling respects robots.txt including Crawl-delay. Yioop! crawls are stored in a Web archive format that is easy to move around. Crawling can be done on one machine and the results deployed elsewhere. Yioop! supports mixing of crawls. Yioop! comes with a search front end that can be localized as desired using a GUI. This GUI supports RTL languages. Management of crawls can also be done using this GUI. Yioop! can be configured in a straightforward manner to make use of file caching or memcache if available.
Puggle is a desktop search engine that provides full text search over files, folders, music, photos, Web pages, and other data that are stored locally on your computer. Puggle is able to create many different indices, each with a different configuration. For example, you may have a different index for your music collection as well as your documents. Each of them can be used on demand, simply by loading it. Furthermore, Puggle supports indexing of portable devices, like USB flash drives or external hard disks. The index will be stored in the device, using relative paths, allowing you to search over the data very quickly on any computer.