RSS 4 projects tagged "web crawler"

Download Website Updated 02 May 2013 OpenSearchServer

Screenshot
Pop 917.68
Vit 51.20

OpenSearchServer is a stable, high-performance search engine and a suite of high-powered full text search algorithms. Documents can be indexed in sixteen languages. Multi-lingual analyzers slice sentences into words, then run lemmatisation algorithms on words based on the document's language. Numerous document formats are supported, such as XML, HTML/XHTML, PDF, Word, PowerPoint, RTF, OpenOffice, plain text, MP3/4, Ogg, FLAC, etc. The Web interface, built around the Zkoss framework, provides an easy way to manage OSS. The integration is fast using the PHP client or the API (XML over HTTP). The crawlers of OpenSearchServer go through Web sites, file systems, and databases to rapidly and easily build your index.

Download No website Updated 18 Nov 2012 SpiderBot

Screenshot
Pop 21.12
Vit 13.70

SpiderBot crawls the Web, retrieves content, and performs actions on the content. It is an effort to design and develop a truly pipelined distributed Web crawler.

Download Website Updated 13 Jun 2009 Smart Cache Loader

Screenshot
Pop 73.32
Vit 4.65

Smart Cache Loader is a very configurable Web grabber with special Smart Cache support.

No download No website Updated 25 Jun 2009 Methanol

Screenshot
Pop 33.90
Vit 1.02

Methanol is a modular, customizable Web crawling system with crawlers optimized for speed. It is designed to allow the administrator to set up any kind of filetype handling, parsing, and indexing rules.

Screenshot

Project Spotlight

OpenSearchServer

A search engine with a Web, file, and database crawler.

Screenshot

Project Spotlight

Aspose.Email for .NET

A suite of .NET components for email programming.