2 projects tagged "web crawler"

Methanol (updated 25 Jun 2009; no download, no website)

Pop 34.42
Vit 1.02

Methanol is a modular, customizable Web crawling system with crawlers optimized for speed. It is designed to let the administrator set up any kind of file-type handling, parsing, and indexing rules.

SpiderBot (updated 18 Nov 2012; download available, no website)

Pop 21.19
Vit 13.63

SpiderBot crawls the Web, retrieves content, and performs actions on that content. It is an effort to design and develop a truly pipelined, distributed Web crawler.

Project Spotlight

OpenSearchServer

A search engine with a Web, file, and database crawler.

Project Spotlight

HPCC Systems

A massive parallel-processing computing platform that solves big data problems.