SpiderBot crawls the Web, retrieves content, and performs actions on that content. The project aims to design and develop a fully pipelined, distributed Web crawler.
Smart Cache Loader is a highly configurable Web grabber with dedicated Smart Cache support.
An integrated development environment.
A Java framework for building data integration and ETL applications.