Projects / Sirobot

Sirobot

Sirobot is a Perl script that downloads Web pages recursively. The main advantage over wget is its ability to get them concurrently, and is able to continue aborted downloads and convert absolute links to relative ones. It uses curses, can do HTTPS, and has a pattern-matching filter to prevent you from downloading the whole Internet.

Tags
Licenses
Implementation

Recent releases

  •  09 Apr 2003 01:19

    Release Notes: This release copes with long filenames (> 255 chars), and doesn't ignore cookies set during a 301/302 response.

    •  21 Aug 2001 18:03

      Release Notes: The options --flush and --dump-todolist have been added. They allow you to gracefully shut down pending downloads and continue later with those not processed.

      •  20 Mar 2001 17:14

        Release Notes: Cookie support and an option to dump retrieved links for external processing.

        •  09 Sep 2000 10:55

          Release Notes: This release now recognizes inline images, and includes a new option --exec <prg> to execute external programs after each successful download.

          •  23 May 2000 22:25

            Release Notes: HTTPS support.

            Screenshot

            Project Spotlight

            OpenStack4j

            A Fluent OpenStack client API for Java.

            Screenshot

            Project Spotlight

            TurnKey TWiki Appliance

            A TWiki appliance that is easy to use and lightweight.