Websitary is a script that monitors Web pages, RSS feeds, and podcasts and reports what's new. For many tasks, it reuses other programs (such as w3m, diff, and webdiff) to do the actual work. By default, it works on an ASCII basis, i.e. with the output of text-based Web browsers. With the help of some friends, it can also work with HTML.
|Tags||Internet Web Site Management Link Checking Networking Monitoring|
|Operating Systems||OS Independent|
Release Notes: For RSS attachments, source title is preferred to the channel's title. In body_html, if there is no body tag, the document is used as is. For RSS, items without descriptions are also scanned with :rss_find_enclosure. Various bugfixes and minor improvements that accumulated over the years were applied.
Release Notes: Improved handling of certain HREFs, limited support for use of a proxy (depends on the downloader), and more versatile RSS enclosure extraction.
Release Notes: This releasea has a :timeout option. It can exclude URLs. htmldiff now has an :ignore option to exclude certain nodes from the diff. This release can convert encodings via iconv, and can exclude mailto: URLs.
Release Notes: A ruby-based tool was included to highlight news on HTML pages. The usual set of bugs were fixed.
Release Notes: websitary now also deals with FTP directories and RSS feeds (and it optionally saves attachments such as podcasts). Local copies are stored in a directory tree in order to rebuild the file structure on the server unless the filenames get too long, in which case MD5 encoded filenames are used. The executable name has changed (the extra "i" was dropped). Command-line options have changed. Many other changes were made.