urlwatch is a script that watches URLs and notifies you by email of any changes. Each notification includes the URL that changed and a unified diff of the changes. The script runs from a single directory, so nothing needs to be installed; state files are kept in the same directory. A filter hook function lets you strip parts of a page that change on every fetch. It is typically run as a cron job.
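The compare-and-diff step described above can be sketched roughly as follows. This is an illustration using Python's standard difflib, not urlwatch's actual code: the names `strip_volatile` and `change_report`, and the "Generated at" marker the filter looks for, are all assumptions made for the example.

```python
import difflib


def strip_volatile(text):
    """Hypothetical filter hook: drop lines that change on every fetch."""
    kept = [line for line in text.splitlines() if "Generated at" not in line]
    return "\n".join(kept)


def change_report(url, old_text, new_text, filter_hook=strip_volatile):
    """Return a unified diff for url, or an empty string if nothing changed."""
    old_lines = filter_hook(old_text).splitlines()
    new_lines = filter_hook(new_text).splitlines()
    diff = difflib.unified_diff(
        old_lines,
        new_lines,
        fromfile=url + " (old)",
        tofile=url + " (new)",
        lineterm="",
    )
    # unified_diff yields nothing when both inputs are identical
    return "\n".join(diff)
```

In a real setup the old text would come from a state file and the new text from a fresh download; both are passed in as plain strings here so the sketch runs without network access.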
SWEC is a program that automates testing of dynamic Web sites. It parses each HTML file it finds for links, and checks every linked page that lies within the specified site. In addition to locating links, it scans each page for known errors and reports them. It also reports any page that cannot be read (for example, one that returns a 404, 500, or similar error).
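The link-following rule described above (check a linked page only if it belongs to the site under test) might look something like this minimal sketch. It is not SWEC's implementation; `LinkCollector`, `internal_links`, and `is_unreadable` are hypothetical names, and treating every status code of 400 or above as unreadable is an assumption.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse


class LinkCollector(HTMLParser):
    """Collect the href value of every <a> tag fed to the parser."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def internal_links(page_url, html):
    """Return absolute links from html that stay on page_url's host."""
    collector = LinkCollector()
    collector.feed(html)
    site = urlparse(page_url).netloc
    result = []
    for href in collector.links:
        absolute = urljoin(page_url, href)  # resolve relative links
        if urlparse(absolute).netloc == site and absolute not in result:
            result.append(absolute)
    return result


def is_unreadable(status_code):
    """Assumed rule: any 4xx or 5xx response counts as an unreadable page."""
    return status_code >= 400
```

A crawler built on this would keep a queue of unvisited internal links and repeat the extraction for each page it fetches.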
smupcheck, which stands for Smart Update Checker, checks Web sites for updates automatically, even if they don't offer an RSS feed. It is a very basic tool, and does not offer advanced features such as checking password-protected Web sites, highlighting changes, or filtering results.
PottyMouth transforms completely unstructured and untrusted text to valid, nice-looking, completely safe XHTML. PottyMouth is designed to handle input text from non-technical, potentially careless, or malicious users. It produces HTML that is completely safe, programmatically and visually, to include on any Web page. You don't need to make your users read any instructions before they start typing. They don't even need to know that PottyMouth is being used.
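As a rough illustration of the general idea (escape everything, then add only markup you generate yourself), here is a much simplified sketch. It does not use PottyMouth's API and covers far less than the description above; the function name `text_to_safe_xhtml` and the paragraph rule (blank lines separate paragraphs) are assumptions.

```python
import html
import re


def text_to_safe_xhtml(raw):
    """Escape untrusted text and wrap blank-line-separated chunks in <p>."""
    paragraphs = re.split(r"\n\s*\n", raw.strip())
    out = []
    for paragraph in paragraphs:
        paragraph = paragraph.strip()
        if not paragraph:
            continue
        # Escape first, so any markup typed by the user becomes inert text
        escaped = html.escape(paragraph, quote=True)
        out.append("<p>" + escaped.replace("\n", "<br />") + "</p>")
    return "\n".join(out)
```

Because escaping happens before any tags are added, the only elements in the output are the `<p>` and `<br />` tags the function itself inserts.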
safox is a simple PHP API for XML handling. It merges the DOM approach with XML, and it provides a simple, object-oriented API for PHP-based XML generation, parsing, manipulation, and traversal. SAFOX provides a generation package and a package that parses XML documents and returns objects.
EZ Reusable Objects (EZRO) is a Web application that can be used by non-technical staff to manage content as "objects." Content objects containing text, video, and audio can be shared, modified, and re-styled to appear via a traditional Web site, an on-line course, an innovative "Coach," or as a community of interest site. It is highly scalable and can be used for public Web sites, secure environments, and private intra/extranets.
HTML::TagReader is a Perl module that lets you read HTML/XML files tag by tag. Its package includes several application programs that are useful for Webmasters. They all start with the prefix "tr_" to make their names easy to remember. tr_blck is a utility to check for broken links in Web pages. tr_llnk is a utility to build tar archives. tr_xlnk can process HTML files and expand links on directories. tr_mvlnk modifies links in HTML files with Perl commands. tr_staticssi expands the SSI directives #include virtual and #exec cmd. tr_tagcontentgrep greps for a tag (e.g., "img src"). tr_imgaddsize adds "width" and "height" to "img src=...".
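To give a feel for what grepping by tag means, the sketch below does something similar in Python with the standard html.parser module. It is not HTML::TagReader or tr_tagcontentgrep, and it does not reflect their interfaces; `TagGrep` and `grep_tag` are hypothetical names.

```python
from html.parser import HTMLParser


class TagGrep(HTMLParser):
    """Record (line number, attribute value) for each matching tag."""

    def __init__(self, tag, attr):
        super().__init__()
        self.tag = tag
        self.attr = attr
        self.matches = []

    def handle_starttag(self, tag, attrs):
        if tag != self.tag:
            return
        for name, value in attrs:
            if name == self.attr:
                line, _column = self.getpos()  # position of the tag start
                self.matches.append((line, value))


def grep_tag(html_text, tag, attr):
    """Return every (line, value) pair where tag carries attr."""
    grep = TagGrep(tag, attr)
    grep.feed(html_text)
    grep.close()
    return grep.matches
```

Calling `grep_tag(text, "img", "src")` lists the source of every image together with the line it appears on, which is the kind of report a broken-link or missing-size check could start from.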