HotSAX is a fast, small footprint, non-validating SAX2 parser for HTML/XML/XHTML. It can be used in simple web agents, page scrapers, and spiders. It is similar to the Apache Xerces parser, except that it can generate SAX events for badly formatted HTML as well.
|Tags||Internet Web Indexing/Search Site Management Link Checking Software Development Libraries Java Libraries Version Control CVS Text Processing Markup HTML/XHTML XML|
|Operating Systems||OS Independent|
Release Notes: Added org.xml.sax classes to the build which removes the dependency on Xerces or some other SAX parser. This means that HotSAX can run by itself. A classic compiler property was removed from build.xml, which fixes a bug on some JDK1.3+ platforms. HotSAX-buildtools was updated with all of the files required to build and test HotSAX.
Release Notes: This version adds contrib package for user contributions.