Projects / Jericho HTML Parser

Jericho HTML Parser

Jericho HTML Parser is a Java library allowing analysis and manipulation of parts of an HTML document, including server-side tags, while reproducing verbatim any unrecognized or invalid HTML. It also provides high-level HTML form manipulation functions.

Tags
Licenses
Operating Systems
Implementation

Recent releases

  •  31 Oct 2012 01:29

    Release Notes: This release includes important bugfixes and various enhancements.

    •  05 Mar 2011 06:42

      Release Notes: This version includes important bugfixes and various enhancements including HTML5 support.

      •  11 Jun 2009 01:22

        Release Notes: Important bugfixes and a new stream-based parsing option allowing memory efficient processing of large files.

        •  09 Apr 2009 23:49

          Release Notes: This version is a major new release that requires the Java 5 runtime or later. It introduces major API changes such as generics and enums, as well as some new features.

          •  25 Jun 2008 06:56

            Release Notes: This version includes important bugfixes and the following enhancements. Non-server tags are no longer recognized inside server tags. Microsoft downlevel-revealed conditional comments are recognized. All unnecessary white space may be removed from a source document. Various other enhancements were made to existing features.

            Recent comments

            09 Mar 2010 18:37 marcu

            it just works.

            Screenshot

            Project Spotlight

            OpenStack4j

            A Fluent OpenStack client API for Java.

            Screenshot

            Project Spotlight

            TurnKey TWiki Appliance

            A TWiki appliance that is easy to use and lightweight.