Jericho HTML Parser is a Java library allowing analysis and manipulation of parts of an HTML document, including server-side tags, while reproducing verbatim any unrecognized or invalid HTML. It also provides high-level HTML form manipulation functions.
| Tags | Text Processing Markup HTML/XHTML Software Development Libraries Java Libraries Internet Web Dynamic Content |
|---|---|
| Licenses | LGPL |
| Operating Systems | OS Independent |
| Implementation | Java |
Recent releases


Release Notes: This release includes important bugfixes and various enhancements.


Release Notes: This version includes important bugfixes and various enhancements including HTML5 support.


Release Notes: Important bugfixes and a new stream-based parsing option allowing memory efficient processing of large files.


Release Notes: This version is a major new release that requires the Java 5 runtime or later. It introduces major API changes such as generics and enums, as well as some new features.


Release Notes: This version includes important bugfixes and the following enhancements. Non-server tags are no longer recognized inside server tags. Microsoft downlevel-revealed conditional comments are recognized. All unnecessary white space may be removed from a source document. Various other enhancements were made to existing features.
Java-based nuclear physics data acquisition.