RSS All releases of Jericho HTML Parser

  •  31 Oct 2012 21:22

    Release Notes: This release includes important bugfixes and various enhancements.

    •  05 Mar 2011 08:18

      Release Notes: This version includes important bugfixes and various enhancements including HTML5 support.

      •  11 Jun 2009 12:09

        Release Notes: Important bugfixes and a new stream-based parsing option allowing memory efficient processing of large files.

        •  10 Apr 2009 10:06

          Release Notes: This version is a major new release that requires the Java 5 runtime or later. It introduces major API changes such as generics and enums, as well as some new features.

          •  25 Jun 2008 06:56

          Release Notes: This version includes important bugfixes and the following enhancements. Non-server tags are no longer recognized inside server tags. Microsoft downlevel-revealed conditional comments are recognized. All unnecessary white space may be removed from a source document. Various other enhancements were made to existing features.

          •  02 Sep 2007 05:41

          Release Notes: This version includes important bugfixes and introduces the following minor enhancements: elements inside SCRIPT elements are ignored. Encoding detection and analysis were improved. Parsing of attributes containing server tags was improved.

          •  20 May 2007 04:30

          Release Notes: This version has been released under a dual licence system, allowing a choice between the Eclipse Public License (EPL) and the LGPL. It includes important bugfixes and introduces the following major features: simple rendering of HTML markup into text, integrated logging with various logging frameworks, and easier parsing of HTML tags containing server tags.

          •  10 Sep 2006 23:45

          Release Notes: Version 2.3 includes important bug fixes as well as some minor improvements to source indenting functionality.

          •  19 Jun 2006 10:58

          Release Notes: This release includes important bugfixes, and introduces the following major features: much more efficient parsing of the entire source document, the ability to indent the source document based on its element hierarchy, and the ability to navigate the document element hierarchy.

          •  25 Dec 2005 05:32

          Release Notes: This version adds two main features. The first is the Segment.extractText() method, which extracts all of the text from a segment of the source document, removing all markup and collapsing whitespace. This is simple text extraction only and makes no attempt to render the markup. The second feature is the fact that new Source constructors automatically determine the encoding of an input stream. The new getEncoding() and getEncodingSpecificationInfo() methods allow the user to see which encoding was used and how it was determined.

          Screenshot

          Project Spotlight

          Capstone

          A lightweight multi-arch, multi-platform disassembly framework.

          Screenshot

          Project Spotlight

          gslapt

          A GTK version of slapt-get.