Projects / Java Mozilla Html Parser

Java Mozilla Html Parser

Mozilla Java Html Parser is a Java package that enables you to parse HTML pages into a Java Document object. The parser is a wrapper around Mozilla's HTML parser, thus giving the user a browser-quality HTML parser. This parser was developed as a part of Dapper.

Tags
Licenses
Operating Systems
Implementation

Recent releases

  •  21 Jan 2008 18:31

    Release Notes: This release has a major performance boost and a major encoding-related bugfix.

    •  30 Jul 2007 14:13

      Release Notes: Missing DLL files were added to the package. Parsing of the title tag and entities was improved.

      •  19 Feb 2007 16:21

        Release Notes: The parser is now fully parallelized and fully scalable. Performance improvements were made, and this version is 30% faster than the previous version.

        •  07 Feb 2007 19:17

          Release Notes: Many bugs related to HTML parsing were fixed. The size of mozilla-components-base has decreased, and it is now attached to the source files.

          •  29 Jan 2007 01:31

            No changes have been submitted for this release.

            Screenshot

            Project Spotlight

            OpenStack4j

            A Fluent OpenStack client API for Java.

            Screenshot

            Project Spotlight

            TurnKey TWiki Appliance

            A TWiki appliance that is easy to use and lightweight.