Java Mozilla Html Parser

Mozilla Java Html Parser is a Java package that parses HTML pages into a Java Document object. It is a wrapper around Mozilla's HTML parser, which gives the user browser-quality HTML parsing. The parser was developed as part of Dapper.
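As a rough illustration of what working with a parsed page might look like, the sketch below walks a DOM tree to pull out the page title and link targets. It does not use this package's own API, which is not shown here. It assumes the resulting Document is a standard org.w3c.dom.Document, and it uses the JDK's built-in XML parser as a stand-in, so the sample markup has to be well-formed. The class and method names (DomWalkSketch, toDocument, title, linkTargets) are hypothetical.

```java
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;

public class DomWalkSketch {

    // Stand-in for the HTML parser (assumption): builds a DOM Document with the
    // JDK's XML parser, so the input must be well-formed. A browser-quality
    // HTML parser would also accept malformed real-world markup.
    static Document toDocument(String markup) throws Exception {
        DocumentBuilder builder = DocumentBuilderFactory.newInstance().newDocumentBuilder();
        return builder.parse(new InputSource(new StringReader(markup)));
    }

    // Returns the trimmed text of the first <title> element, or "" if none exists.
    static String title(Document doc) {
        NodeList titles = doc.getElementsByTagName("title");
        return titles.getLength() == 0 ? "" : titles.item(0).getTextContent().trim();
    }

    // Collects the href attribute of every <a> element that has one.
    static List<String> linkTargets(Document doc) {
        List<String> hrefs = new ArrayList<>();
        NodeList anchors = doc.getElementsByTagName("a");
        for (int i = 0; i < anchors.getLength(); i++) {
            Element anchor = (Element) anchors.item(i);
            if (anchor.hasAttribute("href")) {
                hrefs.add(anchor.getAttribute("href"));
            }
        }
        return hrefs;
    }

    public static void main(String[] args) throws Exception {
        String page = "<html><head><title>Example</title></head>"
                + "<body><a href=\"/docs\">Docs</a><a>No target</a></body></html>";
        Document doc = toDocument(page);
        System.out.println(title(doc));
        System.out.println(linkTargets(doc));
    }
}
```

Once a Document is available, the traversal code is the same regardless of which parser produced it, so only toDocument would need to change to use a different parser.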

Recent releases

  •  21 Jan 2008 10:31

Release Notes: This release has a major performance boost and a major encoding-related bugfix.

  •  30 Jul 2007 07:13

Release Notes: Missing DLL files were added to the package. Parsing of the title tag and entities was improved.

  •  19 Feb 2007 08:21

Release Notes: The parser is now fully parallelized and fully scalable. Performance improvements were made, and this version is 30% faster than the previous version.

  •  07 Feb 2007 11:17

Release Notes: Many bugs related to HTML parsing were fixed. The size of mozilla-components-base has decreased, and it is now attached to the source files.
