Projects / ShaniXmlParser


ShaniXmlParser is an XML/HTML DOM/SAX parser. It can parse not well formed XML/HTML files. It can parse files with inverted tags and bad escaped &,<,> and ". It expands all XHTML entities by default. It is well suited to parse HTML files, and is fast with low memory usage. It is compliant with the jaxp/w3c DOM1/2/3 interfaces.

Operating Systems

Recent releases

  •  20 Dec 2009 09:53

    Release Notes: Invalid double decoding of certain entities in TextNode appendData/substring/replaceData was fixed. Invalid space trim during document normalization was fixed.

    •  26 Apr 2007 21:32

      Release Notes: The JAXP interfaces have been updated to 1.4.2. The SAX interfaces have been updated to sax2r3.

      •  24 Apr 2007 18:04

        Release Notes: Support for DOM 2 HTML interfaces. 668/685 successful tests on the DOM 2 HTML Test Validation suite.

        •  19 Apr 2007 17:34

          Release Notes: The HTML parser no longer removes non-HTML tags from an HTML document.

          •  16 Apr 2007 21:19

            Release Notes: Faster parsing of documents without namespaces.

            Recent comments

            28 May 2012 07:22 gslowikowski

            I'm mavenizing Play! Framework PDF module ( which uses your YaHP and Shani libraries.
            I found that you have modified Jaxen and Xml-apis dependencies.
            Jaxen 1.1.1 is just 1.1.1 version without "org.jaxen.dom4j", "org.jaxen.jdom" and "org.jaxen.xom" packages.
            I have problem with xml-apis. What version of Apache xml-apis is is based on. Why have you changed the copyright headers and (most important) what is the license of this library? Is it Apache Licences or LGPL (is it possible to change it from oryginal Apache one)? I need to add licensing information to Maven project (pom.xml) files.
            Thank you in advance.

            Grzegorz Slowikowski


            Project Spotlight


            A Fluent OpenStack client API for Java.


            Project Spotlight

            TurnKey TWiki Appliance

            A TWiki appliance that is easy to use and lightweight.