Projects / xmlformat

xmlformat

xmlformat is a configurable formatter (or "pretty-printer") for XML documents. It gives the user control over indentation, line-breaking, and text wrapping. These properties can be defined on a per-element basis.

Tags
Licenses
Implementation

RSS Recent releases

  •  15 Aug 2006 01:37

Release Notes: Each token is now assigned an input line number, which is displayed in error messages. This provides better information to the user about the location of problems in input files. The token stack is now printed when an error occurs. This provides some idea of the context of the element that is malformed or has malformed content.

  •  27 Mar 2004 00:35

Release Notes: Minor fixes were made for compatibility with Ruby 1.8.x.

  •  07 Feb 2004 00:22

Release Notes: This release adds options for in-place formatting and backup creation, and a tutorial document.

  •  29 Jan 2004 19:07

No changes have been submitted for this release.

RSS Recent comments

04 Feb 2004 05:29 xmldoc Thumbs up

A clever tool that works as advertised

Definitely give this app a try. You'll need to do some initial
configuration to teach it, for example, which elements in your XML
files are inline elements and which are block elements, which
elements need to be handled as "verbatim" elements, and which you
want it to whitespace-normalize. But once you've done that initial
configuration, I think you'll find it works as expected --
including handling mixed content correctly.

In testing it with a number of files of 15,000+ lines, I never
found a single instance of it adding whitespace where it shouldn't
have been added or deleting whitespace where it should have been
preserved, or wrapping or indenting anything I didn't ask it
to.

For a few more details, see the
xmlhack item (xmlhack.com/read.php?i...) I
wrote about it.

31 Jan 2004 12:30 pauldubois

Re: don't pretty print

Hmm. I'm uncertain how to respond to this. It
seems to be a comment such as
one might write without having examined the software
in question or read any
of its documentation.

First.
Please cite your reference for saying that any and all
whitespace in XML
is significant.

The XML spec (www.w3.org/TR/REC-xml)
doesn't appear to forbid the
idea of optional whitespace. For example, see
www.w3.org/TR/REC-xml#..., which
says:

"2.10 White Space Handling

In editing XML documents, it is often convenient to
use "white space"
(spaces, tabs, and blank lines) to set apart the markup
for greater
readability. Such white space is typically not intended
for inclusion in the
delivered version of the document. On the other hand,
"significant" white
space that should be preserved in the delivered version
is common, for
example in poetry and source code."

In addition, the spec for Canonical XML (http://
www.w3.org/TR/xml-c14n)
deals with various types of "reformatting" and appears
to reject the notion
that whitespace *must* be preserved. In particular, see
www.w3.org/TR/xml-c14n....

Still, all that is irrelevant for my purposes. If a
document is mine, it's
my call which whitespace should be preserved and
which may be transformed.
If I want to reformat my documents, I will. I fail to see
the value of
telling people that should not do with their documents as
they see fit.
Sure, you can reformat a document in such a way that
it becomes unsuitable
for some purposes. I assume that users are intelligent
enough to know what
is permissible for their purposes and what is not.

Second.
It's true that some editors decide to reformat things.
That's a case of some
program performing reformatting without consulting
you. xmlformat doesn't
reformat anything unless you ask it to. It doesn't sneak
up on you and work
its will on you unbeckoned. The two situations are quite
different.

It's entirely irrelevant what an editor might do,
except in the sense that
xmlformat can in fact be used to compensate for the
formatting imposed on
you by an editor: As it happens, one of the motivations
for writing
xmlformat was to have a way to put XML files in a
standard format before
checking them into a revision control system. If
different people work on
the files using different editors with different format
conventions, it
artificially balloons the size of diffs and makes them
more difficult to
read. xmlformat helps reduce this problem. I apologize
if this was not
clear, though in my defense, it's necessary to read only
into the second
paragraph of the documentation to find it out.

29 Jan 2004 20:15 elanthis

don't pretty print
Doing things like this absolutely breaks the XML spec. Any and all whitespace is significant. Adding whitespace for "pretty printing", assuming XML processors all work like HTML, is very evil.

I've had quite a few apps breaks because editors decide to be helpful and reformat things, adding spaces which the app interpreted literally (as it *should*, according to the XML spec). This kind of thing shouldn't be encouraged. ;-)

Screenshot

Project Spotlight

sshdfilter

A program that automatically blocks ssh brute force attacks.

Screenshot

Project Spotlight

Catharsis.Grails.Widgets

A Grails 2 tag library that provides useful social media tags to include in Web pages.