Vilistextum


Vilistextum is a small, fast HTML-to-text converter. It is quite fault-tolerant and copes well with badly formed or otherwise quirky HTML. It fully supports different character sets (e.g. Unicode). It can optimize its output for ebook reading, collapse multiple blank lines, and turn links into footnotes. A GUI frontend using Kaptain is included.
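
To make two of the features above concrete (collapsing multiple blank lines and turning links into footnotes), here is a minimal Python sketch. It is not Vilistextum's code and does not reproduce its output format; the class name `FootnoteTextExtractor`, the function `html_to_text`, and the `[n]` footnote style are all hypothetical choices made for illustration. It relies only on the standard library's `html.parser`, which tolerates unclosed tags.

```python
import re
from html.parser import HTMLParser


class FootnoteTextExtractor(HTMLParser):
    """Hypothetical helper: collect text and number each <a href> as a footnote."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.parts = []       # text fragments in document order
        self.links = []       # footnote targets in order of appearance
        self.open_hrefs = []  # hrefs of <a> tags that are currently open

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self.open_hrefs.append(dict(attrs).get("href"))
        elif tag in ("p", "div"):
            self.parts.append("\n\n")  # block elements start a new paragraph
        elif tag == "br":
            self.parts.append("\n")

    def handle_endtag(self, tag):
        if tag == "a" and self.open_hrefs:
            href = self.open_hrefs.pop()
            if href:  # anchors without href get no footnote
                self.links.append(href)
                self.parts.append(f" [{len(self.links)}]")
        elif tag in ("p", "div"):
            self.parts.append("\n\n")

    def handle_data(self, data):
        self.parts.append(data)


def html_to_text(html):
    """Convert HTML to plain text with collapsed blank lines and link footnotes."""
    parser = FootnoteTextExtractor()
    parser.feed(html)
    parser.close()  # flush any text still buffered by the parser
    text = "".join(parser.parts)
    # Collapse any run of blank lines into a single blank line.
    text = re.sub(r"\n\s*\n+", "\n\n", text).strip()
    notes = "\n".join(f"[{i}] {url}" for i, url in enumerate(parser.links, 1))
    return text + ("\n\n" + notes if notes else "")


if __name__ == "__main__":
    sample = '<p>See <a href="http://example.org/">the site</a>.</p><p></p><p>Bye'
    print(html_to_text(sample))
```

The sample input deliberately leaves the last paragraph unclosed and includes an empty one, so the sketch exercises both the tolerant parsing and the blank-line collapsing. A real converter also has to handle character sets, lists, tables, and line wrapping, none of which is attempted here.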


Recent releases

  •  25 Oct 2006 13:59

    Release Notes: Support for several Unicode ligatures and the --links-inline option were added. Libiconv is no longer mandatory for the Unicode version. Memory leaks and a core dump were fixed.

  •  21 Dec 2004 07:55

    Release Notes: Several buffer overflows and a compilation problem on Solaris were fixed. Fault tolerant behaviour when encountering invalid byte sequences was improved.

  •  24 Aug 2004 18:26

    Release Notes: Automatic detection of the UTF-8 locale has been improved. The default encoding for XML documents is now UTF-8. Several small bugfixes for parsing the character set were made.

  •  02 Jun 2004 06:58

    Release Notes: Several small bugfixes were made.

  •  20 Apr 2004 14:40

    Release Notes: The UTF8 version now compiles cleanly on FreeBSD. Several problems with text formatting were fixed, all comments were changed to C90 comments (/* ... */), and a small test suite was added.

Recent comments

08 Aug 2001 20:13 Dideon

Great Utility
I am using vilistextum instead of sgmltools (I couldn't get that to install properly) to convert HTML generated from DocBook to text. The results were great, and I love the options (especially showing the link references).
I would rate this pretty highly.

