Vilistextum is a small and fast HTML to text converter. It is quite fault-tolerant and deals well with badly-formed or otherwise quirky HTML. It has full support for different character sets (e.g. Unicode). It is able to optimize for ebook reading, collapse multiple blank lines, and create footnotes out of links. A GUI frontend using kaptain is included.
|Tags||Text Processing Markup HTML/XHTML|
Release Notes: Support for several Unicode ligatures and the --links-inline option were added. Libiconv is no longer mandatory for the Unicode version. Memory leaks and a core dump were fixed.
Release Notes: Several buffer overflows and a compilation problem on Solaris were fixed. Fault tolerant behaviour when encountering invalid byte sequences was improved.
Release Notes: Automatic detection of the UTF-8 locale has been improved. The default encoding for XML documents is now UTF-8. Several small bugfixes for parsing the character set were made.
Release Notes: Several small bugfixes were made.
Release Notes: The UTF8 version now compiles cleanly on FreeBSD. Several problems with text formatting were fixed, all comments were changed to C90 comments (/* ... */), and a small test suite was added.