All releases tagged Major feature enhancements


Release Notes: HyperText Markup Language (HTML) format support was introduced in this version. The ability to retrieve metadata such as document author, last modification date, and number of pages was added. An important new feature is the extraction of text from annotations (comments) embedded in ODT, DOC, DOCX, or RTF files. Some malfunctions were also fixed.


Release Notes: This is the first version available for Mac OS X and also the first version available as a C/C++ library in addition to the console application. MS PowerPoint binary format (PPT) support has been added. Headers, footers, and embedded XLS workbooks in DOC files are now supported. Text extraction from OpenDocument and OOXML formats has been significantly optimized. Many bugs have been fixed.


Release Notes: In addition to bug fixes and optimizations, Office Open XML (ISO/IEC 29500, also called OOXML, OpenXML, or MSOOXML) documents are now supported.


Release Notes: Support for ODT (OpenDocument) documents was added. Fixes were made to RTF format support.


Release Notes: Support for RTF documents was added.