RSS All releases of Cainteoir Engine

  •  13 Feb 2014 21:16

Release Notes: This release supports ePub 3 media overlays and ePub 3 navigation documents, improves HTML whitespace normalization, improves the HTML5 parser with funding from by the CSIR (Council for Scientific and industrial Research), adds ALSA support for audio, adds initial text-to-phoneme support (document text tokenization, context analysis, cardinal and ordinal number to word conversion, phonetic model and phoneme transcriptions, and pronunciation dictionaries), adds improvements and bugfixes, and adds a new dictionary and phoneme-converter application.

  •  02 Dec 2012 01:30

Release Notes: This release preserves the event information from the readers into the document model, reworks the document event structure to use a CSS-like subset, models the CSS3 Counter Styles (previously Lists) spec for defining the disc and decimal list types, adds support for onix and marc metadata in ePub 3 documents, and fixes reading MHT documents.

  •  02 Oct 2012 21:58

Release Notes: Initial support for MS Word generated HTML. Fixes table of content navigation within zip files. Improves MIME header From username and email extraction. Supports story-based mime-like headers Title, Story, Author, and Keyword. Supports dc:subject statements in OPF containing comma-separated keywords. Improves the phoneme transcription scheme chart generation. Uses C++11 range-based for loops. Supports building on clang >= 3.2. Full doxygen documentation for the public API. Simplifies the buffer API. Adds expression templates to the RDF Query Language API. Optimizes RDF select on subjects.

  •  31 Jul 2012 23:09

Release Notes: This release supports encoding selections for XML, HTML, MIME, and email, improves text formatting for doc2doc, improves argument and error handling for the commandline tools, switches NEWS and README to markdown format, supports mimetype/alternative for MIME, supports email embedded within html <pre> tags, fixes parsing of HTML containing implicit (missing) tags, supports HTML table markup, and is more relaxed about checking the epub mimetype file.

  •  03 Jun 2012 21:24

Release Notes: Supports PDF documents using the Poppler library. Supports reading generic zipped document collections. Supports extracting (X)HTML metadata from several meta tag names. Reports the filename as a ToC entry if no ToC is present. Switches document processing from an event model to a reader model. doc2doc: a simple command line tool for converting documents from one format to another.

  •  01 Apr 2012 12:36

Release Notes: This release supports espeak using installed mbrola voices, translates language and region names using the iso-codes package, implements the BCP 47 standard for interpreting language, script, and region tags, recognizes h1..h6 as table of content entries in (X)HTML, supports newsgroup information for email, supports property attributes on an empty property element in RDF/XML, and fixes detection of HTML which is valid XML, but is not marked as XHTML.

  •  30 Jan 2012 22:56

Release Notes: This version releases the file handle when finished recording Ogg files. It has improved HTML parsing. It supports epub 3 @refines and @datatype metadata. It handles malformed entities in opf (epub package) documents.

  •  24 Nov 2011 12:19

Release Notes: The shared-mime-info database is now used for MIME type detection. iconv is used for character encoding conversions. All Content-Transfer-Encoding types in MIME headers are supported, including Base64. Audio error handling during reading/recording was improved. Additional language code mappings for new espeak voices are supported. UND is now supported as a language identifier for Calibri eBooks. XML-encoded HTML without an associated xmlns is now supported.

  •  27 Jul 2011 23:20

Release Notes: This release support epub 2.0 table of contents. It supports epub 3.0 metadata. Basic support for SSML. cainteoir: read/record a range of sections in a document's table of contents. cainteoir: allows the voice reading speed, pitch, and volume to be set on the command line.

  •  01 Jul 2011 09:18

Release Notes: Single-file HTML pages (MHTML, MHT) are supported. RTF documents are supported. The heuristics for estimating the total reading time for documents were improved. Support for HTML documents was improved. The program no longer crashes when opening an epub with a missing OPF file.


Project Spotlight

CloverETL Designer

A visual data transformation designer for the CloverETL framework.


Project Spotlight

Template Data Interface (TDI)

A powerful markup template system for Python.