LibAxl is an efficient implementation of the XML 1.0 standard. It has no external library dependencies and offers a clean implementation based on opaque types, with a consistent API for manipulating your XML documents without compromising your code. It is extremely memory efficient and thread safe, with a small footprint (111k). It also includes XML Namespaces support.
SILVERCODERS DocToText is a powerful utility that converts documents in many formats to plain text. It includes a console application and a C/C++ library, which allows text extraction mechanisms to be embedded into other applications. It supports MS Office binary formats (MS Word (DOC), MS Excel (XLS, XLSB), MS PowerPoint (PPT), and Rich Text Format (RTF)), OpenDocument formats (text documents (ODT), spreadsheets (ODS), presentations (ODP), and graphics (ODG)), Office Open XML formats (MS Word (DOCX), MS Excel (XLSX), and MS PowerPoint (PPTX)), iWork formats (PAGES, NUMBERS, KEYNOTE), OpenDocument Flat XML formats (FODP, FODS, FODT), Portable Document Format (PDF), email files (EML), and HyperText Markup Language (HTML). DocToText can extract text not only from the document body but also from annotations (comments) embedded in ODT, DOC, DOCX, or RTF files, and it can read metadata such as the author, last modification date, or number of pages. It can be used as a fast console viewer, and it is able to convert corrupted OpenDocument and Office Open XML documents, so it can recover text even when other recovery methods have failed.
Emdros is a corpus query system for storing and searching linguistically annotated text. It is very generic, supporting almost any kind of annotation from almost any linguistic theory. All linguistic levels of analysis are supported, including phonology, morphology, the lexical level, syntax, and discourse. The core libraries act as a middleware layer between a client and an underlying SQL database. MySQL, PostgreSQL, and SQLite are supported.
Syntext Serna is a highly customizable, multi-platform, pure XSL-driven WYSIWYG XML content editor. It looks and works like a conventional word processor, yet it gives you much more flexibility to alter or reuse content while keeping its integrity intact. It lets you collaborate on compound documents that contain embedded data from various sources. Content is viewed as an integrated whole that is intuitively transparent to authors.
Vim is an almost fully compatible version of the Unix editor Vi. Many new features have been added, including multi-level undo, syntax highlighting, command-line history, online help, filename completion, and block operations. It is descended from the vi clone "stevie" and runs on many systems, including Unix, MS Windows, OS/2, Macintosh, VMS, and Amiga.