ExactScan is a versatile document capture application for home offices and workgroups. It is designed from the ground up for high-speed document scanners and can easily handle hundreds of images per minute, including duplex scans. Included functionality reaches from managing, sorting, and editing singles pages to writing multi- as well as single-page PDF files including JPEG compression and TIFF, JPEG, JPEG2000, and PNG bitmap files. ExactScan allows performing state of the art image processing including automatic cropping, deskewing, dynamic thresholding for perfect black and white documents, and descreening print rasters.
LyX is a document processor that encourages an approach to writing based on the structure of your documents, not their appearance. It is intended for people people who write and want their writing to look great without tinkering with formatting details, font attributes, or page boundaries. On screen, it looks like any word processor, but it uses the TeX engine for printed output and producing richly cross-referenced PDFs. It is stable and fully featured.
SILVERCODERS DocToText is a powerful utility which can convert documents in many formats to plain text. It includes a console application and C/C++ library, which allows embedding text extraction mechanisms into other applications. It supports MS Office binary formats (MS Word (DOC), MS Excel (XLS, XLSB), MS PowerPoint (PPT), and Rich Text Format (RTF)), OpenDocument formats (text documents (ODT), spreadsheets (ODS), presentations (ODP) and graphics (ODG)), Office Open XML formats (MS Word (DOCX), MS Excel (XLSX), and MS PowerPoint (PPTX)), iWork formats (PAGES, NUMBERS, KEYNOTE), OpenDocument Flat XML formats (FODP, FODS, FODT), Portable Document Format (PDF), Email files (EML), and HyperText Markup Language (HTML). DocToText can extract text not only from the document body but also from annotations (comments) embedded in odt, doc, docx, or rtf files and read metadata like author, last modification date, or number of pages. It can be used as a fast console viewer, and is able to convert corrupted OpenDocument and Office Open XML documents. It can be used to recover text even if other recovery methods failed.
GNU TeXmacs is a free wysiwyw (what you see is what you want) editing platform with special features for scientists. The software aims to provide a unified and user friendly framework for editing structured documents with different types of content: text, mathematics, graphics, interactive content. TeXmacs can also be used as an interface to many external systems for computer algebra, numerical analysis, and statistics. New presentation styles can be written by the user and new features can be added to the editor using Scheme.
Foswiki is wiki software, supporting the editing of Web pages in an ordinary Web browser by end users. What makes Foswiki special is that it supports the embedding of active and passive macros that enhance the page content (e.g. with global or dynamic information) and allow end-users to build applications that store and process data in a structured manner.
Barcode Writer in Pure Postscript implements the printing of many barcode formats entirely within PostScript, so that the process of converting the input string into the printed output is performed by the printer or print system. The project supports all major barcode formats including: EAN-13 (JAN-13), EAN-8 (JAN-8), UPC-A, UPC-E, EAN-5 & EAN-2 (EAN/UPC add-ons), ISBN (including legacy ISBN), ISMN (including legacy ISMN), ISSN, Code 128 (A, B & C), GS1-128, SSCC-18 (EAN-18, NVE), EAN-14, Code 39, Code 39 Extended, Code 93, Code 93 Extended, Code 32 (Italian Pharmacode), Pharmazentralnummer (PZN), Interleaved 2 of 5, ITF-14 (UPC SCS), GS1 DataBar (Omnidirectional, Stacked, Stacked Omnidirectional, Limited, Expanded, Expanded Stacked), Code 2 of 5 (Industrial, IATA, Matrix, Datalogic & COOP), Code 11 (USD-8), BC412, Codabar (NW-7), Pharmacode (including two-track), MSI, Plessey, Telepen, Channel Code, PosiCode, PDF417, Data Matrix (ECC200), QR Code (including Micro QR Code), and more.
TWiki is a flexible, powerful, and simple Web based collaboration platform. It is suitable for dynamic intranets and knowledge bases, and for sharing and managing documents and collaborative projects. It resembles a normal Web site, but every page can be changed from a browser. It features automatic link generation, full text search, group authorization, Web forms, reporting, change notification, file attachments, revision control of pages and attachments, a modular templating system with skins, hierarchical navigation based on the topic parenting feature, and more. Plugins can be used to enhance the program and build groupware applications.
OOoPy is a Python library for modifying OpenOffice.org documents. It provides a set of transformations on the OOo XML format using the ElementTree XML Library. Transformations included are a mail merge application and the concatenation of documents with formatting intact. The framework supports easy creation of new transformations.