Enca detects the encoding of text files, on the basis of knowledge of their language. It can also convert them to other encodings, allowing you to recode files without knowing their current encoding. It supports most of Central and East European languages, and a few Unicode variants, independently on language.
The Enchant extension aims to provide a full featured Enchant binder for PHP. The Enchant library provides a common API to work with many spelling tools. All inputs and outputs are in UTF-8 encoding. All language tags are based on ISO standards. It supports: Aspell/Pspell (intends to replace Ispell), Ispell, MySpell/Hunspell (an OOo projects also used by Mozilla), Uspell (primarily Yiddish, Hebrew, and Eastern European languages), Hspell (Hebrew), and AppleSpell (Mac OS X).
ExactScan is a versatile document capture application for home offices and workgroups. It is designed from the ground up for high-speed document scanners and can easily handle hundreds of images per minute, including duplex scans. Included functionality reaches from managing, sorting, and editing singles pages to writing multi- as well as single-page PDF files including JPEG compression and TIFF, JPEG, JPEG2000, and PNG bitmap files. ExactScan allows performing state of the art image processing including automatic cropping, deskewing, dynamic thresholding for perfect black and white documents, and descreening print rasters.
FreeMarker is a template engine that was originally designed so that servlet-based applications could keep graphical design separate from application logic. The templates provide an easy and highly flexible way to generate any kind of text output (HTML, PostScript, TeX, source code, etc.) from a variety of data sources such as Java objects, Jython objects, XML object models, and more.
GNU Source-highlight produces a document with syntax highlighting when given a source file. It handles many languages, e.g., Java, C/C++, Prolog, Perl, PHP3, Python, Flex, HTML, and other formats, e.g., ChangeLog and log files, as source languages and HTML, XHTML, DocBook, ANSI color escapes, LaTeX, and Texinfo as output formats. Input and output formats can be specified with a regular expression-oriented syntax.
GXPARSE is not a new XML parser, but is an additional processing layer that makes it much easier to use event-based parsers like the SAX parser. It supports both direct sequential output and random access output (via the Resequencer interface). The random access mode delays output until all input has been processed, but makes it much easier to handle ID/IDREF attributes. GXPARSE maintains most advantages of the event-based parser. Application development and maintenance is considerably easier, but processing is a little slower.