ANTLR (ANother Tool for Language Recognition) is a language tool that provides a framework for constructing recognizers, compilers, and translators from grammatical descriptions containing C++, Java, or Sather actions. It is similar to the popular compiler generator YACC, but is designed to be more powerful and easier to use. ANTLR-produced parsers are not only highly efficient but also human-readable and human-debuggable (especially with the interactive ParseView debugging tool). ANTLR can generate parsers, lexers, and tree parsers in C++, Java, or Sather. ANTLR itself is currently written in Java.
Diogenes is a tool for searching and browsing the Latin and ancient Greek texts published on CD-ROM by the Packard Humanities Institute and the Thesaurus Linguae Graecae. It comes as an easy-to-install stand-alone application for GNU/Linux, Mac OS X, and Windows, based on the Firefox browser (i.e. XULRunner). Alternatively, a network administrator can install it as a server on a local network, and users can then access it via an ordinary Web browser. There is also a command-line tool, which can optionally format output as LaTeX instead of HTML.
ICU provides a Unicode implementation, with functions for formatting numbers, dates, times, and currencies according to locale conventions, for transliteration, and for parsing text in those formats. It provides flexible patterns for formatting messages, where the pattern determines the order of the variable parts of the message and the format for each of those variables. These patterns can be stored in resource files for translation into different languages. Included are more than 100 codepage converters for interaction with non-Unicode systems.
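The idea that a pattern, rather than the calling code, decides the order of the variable parts can be sketched as follows. This is only an illustration using Python's built-in str.format as a stand-in; it is not the ICU API. The PATTERNS table (standing in for a resource file), the format_message helper, and the sample German wording are all assumptions made for the example.

```python
# Hypothetical stand-in for a resource file: one message pattern per locale.
# The numbered placeholders refer to the same arguments in every locale,
# so each translation is free to reorder them.
PATTERNS = {
    "en": "{0} copied {1} files to {2}.",
    "de": "{1} Dateien wurden von {0} nach {2} kopiert.",
}


def format_message(locale: str, *args: object) -> str:
    """Look up the pattern for a locale and fill in the arguments."""
    return PATTERNS[locale].format(*args)


# The caller passes the arguments in one fixed order for every locale.
english = format_message("en", "Alice", 3, "backup")
german = format_message("de", "Alice", 3, "backup")
print(english)
print(german)
```

Because the argument order lives in the pattern, a translator can rearrange a sentence without any change to the program that supplies the values.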
polcnv is designed to convert files between the different encodings used for Polish texts. It can also be used to convert plain text documents in any language that uses a supported character encoding. The program uses ISO-10646 UCS-4 (equivalent to Unicode UTF-32) as its internal representation.
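Converting through a fixed-width internal representation can be sketched roughly as below. This is not polcnv's code or command-line interface; it uses Python's standard codecs, and the convert function, the choice of ISO-8859-2 and CP1250 as the two encodings, and the sample sentence are assumptions made for illustration.

```python
def convert(data: bytes, src: str, dst: str) -> bytes:
    """Re-encode data from src to dst via a UTF-32 (UCS-4) intermediate."""
    # Step 1: source bytes -> fixed-width 4-byte code units.
    ucs4 = data.decode(src).encode("utf-32-be")
    # Step 2: 4-byte code units -> target encoding.
    return ucs4.decode("utf-32-be").encode(dst)


# A Polish pangram containing every Polish diacritic letter.
sample = "Zażółć gęślą jaźń"
latin2 = sample.encode("iso8859_2")
cp1250 = convert(latin2, "iso8859_2", "cp1250")

print(latin2 != cp1250)                  # the byte sequences differ
print(cp1250.decode("cp1250") == sample)  # the text itself is preserved
```

Routing every conversion through one intermediate form means each supported encoding needs only a decoder and an encoder, rather than a dedicated converter for every pair of encodings.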
ssct is a command-line utility, humble of intent, that takes a single word, spell-checks it, and then translates the resulting suggestion(s). It works to and from English only. The available languages are limited by ispell in the first instance, and by the IDP (Internet Dictionary Project) files in the second. Currently the latter include Spanish, Portuguese (minimal), Latin, German, French, and Italian. These files are included with this package. This utility was originally created to make it easier to decode badly scrawled postcards from Spain.
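The check-then-translate flow can be sketched in a few lines. This is not how ssct itself is implemented: the real tool relies on ispell and the IDP files, whereas this sketch substitutes Python's difflib for the spell checker and a tiny made-up glossary for the dictionary. The names SPANISH_TO_ENGLISH and check_and_translate are hypothetical.

```python
import difflib

# Hypothetical miniature glossary standing in for an IDP dictionary file.
SPANISH_TO_ENGLISH = {
    "playa": "beach",
    "plaza": "square",
    "saludos": "greetings",
    "tiempo": "time",
}


def check_and_translate(word: str) -> list[tuple[str, str]]:
    """Return (corrected spelling, English translation) pairs for one word."""
    word = word.lower()
    known = list(SPANISH_TO_ENGLISH)
    if word in known:
        candidates = [word]
    else:
        # Stand-in for the spell checker: closest known spellings.
        candidates = difflib.get_close_matches(word, known, n=3, cutoff=0.7)
    return [(c, SPANISH_TO_ENGLISH[c]) for c in candidates]


print(check_and_translate("playa"))
print(check_and_translate("tienpo"))
```

A correctly spelled word is translated directly, while a misread one is first mapped to its nearest known spellings, each of which is then translated.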