The Aida project is two-fold: it defines a simple common markup language designed to describe structured text, and it implements a compiler that translates the Aida syntax into various target formats (HTML, LaTeX, Trac, Mediawiki, Text, Markdown, and others). The system is fast, flexible, and extensible. The core command, aida, is a strict parser (written using Bison and Flex), which analyzes files written in the Aida Markup Language and invokes callbacks in order to convert them to the target format. It embeds a Tcl language interpreter and the callbacks are written in Tcl. This makes it very easy to extend the library and to define new target formats. Furthermore, the Aida files are highly parameterizable via a header and the entire system is configurable at the admin and at the user level. It is also possible to evaluate Tcl code within an Aida file and thus create dynamic contents.
The BID software was designed to apply business rules to your email traffic. Business information can come from business applications, monitoring applications, or normal users requesting or providing data. The goal of the BID tool is to automate the dispatching of this information and to join a business-case with a responsible receiver. A receiver can be a human being or another BID server customized for a special business-case. The BID server was designed as a generic server using plugins to serve specific business-rules.
Xlit converts text from one writing system into another. It allows the user to define a transliteration simply by typing the input strings in one window and the strings to which they are to be mapped in another. Transliteration may be restricted to regions bounded by specified delimiters or their complements. Transliteration may also be performed by external commands or plugins. Xlit can also convert one type of delimiter to another, e.g. from HZ escapes to XML. Xlit can read and write transliteration definitions in its own format and as Yudit keymaps. It can be run in batch mode without the GUI.
uni2ascii and ascii2uni provide conversion in both directions between UTF-8 Unicode and more than thirty 7-bit ASCII equivalents, including RFC 2396 URI format and RFC 2045 Quoted Printable format, the representations used in HTML, SGML, XML, OOXML, the Unicode standard, Rich Text Format, POSIX portable charmaps, POSIX locale specifications, and Apache log files. It can also convert between the escapes used for Unicode in languages such as Ada, C, Common Lisp, Java, Pascal, Perl, Postscript, Python, Scheme, and Tcl.
Ellogon is a multi-lingual, cross-platform, general-purpose language engineering environment, developed in order to aid both researchers who are doing research in computational linguistics, as well as companies who produce and deliver language engineering systems. As a language engineering platform, it offers an extensive set of facilities, including tools for processing and visualising textual/HTML/XML data and associated linguistic information, support for lexical resources (like creating and embedding lexicons), tools for creating annotated corpora, accessing databases, comparing annotated data, or transforming linguistic information into vectors for use with various machine learning algorithms.