Apertium is a machine translation platform, initially aimed at related-language pairs, but recently expanded to deal with more divergent language pairs (such as English-Catalan). The platform provides a language-independent machine translation engine, tools to manage the linguistic data necessary to build a machine translation system for a given language pair, and linguistic data for a growing number of language pairs.
Algraeph is a tool for manual alignment of linguistic graphs, such as phrase structure trees or dependency structures, where each node corresponds to a subsequence of the analyzed input sentence. It allows you to express the similarity between two graphs by aligning their nodes and attaching relation labels to these alignments. Graphs are read from one or more graphbanks (or treebanks) in the GraphML or Alpino formats. Alignment relations are user-defined and are stored in a simple XML format, which can be used for further processing. The resulting parallel graph corpus is a useful data set for many tasks in computational linguistics and natural language processing.
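As an illustration of the kind of data such a tool produces, the sketch below models labeled node alignments between two graphs and round-trips them through XML. The element and attribute names (`alignments`, `align`, `from`, `to`, `relation`), the node identifiers, and the relation labels are all hypothetical; Algraeph's actual file format is not described here and may differ.

```python
from dataclasses import dataclass
import xml.etree.ElementTree as ET


@dataclass(frozen=True)
class NodeAlignment:
    """One labeled link between a node in each of two graphs."""
    source_node: str  # node id in the first graph (hypothetical id scheme)
    target_node: str  # node id in the second graph
    relation: str     # user-defined relation label


def alignments_to_xml(alignments):
    """Serialize alignments to a simple XML string (assumed schema)."""
    root = ET.Element("alignments")
    for a in alignments:
        ET.SubElement(
            root,
            "align",
            {"from": a.source_node, "to": a.target_node, "relation": a.relation},
        )
    return ET.tostring(root, encoding="unicode")


def alignments_from_xml(text):
    """Parse the XML produced by alignments_to_xml back into objects."""
    root = ET.fromstring(text)
    return [
        NodeAlignment(e.get("from"), e.get("to"), e.get("relation"))
        for e in root.iter("align")
    ]


# Two example alignments with made-up relation labels.
example = [
    NodeAlignment("s1_n3", "s2_n5", "equals"),
    NodeAlignment("s1_n7", "s2_n2", "restates"),
]
xml_text = alignments_to_xml(example)
restored = alignments_from_xml(xml_text)
```

Keeping each alignment as a flat (source, target, relation) triple makes the parallel corpus easy to filter by relation label in later processing.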
Esperantilo ("Tool for Esperanto") is a UTF-8 text editor with linguistic functions for Esperanto, and also a system for computer-aided translation. It contains a spell checker and a grammar checker for Esperanto. It can translate Esperanto text in different formats to Polish, German, English, and Swedish, and can translate from Polish and English. It also supports computer-aided translation through interactive machine translation. The translation memory can be used for any language pair. It is also an XLIFF editor, and it supports the XLIFF and TMX (Level 1) formats. Machine translation uses direct translation at the syntax level.
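To show what a TMX translation memory looks like in practice, here is a minimal sketch that reads source/target segment pairs from a small TMX document using only the Python standard library. It is not taken from Esperantilo; the sample content and the function name `read_tmx_pairs` are assumptions made for illustration. Level 1 TMX carries plain text only, so each segment is read as text.

```python
import xml.etree.ElementTree as ET

# The xml:lang attribute lives in the predefined XML namespace.
XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"

# A tiny, made-up TMX document with two translation units.
SAMPLE_TMX = """<tmx version="1.4">
  <header creationtool="example" creationtoolversion="0.1" segtype="sentence"
          o-tmf="none" adminlang="en" srclang="eo" datatype="plaintext"/>
  <body>
    <tu>
      <tuv xml:lang="eo"><seg>Saluton, mondo!</seg></tuv>
      <tuv xml:lang="en"><seg>Hello, world!</seg></tuv>
    </tu>
    <tu>
      <tuv xml:lang="eo"><seg>Dankon.</seg></tuv>
      <tuv xml:lang="en"><seg>Thank you.</seg></tuv>
    </tu>
  </body>
</tmx>"""


def read_tmx_pairs(tmx_text, src_lang, tgt_lang):
    """Return (source, target) segment pairs for the given language codes."""
    src_lang, tgt_lang = src_lang.lower(), tgt_lang.lower()
    root = ET.fromstring(tmx_text)
    pairs = []
    for tu in root.iter("tu"):
        # Collect the segment text of every language variant in this unit.
        segs = {}
        for tuv in tu.findall("tuv"):
            lang = tuv.get(XML_LANG)
            seg = tuv.find("seg")
            if lang is not None and seg is not None:
                segs[lang.lower()] = "".join(seg.itertext())
        # Keep only units that contain both requested languages.
        if src_lang in segs and tgt_lang in segs:
            pairs.append((segs[src_lang], segs[tgt_lang]))
    return pairs


pairs = read_tmx_pairs(SAMPLE_TMX, "eo", "en")
```

Because each translation unit may hold any number of language variants, the same file can serve as a memory for any language pair it contains.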
Isobel is a framework for building complex information retrieval and analysis systems. Isobel can be functionally divided into two subsystems: Isobel Gatherer (the crawling and filtering subsystem) and Isobel Analyzer (the analysis subsystem). The two subsystems can also be used separately. Isobel Gatherer offers ready-to-use services such as content fetching, scheduling, document format conversion, hyperlink graph storage and analysis, and content storage and indexing. A programmer may easily add new services. Isobel Analyzer is built on the IBM UIMA architecture, so it can reuse analysis components developed for UIMA.
Linguistic Tree Constructor is an application for drawing linguistic syntax trees. Its main strength is assisting in data production by quickly analyzing large amounts of text. "Generic" trees are supported, as well as RRG and X-Bar trees. Node categories are user-definable, and additional user-definable labels can also be applied to each node. Publication-quality, high-resolution, horizontal trees can be drawn. The file format is based on TIGER-XML.