pyPEG is a quick and easy solution for creating a parser in Python programs. pyPEG uses a PEG language in Python data structures to parse, so it can be used dynamically to parse nearly every context free language. The output is a plain Python data structure called pyAST, or, as an alternative, XML.
SILVERCODERS OCR Server is a server-based optical character recognition (OCR) and PDF conversion solution for enterprises. It is able to perform conversion of printed documents to editable and searchable formats like plain text, RTF, PDF, and HTML, providing highly accurate recognition in 189 languages. It is available as a Linux application or a stand-alone machine, with a fully documented API, very good performance, and flexible licensing rules. It has been designed specifically for the purpose of cooperation with document management systems such as SILVERCODERS DocStorage.
Project35 is an application suite that allows users to generate data entry forms from XML schema. Application designers use a Configuration Tool to associate records and record fields defined in the schema with application properties that include features such as: validation services, controlled vocabulary services, general plugins, and various aspects of look-and-feel.
MonoDecrypt uses pattern matching and its knowledge about character frequencies in order to decrypt messages encoded with a monoalphabetic substitution cipher. MonoDecrypt can decrypt texts of any language, as long as it has sufficient information about the language. Depending on the information you give it, the tool decrypts about 50%-100% on its own. Then you can decrypt the remaining data by filling the gaps or correcting bad guesses. MonoDecrypt can also encrypt texts using monoalphabetic substitution.
Apertium is a machine translation platform, initially aimed at related-language pairs, but recently expanded to deal with more divergent language pairs (such as English-Catalan). The platform provides a language-independent machine translation engine, tools to manage the linguistic data necessary to build a machine translation system for a given language pair, and linguistic data for a growing number of language pairs.