RSS 11 projects tagged "General"

Download Website Updated 17 Feb 2001 Cryptic Muse

Screenshot
Pop 39.38
Vit 1.75

Cryptic Muse is a library for performing fast word pattern searches including anagrams, wildcards, overlap, containment, and masks. It provides searches for entry files like dictionaries.

Download Website Updated 26 May 2011 International Components for Unicode (C/C++)

Screenshot
Pop 184.62
Vit 12.40

ICU provides a Unicode implementation, with functions for formatting numbers, dates, times, and currencies (according to locale conventions, transliteration, and parsing text in those formats). It provides flexible patterns for formatting messages, where the pattern determines the order of the variable parts of the messages, and the format for each of those variables. These patterns can be stored in resource files for translation to different languages. Included are more than 100 codepage converters for interaction with non-unicode systems.

Download Website Updated 06 Mar 2008 Mguesser

Screenshot
Pop 32.50
Vit 2.51

Mguesser is a tool to guess a text's character set and language. It is a standalone part of the mnoGoSearch engine. More than 100 various character set and language combinations are supported.

Download Website Updated 04 Jul 2011 Emdros

Screenshot
Pop 158.51
Vit 16.83

Emdros is a corpus query system for storing and searching linguistically annotated text. It is very generic, supporting almost any kind of annotation from almost any linguistic theory. All linguistic levels of analysis are supported, including phonology, morphology, the lexical level, syntax, and discourse. The core libraries act as a middleware layer between a client and an underlying SQL database. MySQL, PostgreSQL, and SQLite are supported.

Download Website Updated 15 May 2011 uni2ascii

Screenshot
Pop 187.70
Vit 12.22

uni2ascii and ascii2uni provide conversion in both directions between UTF-8 Unicode and more than thirty 7-bit ASCII equivalents, including RFC 2396 URI format and RFC 2045 Quoted Printable format, the representations used in HTML, SGML, XML, OOXML, the Unicode standard, Rich Text Format, POSIX portable charmaps, POSIX locale specifications, and Apache log files. It can also convert between the escapes used for Unicode in languages such as Ada, C, Common Lisp, Java, Pascal, Perl, Postscript, Python, Scheme, and Tcl.

No download Website Updated 29 Nov 2005 isobel

Screenshot
Pop 25.61
Vit 1.00

Isobel is a framework to build complex information retrieval and analysis systems. Isobel can be functionally divided in two subsytems, Isobel Gatherer (the crawling and filtering subsystem) and Isobel Analyzer (the analysis subsystem). The two subsytems can also be used separately. Isobel Gatherer offers ready-to-use services like content fetching, scheduling, document format conversion, Hyperlink graph storage and analysis, content storage and indexing. A programmer may easily add new services. Isobel Analyzer uses the IBM UIMA architecture to reuse the analysis components developed for this architecture.

Download Website Updated 24 Aug 2009 CharEntry

Screenshot
Pop 35.28
Vit 3.83

CharEntry is a tool for inserting non-ASCII characters into text, with particular emphasis on linguistic notation. It provides charts of the consonants, vowels, and diacritics of the International Phonetic Alphabet as well as a chart of precomposed accented characters. Clicking on a character inserts it into a text region, the contents of which may be saved to a file or copied and pasted elsewhere. A widget for inserting characters by Unicode codepoint is also provided. Furthermore, it is possible to read the definition of a custom character chart from a file.

Download Website Updated 05 Nov 2007 OmegaT

Screenshot
Pop 29.83
Vit 1.07

OmegaT is a translation memory application intended for professional translators. It does not translate for you (software that does this is called "machine translation"). It features fuzzy matching, match propagation, simultaneous processing of multiple-file projects, simultaneous use of multiple translation memories, and external glossaries. Document file formats include plain text, HTML, and OpenOffice.org/StarOffice. It has Unicode (UTF-8) support (can be used with non-Latin alphabets). It is compatible with other translation memory applications (TMX Level 1).

Download Website Updated 09 Dec 2007 libuninum

Screenshot
Pop 71.11
Vit 3.71

libuninum is a library for converting Unicode strings to integers and integers to Unicode strings. Internal computation is done using arbitrary precision arithmetic, so there is no limit on the size of the integer that can be converted. Values are passed and returned as ASCII decimal strings, GNU MP mpz_t objects, or unsigned long integers. Auto-detection of the number system is provided. Very many number systems are supported. Group delimitation for output strings is fully controllable. Command line and graphical interfaces are also provided.

Download Website Updated 23 Aug 2007 JDing

Screenshot
Pop 11.75
Vit 49.27

JDing is a clone of the Unix translation tool "Ding". It has been ported to Java to make it platform independent. Ding dictionaries can be used. JDing is a simple but powerful dictionary.

Screenshot

Project Spotlight

Wenity

A multi-platform Zenity clone.

Screenshot

Project Spotlight

EC2Box

A Web-based multi-terminal ssh tool for EC2 instances.