RSS 47 projects tagged "Linguistic"

Download Website Updated 11 Feb 2013 ANTLR

Screenshot
Pop 296.90
Vit 5.52

ANTLR (ANother Tool for Language Recognition) is a language tool that provides a framework for constructing recognizers, compilers, and translators from grammatical descriptions containing C++, Java, or Sather actions. It is similar to the popular compiler generator YACC, however ANTLR is much more powerful and easy to use. ANTLR-produced parsers are not only highly efficient, but are both human-readable and human-debuggable (especially with the interactive ParseView debugging tool). ANTLR can generate parsers, lexers, and tree-parsers in either C++, Java, or Sather. ANTLR is currently written in Java.

Download Website Updated 05 Oct 2013 Apache Lucene

Screenshot
Pop 258.49
Vit 21.25

Apache Lucene is a high-performance, full-featured text search engine library written entirely in Java. It is suitable for nearly any application that requires full-text search, especially cross-platform.

Download Website Updated 12 Sep 2008 Redet

Screenshot
Pop 229.75
Vit 11.02

Redet is a tool for developing and executing regular expressions using any of more than 50 search programs, editors, and programming languages, intended both for developing regular expressions for use elsewhere and as a search tool in its own right. For each program in each locale, a palette showing the available constructs is provided. The properties of each program are determined by runtime tests, which guarantees that they will be correct for the program version and locale. Additional features include persistent history, extensive help, a variety of character entry tools, and the ability to change locale while running. Redet is highly configurable and fully supports Unicode.

Download Website Updated 10 Jan 2014 pyPEG

Screenshot
Pop 194.81
Vit 26.73

pyPEG is a quick and easy solution for creating a parser in Python programs. pyPEG uses a PEG language in Python data structures to parse, so it can be used dynamically to parse nearly every context free language. The output is a plain Python data structure called pyAST, or, as an alternative, XML.

Download Website Updated 15 May 2011 uni2ascii

Screenshot
Pop 189.59
Vit 12.21

uni2ascii and ascii2uni provide conversion in both directions between UTF-8 Unicode and more than thirty 7-bit ASCII equivalents, including RFC 2396 URI format and RFC 2045 Quoted Printable format, the representations used in HTML, SGML, XML, OOXML, the Unicode standard, Rich Text Format, POSIX portable charmaps, POSIX locale specifications, and Apache log files. It can also convert between the escapes used for Unicode in languages such as Ada, C, Common Lisp, Java, Pascal, Perl, Postscript, Python, Scheme, and Tcl.

Download Website Updated 26 May 2011 International Components for Unicode (C/C++)

Screenshot
Pop 184.79
Vit 12.38

ICU provides a Unicode implementation, with functions for formatting numbers, dates, times, and currencies (according to locale conventions, transliteration, and parsing text in those formats). It provides flexible patterns for formatting messages, where the pattern determines the order of the variable parts of the messages, and the format for each of those variables. These patterns can be stored in resource files for translation to different languages. Included are more than 100 codepage converters for interaction with non-unicode systems.

Download Website Updated 30 Jan 2001 Ciao Prolog

Screenshot
Pop 109.78
Vit 1.00

Ciao is a complete Prolog system subsuming ISO-Prolog with a novel modular design which allows both restricting and extending the language. Ciao extensions currently include feature terms (records), higher-order, functions, constraints, objects, persistent predicates, a good base for distributed execution (agents), and concurrency. Libraries also support WWW programming, sockets, and external interfaces (C, Java, TCL/Tk, relational databases, etc.). An Emacs-based environment, a stand-alone compiler, and a toplevel shell are also provided.

Download Website Updated 29 Dec 2007 GNU Talk Filters

Screenshot
Pop 110.43
Vit 4.80

The GNU Talk Filters are filter programs that convert ordinary English text into text that mimics a stereotyped or otherwise humorous dialect. Some of these filters have been in the public domain for many years, but here they are provided as a single integrated package. The filters include austro, b1ff, brooklyn, chef, cockney, drawl, dubya, fudd, funetak, jethro, jive, kraut, pansy, pirate, postmodern, redneck, valspeak, and warez. This package provides the filters both as individual executables and collectively as a C library, so they can be easily embedded in other programs.

Download Website Updated 22 Jan 2004 Pythoñol

Screenshot
Pop 102.65
Vit 2.11

Pythoñol is an all-in-one program that helps English speakers learn Spanish. It features pronunciation, verb conjugation, a dictionary with over 70,000 words, a thesaurus, quizzes, full-text translation, idioms, a verb browser, and a large reference section.

Download Website Updated 13 May 2008 yawl

Screenshot
Pop 86.38
Vit 3.52

This is a comprehensive "word game" word list for UNIX/Linux. It is a superset of the author's ENABLE list, the "OSW", and various lists researched by the author's colleague, Alan Beale. At 264,093 words, it is the largest list of its kind, suitable for use in all manners of crossword-type board games and word construction games, as well as for a spell checker dictionary. The YAWL package now includes two anagramming utilities (supplied as source code, handled by the included Makefile). There is also a shell script that extends the UNIX "strings" system command. This is the word list package recommended for the author's Quackey word game.

Screenshot

Project Spotlight

MatrixSSL

An embedded SSL/TLS implementation for small footprint devices.

Screenshot

Project Spotlight

Aspose.Pdf for Reporting Services

A system to generate PDF reports from MS SQL Server 2000 & 2005 Reporting Services.