seltz_analyzer is a PHP class that tries to find the most important words inside a well-formed XHTML trunk. Every word takes a score based on the role in the XHTML structure. For example, a word between strong tags will take 5 points. In addition, it will look at some simple syntax rules. For example a word with the first character uppercase will take 4 points. The score is cumulative, so the more a word is used, the more meaning it will have.
pyPEG is a quick and easy solution for creating a parser in Python programs. pyPEG uses a PEG language in Python data structures to parse, so it can be used dynamically to parse nearly every context free language. The output is a plain Python data structure called pyAST, or, as an alternative, XML.
Spelling Suggestion Class is a PHP class that can be used to suggest text spelling corrections using the Yahoo API. It takes text as a parameter and sends a request to the Yahoo Spelling Suggestion Web service API server. The class returns a text string with the corrected spelling text if any was suggested.
UnicodeDataBrowser is a browser for the UnicodeData.txt file, which contains much useful information but is not easily read by humans. It creates a scrollable table in which columns represent properties. The table may be sorted on any column. Abbreviations are expanded and characters cross-referenced in decomposition and casing fields are named. Regular expression search restricted to a selected column is available. The set of characters for which information is displayed may be restricted to those characters matching a regular expression on a specified property.
JOrtho is a spell checker for Java. The library works with any JTextComponent from the Swing framework and checks as you type. The dictionary is based on the free Wiktionary.org, and is applicable for multiple languages. You can select the spell checking language via a context menu. The Features of JOrtho are the highlighting of potentially wrongly spelled words, a context menu with suggestions for correct forms of the word, and a context menu with option to change the checking language. At the moment there are nine languages for spell checking available: English, German, French, Spanish, Italian, Russian, Polish, Dutch, and Arabic.
PottyMouth transforms completely unstructured and untrusted text to valid, nice-looking, completely safe XHTML. PottyMouth is designed to handle input text from non-technical, potentially careless, or malicious users. It produces HTML that is completely safe, programmatically and visually, to include on any Web page. You don't need to make your users read any instructions before they start typing. They don't even need to know that PottyMouth is being used.
Java Fathom is a Java library that measures the readability of English text. This package is a port of two Perl packages: Lingua::EN::Fathom and Lingua::EN::Syllable. It provides methods for measuring Fog Index, Flesch reading ease level, and Flesch-Kincaid grade level indices for English text, as well as other word/syllable statistics.
rospell is a UTF-8 text editor for programmers and general use. It features syntax highlighting, a code beautifier for C and C++, and support for gdb, ctags, grep, diff, and merge. It includes spellers for English, French, German, Spanish, and Romanian languages. Romanian aspell and hunspell dictionaries are also available.
Linguistico is a set of language tools based on the Italian language. It includes a dictionary, a thesaurus, word definitions, and other scripts and programs. You can use these tools with OpenOffice.org, Mozilla Thunderbird, Mozilla Firefox, MySpell, MyThes, Aspell, and HunSpell.