194 projects tagged "Linguistic"
Booleano is an interpreter of Boolean expressions; a library to define and run filters available as text (e.g. in a natural language) or in Python code. In order to handle text-based filters, Booleano ships with a fully-featured parser whose grammar is adaptive: Its properties can be overridden using simple configuration directives. On the other hand, the library exposes a Pythonic API for filters written in pure Python. These filters are particularly useful to build reusable conditions from objects provided by a third party library.
queXC is a Web-based data cleaning and coding/classification system that takes a data file (such as data collected from a questionnaire) and cleans the text input fields by spacing them and spell checking them. It allows operators to code text fields to existing coding schemes, or to create a coding scheme on the fly. Multiple operators can code and clean simultaneously, with the ability to assign operators to do particular codes. The queXC system includes some coding schemes created from ABS (Australian Bureau of Statistics) data. It can be used as an open source replacement for Nvivo in some situations.
After the Deadline for WordPress is a plugin that interfaces with After the Deadline, a Web service that helps you improve your writing and spend less time editing. This plugin adds a button for checking spelling and writing style to the WordPress visual editor mode. An API key is required to access the After the Deadline service.
seltz_analyzer is a PHP class that tries to find the most important words inside a well-formed XHTML trunk. Every word takes a score based on the role in the XHTML structure. For example, a word between strong tags will take 5 points. In addition, it will look at some simple syntax rules. For example a word with the first character uppercase will take 4 points. The score is cumulative, so the more a word is used, the more meaning it will have.
pyPEG is a quick and easy solution for creating a parser in Python programs. pyPEG uses a PEG language in Python data structures to parse, so it can be used dynamically to parse nearly every context free language. The output is a plain Python data structure called pyAST, or, as an alternative, XML.
Siglar is a simple and lightweight program, yet powerful and fast. It automatically creates many suggestions for new acronyms using the initial letters of words or word parts in a phrase or name. Siglar has many options to configure how it creates the acronym suggestions. The suggested acronyms can be copied to another application or saved in a text file (UTF-8 format).
Spelling Suggestion Class is a PHP class that can be used to suggest text spelling corrections using the Yahoo API. It takes text as a parameter and sends a request to the Yahoo Spelling Suggestion Web service API server. The class returns a text string with the corrected spelling text if any was suggested.
libalinga-java is a Java native interface to libalinga. It provides C++ and Java wrappers, as well as Java classes. It also provides control files to generate them from the libalinga interface using the program swig. The major and minor versions of this JNI will track that of libalinga, but its bugfixes are independent of libalinga bugfixes.