2131 projects tagged "Text Processing"

Download Website Updated 03 Mar 2009 seltz_analyzer

Screenshot
Pop 21.35
Vit 1.00

seltz_analyzer is a PHP class that tries to find the most important words inside a well-formed XHTML trunk. Every word takes a score based on the role in the XHTML structure. For example, a word between strong tags will take 5 points. In addition, it will look at some simple syntax rules. For example a word with the first character uppercase will take 4 points. The score is cumulative, so the more a word is used, the more meaning it will have.

Download Website Updated 05 Mar 2009 After the Deadline for WordPress

Screenshot
Pop 30.59
Vit 1.00

After the Deadline for WordPress is a plugin that interfaces with After the Deadline, a Web service that helps you improve your writing and spend less time editing. This plugin adds a button for checking spelling and writing style to the WordPress visual editor mode. An API key is required to access the After the Deadline service.

Download No website Updated 07 Mar 2009 ezphpconfig

Screenshot
Pop 9.95
Vit 1.00

Ezphpconfig generates a set of PHP configuration classes from a supplied XML file. You can then access your configuration values very quickly without having to parse the XML file on every request. The element (tag) names become property names and the text contained in the elements becomes the property's value. It also supports nested elements. If the generated PHP file is older than the XML file, it is re-generated using the data in the newer XML file. This class also supports array types using the element inside an element whose type attribute is set to "array".

No download Website Updated 12 Mar 2009 PlainDoc

Screenshot
Pop 37.52
Vit 1.00

PlainDoc is a document production system based on plain text files. It keeps most of the document in human readable form; the PlainDoc source itself serves as the plain text version of the document. It handles EPS, gnuplot, dia diagrams, tables, and verbatim text, uses LaTeX for PDF production, and can produce monolithic or paginated HTML and DocBook natively. Includes are supported. The file format is CVS friendly and easily diffable. It is suitable for software manuals and documentation, technical publishing, scientific papers, books, legal documents, and presentation slides.

Download Website Updated 27 Mar 2009 TextEditor++

Screenshot
Pop 17.32
Vit 1.00

TextEditor++ is a cross-platform text editor for both plain and formatted text files, and for printing. It includes a tool for PDF conversion of plain text files.

Download Website Updated 07 Apr 2009 pymur

Screenshot
Pop 58.63
Vit 1.00

Pymur provides Python bindings to the C++ based Lemur Toolkit. The Lemur Toolkit is an open-source toolkit designed to facilitate research in language modeling and information retrieval. Lemur supports a wide range of industrial and research language applications such as ad-hoc retrieval, site-search, and text mining.

Download Website Updated 30 Jun 2009 libcsv_parser++

Screenshot
Pop 53.03
Vit 1.00

libcsv_parser++ is a C++ library for parsing text files to extract records and fields. The records can be delimited with any set of characters. It makes the following assumptions: the record terminator is only one character in length; the field terminator is only one character in length; and the fields are enclosed by single characters, if any. The parser can handle documents where fields are always enclosed, not enclosed at all, or optionally enclosed. When fields are strictly all enclosed, there is an assumption that any enclosure characters within the field are escaped by placing a backslash in front of the enclosure character. The software could be ported to Windows with very little effort.

No download Website Updated 29 Jun 2010 RXPY

Screenshot
Pop 29.77
Vit 1.00

RXPY is a pure-Python regular expression library, compatible with the re package. RXPY can be used as a test-bed for new matching algorithms, since the parser and compatibility components are reusable.

Download Website Updated 22 Apr 2011 jEdit Markdown Plugin

Screenshot
Pop 38.30
Vit 1.00

jEdit Markdown Plugin is a plugin for jEdit that can render Markdown text from the current buffer or from selected text. The resulting HTML is written to a new buffer or copied to the clipboard. The plugin can also show a preview in a Web browser (using the Info Viewer plugin).

Download Website Updated 25 Oct 2011 Speedpad

Screenshot
Pop 26.32
Vit 1.00

Speedpad is a small and portable ncurses-powered tool to test, train, and increase typing speed on arbitrary text input. It is designed for intermediate-to-advanced level typists and assumes that you have already learned how to touch type. It does not use lessons, single words, or other synthetic stuff. It supports tab expansion, auto indentation, and syntax to train on code. It features a reference speed robot and supports CPS, CPM, WPM, PPM, and CPH/KPH metrics. It shows detailed statistics about speed and helps find and eliminate frequent typos. Stats are dumped to standard output in a machine-readable format after completion, and can be piped into gnuplot.

Screenshot

Project Spotlight

4DIAC

A framework for distributed industrial automation and control.

Screenshot

Project Spotlight

Scribus

A page layout (DTP) application.