2131 projects tagged "Text Processing"

Download Website Updated 14 Jun 2014 TCPDF

Screenshot
Pop 1,935.19
Vit 709.67

TCPDF is a PHP class for generating PDF documents without requiring external extensions. TCPDF supports all ISO page formats and custom page formats, custom margins and units of measure, UTF-8 Unicode, RTL languages, HTML, barcodes, TrueTypeUnicode, TrueType, OpenType, Type1, and CID-0 fonts, images, graphic functions, clipping, bookmarks, JavaScript, forms, page compression, digital signatures, and encryption.

Download Website Updated 12 Jun 2014 TXR

Screenshot
Pop 622.75
Vit 142.94

TXR is a new data munging language. TXR's special pattern language provides template-based matching of entire documents or large sections of documents. It also contains a language for functional and imperative programming. It is written in C and takes the form of a utility that is portable to Unix-like platforms and Windows.

Download Website Updated 11 Jun 2014 SeaMonkey

Screenshot
Pop 1,815.03
Vit 257.51

The SeaMonkey project is a community effort to develop an all-in-one Internet application suite. It contains an Internet browser, email and newsgroup client with an included Web feed reader, HTML editor, IRC chat, and Web development tools, and is sure to appeal to advanced users, Web developers, and corporate users. It uses much of the Mozilla source code powering such successful siblings as Firefox, Thunderbird, Camino, Sunbird, and Miro.

Download Website Updated 09 Jun 2014 Recoll

Screenshot
Pop 473.01
Vit 116.75

Recoll is a personal full text desktop search tool based on Xapian. It provides an easy to use, feature-rich, easy administration interface with a Qt-based GUI. Text, HTML, PDF, PostScript, MS Word, OpenOffice, Wordperfect, KWord, Abiword, maildir, and mailbox mail folder formats are supported, along with their compressed versions and quite a few others. Powerful query facilities are provided. Multiple character sets are supported, and internal processing and storage uses Unicode UTF-8. Stemming is performed at query time and the stemming language can be switched after indexing.

No download Website Updated 07 Jun 2014 Tables for Ada

Screenshot
Pop 92.24
Vit 57.17

This library provides an implementation of tables indexed by strings. The binary search is used for names of known length. It is also possible to search a table for names of unknown length, i.e. to parse a string using some table. Table elements can be of any private type. Key- insensitive tables are supported.

Download Website Updated 04 Jun 2014 doclifter

Screenshot
Pop 353.04
Vit 87.27

doclifter helps with lifting documents with nroff markup to XML-DocBook. Lifting documents from presentation level to semantic level is hard, and a really good job requires human polishing. This tool aims to do everything that can be mechanized, and to preserve any troff-level information that might have structural implications in XML comments. TBL tables are translated into DocBook table markup, PIC into SVG, and EQN into MathML (relying on pic2svg and GNU eqn for the last two).

Download Website Updated 03 Jun 2014 ANTLR

Screenshot
Pop 289.72
Vit 5.37

ANTLR (ANother Tool for Language Recognition) is a language tool that provides a framework for constructing recognizers, compilers, and translators from grammatical descriptions containing C++, Java, or Sather actions. It is similar to the popular compiler generator YACC, however ANTLR is much more powerful and easy to use. ANTLR-produced parsers are not only highly efficient, but are both human-readable and human-debuggable (especially with the interactive ParseView debugging tool). ANTLR can generate parsers, lexers, and tree-parsers in either C++, Java, or Sather. ANTLR is currently written in Java.

Download Website Updated 02 Jun 2014 Vrapper

Screenshot
Pop 238.78
Vit 56.05

Vrapper is an Eclipse plugin which acts as a wrapper for Eclipse text editors to provide a Vim-like input scheme for moving around and editing text. Unlike other plugins which embed Vim in Eclipse, Vrapper imitates the behavior of Vim while still using whatever editor you have opened in the workbench. The goal is to have the comfort and ease which comes with the different modes, complex commands, and count/operator/motion combinations which are the key features behind editing with Vim, while preserving the powerful features of the different Eclipse text editors, like code generation and refactoring.

Download No website Updated 31 May 2014 Ascii Design

Screenshot
Pop 178.03
Vit 22.55

Ascii Design is an ASCII art program based on the FIGlet engine. You can create text-based art for many types of decorations for Web sites, email, text files, etc.

Download No website Updated 28 May 2014 4Suite

Screenshot
Pop 282.67
Vit 3.66

4Suite is a Python-based toolkit for XML and RDF application development. It features a library of integrated tools for XML processing, implementing open technologies such as DOM, RDF, XSLT, XInclude, XPointer, XLink, XPath, XUpdate, RELAX NG, and XML/SGML Catalogs. Layered upon this is an XML and RDF data repository and server, which supports multiple methods of data access, query, indexing, transformation, rich linking, and rule processing, and provides the data infrastructure of a full database system, including transactions, concurrency, access control, and management tools. It also supports HTTP, RPC, SOAP, and FTP, plus APIs in Python and XSLT.

Screenshot

Project Spotlight

Cerridwen

Accurate solar system data for everyone.

Screenshot

Project Spotlight

Toxic

A general purpose template engine.