1159 projects tagged "Text Processing"

TCPDF (updated 10 Apr 2014)

Pop 1,555.91
Vit 620.09

TCPDF is a PHP class for generating PDF documents without requiring external extensions. TCPDF supports all ISO page formats and custom page formats, custom margins and units of measure, UTF-8 Unicode, RTL languages, HTML, barcodes, TrueTypeUnicode, TrueType, OpenType, Type1, and CID-0 fonts, images, graphic functions, clipping, bookmarks, JavaScript, forms, page compression, digital signatures, and encryption.

Highlight (updated 09 Apr 2014)

Pop 1,155.10
Vit 287.87

Highlight is a universal converter from source code to HTML, XHTML, RTF, TeX, LaTeX, SVG, BBCode, and terminal escape sequences. (X)HTML and SVG output are formatted by Cascading Style Sheets. It supports more than 170 programming languages, and includes 80 highlighting color themes. The configuration files are Lua scripts with plug-in support. The converter includes some features to provide a consistent layout of the output code.

PCRE (updated 04 Apr 2014)

Pop 851.76
Vit 151.80

The PCRE library is a set of functions that implement regular expression pattern matching using the same syntax and semantics as Perl 5, with just a few differences. PCRE is used by many programs, including Exim, Postfix, and PHP.

TXR (updated 05 Apr 2014)

Pop 745.93
Vit 108.54

TXR is a new data munging language. TXR's special pattern language provides template-based matching of entire documents or large sections of documents. It also contains a language for functional and imperative programming. It is written in C and takes the form of a utility that is portable to Unix-like platforms and Windows.

DocBook Doclet (updated 28 Feb 2014)

Pop 755.86
Vit 104.10

DocBook Doclet is a javadoc doclet that creates DocBook XML and UML class diagrams from Javadoc.

Java Serialization to XML (updated 06 Mar 2014)

Pop 598.26
Vit 94.41

JSX serializes Java objects to XML. You can persist objects, evolve them, and send them over the network and between applications. Your object data becomes human-readable and human-writable. You can test it, search it, profile it, audit it, and edit it with ordinary text and XML tools. JSX handles all POJOs and also all classes that require Java's own object serialization.

Isearch (updated 20 May 1998)

Pop 16.61
Vit 76.22

Isearch is software for indexing and searching text documents. It supports full-text and field-based search, relevance-ranked results, Boolean queries, and heterogeneous databases. It can parse many kinds of documents "out of the box," including HTML, mail folders, list digests, SGML-style tagged data, and USMARC. It can be extended to support other formats by creating descendant classes in C++ that define the document structure. Customizing it this way is fairly easy if you know some C++, though you will need to download the source code via FTP. A CGI interface is also included for Web-based searching.

iText (updated 17 Feb 2014)

Pop 758.76
Vit 72.37

iText is a library that contains classes to generate and manipulate documents in the Portable Document Format (PDF). Document manipulation includes splitting, merging, and filling out forms (AcroForms, static and dynamic XFA forms).

texi2html (updated 23 Dec 1999)

Pop 23.98
Vit 72.30

Texi2html is a Perl script that converts Texinfo files into HTML.

Drunkifier (updated 28 Dec 1999)

Pop 16.31
Vit 72.27

Drunkifier takes input from the keyboard, a pipe, or a form on a web page and translates it into drunken text.

Project Spotlight

execline

A small, non-interactive, shell-like scripting language.

Project Spotlight

fix8

A modern C++ FIX framework featuring complete schema customisation, high performance, and fast development.