2131 projects tagged "Text Processing"

Download Website Updated 27 Jan 2014 GNU awk

Screenshot
Pop 516.82
Vit 19.62

The awk utility interprets a special-purpose programming language that makes it possible to handle simple data-reformatting jobs with just a few lines of code.

Download Website Updated 30 Jan 2001 GNU fontutils

Screenshot
Pop 61.87
Vit 1.41

The fontutils package includes the programs bpltobzr, bzrto, charspace, fontconvert, gsrenderfont, imageto, imgrotate, limn, and xbfe. These create fonts for use with Ghostscript or TeX (starting with a scanned type image and converting the bitmaps to outlines), convert between font formats, etc. The package also includes the libraries libbzr.a, libgf.a, libpbm.a, libpk.a, libtfm.a, and libwidgets.a.

Download Website Updated 12 Jan 2014 GNU m4

Screenshot
Pop 630.73
Vit 18.10

GNU m4 is an implementation of the traditional Unix macro processor. It is mostly SVR4 compatible, although it has some extensions (for example, handling more than 9 positional parameters to macros). GNU m4 also has built-in functions for including files, running shell commands, doing arithmetic, etc. Autoconf needs GNU m4 for generating `configure' scripts, but not for running them.

Download Website Updated 21 Dec 2013 GNU TeXmacs

Screenshot
Pop 778.27
Vit 73.94

GNU TeXmacs is a free wysiwyw (what you see is what you want) editing platform with special features for scientists. The software aims to provide a unified and user friendly framework for editing structured documents with different types of content: text, mathematics, graphics, interactive content. TeXmacs can also be used as an interface to many external systems for computer algebra, numerical analysis, and statistics. New presentation styles can be written by the user and new features can be added to the editor using Scheme.

No download Website Updated 19 Dec 2006 gocr

Screenshot
Pop 127.38
Vit 4.14

GOCR is optical character recognition software. It converts PNM files into ASCII files.

Download Website Updated 19 Sep 2004 GPP

Screenshot
Pop 226.12
Vit 4.07

GPP is a general-purpose preprocessor with customizable syntax, suitable for a wide range of preprocessing tasks. Its independence from any programming language makes it much more versatile than cpp, while its syntax is lighter and more flexible than that of m4. The syntax is fully customizable, which makes it possible to process text files, HTML, or source code equally efficiently in a variety of languages.

Download Website Updated 29 Jan 2010 Groff

Screenshot
Pop 242.75
Vit 5.14

The Groff package contains the traditional UN*X text formatting tools troff, nroff, tbl, eqn, and pic. These utilities, together with the man package, are essential for displaying the online manual pages. Output can be produced in a number of formats including plain ASCII and PostScript. All the standard macro packages are supported. A number of other utilities are also included together with several fonts.

Download Website Updated 05 Dec 2001 Grok

Screenshot
Pop 53.67
Vit 2.37

Grok is a library of Java components for performing various natural language tasks. These include several preprocessing tasks, chart parsing, a large categorial grammar for English (induced from the Penn treebank), and some knowledge representation components (basic coreference, salience tracking, etc.). The library also has a companion kit which provides a GUI interface to the components, several of which are implementations of interfaces in the Quipu OpenNLP API.

Download No website Updated 16 Mar 2004 gtkkanjipad

Screenshot
Pop 32.37
Vit 1.17

gtkkanjipad is a GTK widget for Japanese (kanji), and limited Chinese (hanzi), handwriting recognition. It is mostly based on Owen Taylor's KanjiPad, and includes Perl bindings.

Download Website Updated 08 Feb 2001 Guava

Screenshot
Pop 36.74
Vit 2.56

The Guava tools are a set of Perl scripts for HTML pre-processing. You can create multi-page documents with contents tables, or use templates to give a consistent look to a set of pages. All output is passed through the C preprocessor, so you can use directives such as #include, #define and #if. There are also built-in macros for producing dates, cross references, etc.

Screenshot

Project Spotlight

Cerridwen

Accurate solar system data for everyone.

Screenshot

Project Spotlight

Toxic

A general purpose template engine.