RSS 2129 projects tagged "Text Processing"

Download No website Updated 17 Apr 2005 4Suite

Screenshot
Pop 281.93
Vit 3.67

4Suite is a Python-based toolkit for XML and RDF application development. It features a library of integrated tools for XML processing, implementing open technologies such as DOM, RDF, XSLT, XInclude, XPointer, XLink, XPath, XUpdate, RELAX NG, and XML/SGML Catalogs. Layered upon this is an XML and RDF data repository and server, which supports multiple methods of data access, query, indexing, transformation, rich linking, and rule processing, and provides the data infrastructure of a full database system, including transactions, concurrency, access control, and management tools. It also supports HTTP, RPC, SOAP, and FTP, plus APIs in Python and XSLT.

No download Website Updated 31 Jan 2000 A.L.I.C.E. and AIML

Screenshot
Pop 110.34
Vit 72.08

The ALICE software implements AIML (Artificial Intelligence Markup Language), a non-standard evolving markup language for creating chat robots. The primary design feature of AIML is minimalism. Compared with other chat robot languages, AIML is perhaps the simplest. The pattern matching language is very simple, for example permitting only one wild-card ('*') match character per pattern. AIML is an XML language, implying that it obeys certain grammatical meta-rules. The choice of XML syntax permits integration with other tools such as XML editors. Another motivation for XML is its familiar look and feel, especially to people with HTML experience.

Download Website Updated 09 Jun 2002 a2ps

Screenshot
Pop 236.70
Vit 2.59

a2ps is an Any to PostScript filter. Of course it processes plain text files, but also pretty prints quite a few popular languages (66). Moreover it has the ability to delegate the processing of some files to other filters (such as groff, texi2dvi, dvips, gzip etc.), which allows a uniform treatment (n-up, page selection etc.) of heterogeneous files.

Download Website Updated 11 Sep 2010 AFT

Screenshot
Pop 253.37
Vit 8.39

AFT (Almost Free Text) is a document preparation system. It is mostly free form, meaning that there is little intrusive markup; AFT source documents look a lot like plain old ASCII text. It has a few rules for structuring your document, more to do with formatting your text than embedding lots of commands, and it produces all types of output (HTML, XHTML, LaTeX, roll-your-own XML, etc.). All that needs to be done is to edit a rule file. You can even customize your own rule files for specialized output.

Download Website Updated 30 Jan 2001 aliases2cdbm

Screenshot
Pop 76.52
Vit 1.00

Aliases2cdbm is a utility for converting mail aliases from a text file (e.g., /etc/aliases) into input suitable for the cdbmake utility. Cdbmake can then create a constant database (CDB) suitable for reliable, high-speed mail alias lookups.

Download Website Updated 21 Apr 2014 align

Screenshot
Pop 138.51
Vit 6.83

align is a general-purpose text filter tool that helps vertically align columns in string-separated tables of input text. It also includes width, another general-purpose text filter tool that helps you work with the printing width or length of lines of input text.

Download Website Updated 28 Nov 2005 antiword

Screenshot
Pop 540.81
Vit 3.85

Antiword is a free MS-Word reader for Linux, RISC OS, and DOS. It converts the documents from Word 2, 6, 7, 97, 2000, 2002, and 2003 to text, Postscript, and XML/DocBook. Antiword tries to keep the layout of the document intact.

Download Website Updated 11 Feb 2013 ANTLR

Screenshot
Pop 297.19
Vit 5.52

ANTLR (ANother Tool for Language Recognition) is a language tool that provides a framework for constructing recognizers, compilers, and translators from grammatical descriptions containing C++, Java, or Sather actions. It is similar to the popular compiler generator YACC, however ANTLR is much more powerful and easy to use. ANTLR-produced parsers are not only highly efficient, but are both human-readable and human-debuggable (especially with the interactive ParseView debugging tool). ANTLR can generate parsers, lexers, and tree-parsers in either C++, Java, or Sather. ANTLR is currently written in Java.

Download Website Updated 30 Jan 2001 ASCII art printer

Screenshot
Pop 49.94
Vit 1.00

The ASCII art printer shows letters printed in underscores, slashes, backslashes, and pipe characters. It can be used with figlet.

Download Website Updated 30 Jan 2001 ascii2pdf

Screenshot
Pop 118.45
Vit 1.01

ascii2pdf is a simple text to PDF converter. It has options for font, font size, and portrait vs. landscape.

Screenshot

Project Spotlight

wacky-tracky

A modern task tracking application that follows open standards and supports tags, subtasks, and more.

Screenshot

Project Spotlight

Google Map GPS Cell Phone Tracker

Web server and phone client applications for periodically tracking android, iOS, Windows Phone, and Jave ME cellphones.