468 projects tagged "Text Processing"

TEA (updated 05 Jun 2013, no download)

Pop 576.29
Vit 189.94

TEA is a powerful and easy-to-use Qt4-based editor with many useful features for HTML, Docbook, and LaTeX editing. It features a small footprint, a tabbed layout engine, support for multiple encodings, code snippets, templates, customizable hotkeys, an "open at cursor" function for HTML files and images, miscellaneous HTML tools, preview in external browser, string manipulation functions, Morse-code tools, bookmarks, syntax highlighting, and more.

Highlight (updated 04 Jun 2013)

Pop 832.25
Vit 173.80

Highlight is a universal converter from source code to HTML, XHTML, RTF, TeX, LaTeX, SVG, BBCode, and terminal escape sequences. (X)HTML and SVG output are formatted by Cascading Style Sheets. It supports more than 170 programming languages, and includes 80 highlighting color themes. The configuration files are Lua scripts with plug-in support. The converter includes some features to provide a consistent layout of the output code.

Sanzang (updated 04 Jun 2013)

Pop 107.75
Vit 3.20

Sanzang is a compact and simple cross-platform machine translation system. It is especially useful for translating from the CJK languages (Chinese, Japanese, and Korean), and it is well suited to working with ancient and otherwise difficult texts. Unlike most other machine translation systems, Sanzang is small and approachable. Any user can develop their own translation rules; the rules are simply stored in a text file and applied at runtime.
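The idea of translation rules kept in a plain text file and applied at runtime can be illustrated with a minimal Python sketch. It is not Sanzang's own code or rule format: the tab-separated layout, the `#` comment convention, and the names `load_rules` and `translate` are all assumptions made for this example.

```python
# Hypothetical rule table: one rule per line, "source<TAB>target".
# This layout is an assumption for illustration, not Sanzang's real format.
RULES_TEXT = "如是\tthus\n我聞\tI have heard\n我\tI\n"


def load_rules(text):
    """Parse rules from text and order them longest source term first."""
    rules = []
    for line in text.splitlines():
        if not line.strip() or line.startswith("#") or "\t" not in line:
            continue  # skip blank lines, comments, and malformed lines
        source, target = line.split("\t", 1)
        rules.append((source.strip(), target.strip()))
    # Longest terms first, so a multi-character term wins over its prefix.
    rules.sort(key=lambda rule: len(rule[0]), reverse=True)
    return rules


def translate(text, rules):
    """Scan left to right, replacing the longest matching term at each position."""
    pieces = []
    i = 0
    while i < len(text):
        for source, target in rules:
            if text.startswith(source, i):
                pieces.append(" " + target + " ")
                i += len(source)
                break
        else:
            # No rule matched here: pass the character through unchanged.
            pieces.append(text[i])
            i += 1
    # Collapse the padding spaces added around each replacement.
    return " ".join("".join(pieces).split())


if __name__ == "__main__":
    print(translate("如是我聞", load_rules(RULES_TEXT)))
```

Because the rules are plain data, editing the text file changes the translation without touching the program, which is the property the description emphasizes.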

GNU Parallel (updated 22 May 2013)

Pop 591.20
Vit 51.89

GNU parallel is a shell tool for executing jobs in parallel, either locally or on remote computers. A job is typically a single command or a small script that has to be run for each line of the input. The typical input is a list of files, hosts, users, URLs, or tables. If you use xargs today, you will find GNU parallel very easy to use, as it is written to have the same options as xargs. If you write loops in shell, you will find that GNU parallel may be able to replace most of them and make them run faster by running several jobs in parallel. If you use ppss or pexec, you will find that GNU parallel often makes the command easier to read. GNU parallel makes sure the output from the commands is the same as you would get had you run the commands sequentially. This makes it possible to use output from GNU parallel as input for other programs.
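The two ideas in this description, one job per input line and output that reads as if the jobs had run sequentially, can be illustrated with a short Python sketch. It is only an analogy using the standard library, not GNU parallel itself; the names `run_job` and `run_all` and the toy command (upper-casing its argument) are assumptions made for this example.

```python
import subprocess
import sys
from concurrent.futures import ThreadPoolExecutor


def run_job(line):
    """Run one external command for one input line and capture its output."""
    # Toy command (an assumption for this sketch): print the argument upper-cased.
    result = subprocess.run(
        [sys.executable, "-c", "import sys; print(sys.argv[1].upper())", line],
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout.strip()


def run_all(lines, workers=4):
    """Run the jobs concurrently, returning outputs in input order."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        # Executor.map yields results in the order of the inputs,
        # regardless of which job happens to finish first.
        return list(pool.map(run_job, lines))


if __name__ == "__main__":
    for output in run_all(["alpha", "beta", "gamma"]):
        print(output)
```

Keeping the results in input order is what lets the combined output be piped into another program, as if the jobs had been run one after another.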

Verbiste (updated 20 May 2013)

Pop 393.45
Vit 67.99

Verbiste is a French conjugation system implemented as a C++ library, a GNOME applet, and two command-line tools. It can conjugate verbs and analyze conjugated verbs to determine their mood, tense, and person. The knowledge base contains over 6700 verbs.

Asymptote (updated 20 May 2013)

Pop 934.28
Vit 128.66

Asymptote is a powerful descriptive 2D and 3D vector graphics language for technical drawing, inspired by MetaPost but with an improved C++-like syntax. It provides for figures the same high-quality level of typesetting that LaTeX does for scientific text. Asymptote is a programming language as opposed to just a graphics program. It can exploit the best features of script (command-driven) and graphical user interface (GUI) methods. High-level graphics commands are implemented in the language itself, allowing them to be easily tailored to specific applications.

TXR (updated 17 May 2013)

Pop 291.14
Vit 46.47

TXR is a new data munging language to replace the likes of awk and Perl. TXR's special pattern language provides template-based matching of entire documents or large sections of documents. It also contains a language for functional and imperative programming. It is written in C and takes the form of a utility that is portable to Unix-like platforms and Windows.

Recoll (updated 14 May 2013)

Pop 354.91
Vit 55.70

Recoll is a personal full-text desktop search tool based on Xapian. It provides an easy-to-use, feature-rich interface that is simple to administer, with a Qt-based GUI. Text, HTML, PDF, PostScript, MS Word, OpenOffice, WordPerfect, KWord, AbiWord, maildir, and mailbox mail folder formats are supported, along with their compressed versions and quite a few others. Powerful query facilities are provided. Multiple character sets are supported, and internal processing and storage use Unicode UTF-8. Stemming is performed at query time, and the stemming language can be switched after indexing.

The Epeios XML preprocessor (updated 06 May 2013)

Pop 135.50
Vit 20.16

The 'expp' tool (the Epeios XML preprocessor) reads an XML file and transforms it into another XML file. It simplifies the writing of XML files by allowing the handling of macros, the definition and testing of variables, the inclusion of files, and more. This is done by writing predefined tags, which belong to a given namespace, directly in the source XML file; the 'expp' tool then recognizes and handles these tags. The tool is also available as a Java native component.

Ascii Design (updated 02 May 2013, no website)

Pop 195.78
Vit 9.68

Ascii Design is an ASCII art program based on the FIGlet engine. You can use it to create text-based art for decorating Web sites, email, text files, and so on.
