715 projects tagged "Text Processing"

Highlight (updated 09 Apr 2014)

Pop 1,038.59
Vit 267.33

Highlight is a universal converter from source code to HTML, XHTML, RTF, TeX, LaTeX, SVG, BBCode, and terminal escape sequences. (X)HTML and SVG output are formatted by Cascading Style Sheets. It supports more than 170 programming languages, and includes 80 highlighting color themes. The configuration files are Lua scripts with plug-in support. The converter includes some features to provide a consistent layout of the output code.

TEA (updated 08 Apr 2014)

Pop 565.61
Vit 266.04

TEA is a powerful and easy-to-use Qt4-based editor with many useful features for HTML, DocBook, and LaTeX editing. It features a small footprint, a tabbed layout engine, support for multiple encodings, code snippets, templates, customizable hotkeys, an "open at cursor" function for HTML files and images, miscellaneous HTML tools, preview in an external browser, string manipulation functions, Morse code tools, bookmarks, syntax highlighting, and more.

Verbiste (updated 07 Apr 2014)

Pop 338.55
Vit 128.50

Verbiste is a French conjugation system implemented as a C++ library, a GNOME applet, and two command-line tools. It can conjugate verbs and analyze conjugated verbs to determine their mood, tense, and person. The knowledge base contains over 6,700 verbs.

poppler (updated 02 Apr 2014)

Pop 684.08
Vit 124.15

Poppler is a PDF rendering library derived from xpdf. It has been enhanced to use modern libraries, and new features have been added. It also provides basic command-line utilities.

Docvert (updated 01 Apr 2014)

Pop 311.62
Vit 67.22

Docvert is Web service software that takes multiple word processor files (typically .doc) and converts them to OASIS OpenDocument v1.0 format, and then optionally to any XML/HTML format. The results are returned in a .zip file.

gjots (updated 01 Apr 2014)

Pop 465.03
Vit 98.76

gjots lets you organize text notes in a convenient, hierarchical way. It can be used for notes, jottings, bits and pieces, recipes, and even PINs and passwords, using encryption. It can also be used to "mind-map" larger compositions like manuals, Web pages, articles, etc. It is a bit like the KDE program "kjots", but uses the GTK library and supports a hierarchy of folders. Files can be output to HTML with an automatic table of contents or to DocBook XML. Encryption is supported with ccrypt(1), gpg(1), and openssl(1), so that musings can be kept private.

Flat File Extractor (updated 26 Mar 2014)

Pop 341.41
Vit 58.54

ffe is a flat file extractor. It can be used for reading different flat file structures and displaying them in different formats. ffe can read fixed-length and separated text files and fixed-length binary files. It is a command-line tool developed under GNU/Linux. Its main uses are extracting particular fields or records from a flat file, converting data from one format to another (e.g. from CSV to fixed length), verifying a flat file structure, serving as a testing tool for flat file development, and displaying flat file content in human-readable form.
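
To make the idea of field extraction and format conversion concrete, here is a minimal Python sketch that slices fixed-length records and writes selected fields as CSV. It is not ffe and does not use ffe's configuration syntax; the record layout, field names, and function names are all hypothetical and chosen only for illustration.

```python
import csv
import io

# Hypothetical layout: (field name, start offset, width) for each fixed-length field.
LAYOUT = [("id", 0, 4), ("name", 4, 10), ("amount", 14, 8)]


def parse_fixed(line, layout=LAYOUT):
    """Slice one fixed-length record into a dict of whitespace-stripped values."""
    return {name: line[start:start + width].strip() for name, start, width in layout}


def fixed_to_csv(lines, fields, layout=LAYOUT):
    """Convert fixed-length records to CSV text, keeping only the requested fields."""
    out = io.StringIO()
    writer = csv.writer(out, lineterminator="\n")
    writer.writerow(fields)  # header row
    for line in lines:
        record = parse_fixed(line.rstrip("\n"), layout)
        writer.writerow([record[field] for field in fields])
    return out.getvalue()


if __name__ == "__main__":
    # Build sample records with ljust so the column widths match LAYOUT exactly.
    sample = [
        "0001" + "Alice".ljust(10) + "00012.50",
        "0002" + "Bob".ljust(10) + "00003.75",
    ]
    print(fixed_to_csv(sample, ["name", "amount"]), end="")
```

The layout table plays the role that a structure definition would play in a real extractor: changing the tuples is enough to read a different record format without touching the parsing code.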

SeaMonkey (updated 24 Mar 2014)

Pop 2,358.56
Vit 135.19

The SeaMonkey project is a community effort to develop an all-in-one Internet application suite. It contains an Internet browser, email and newsgroup client with an included Web feed reader, HTML editor, IRC chat, and Web development tools, and is sure to appeal to advanced users, Web developers, and corporate users. It uses much of the Mozilla source code powering such successful siblings as Firefox, Thunderbird, Camino, Sunbird, and Miro.

GNU Parallel (updated 22 Mar 2014)

Pop 896.63
Vit 66.63

GNU parallel is a shell tool for executing jobs in parallel, either locally or on remote computers. A job is typically a single command or a small script that has to be run for each line of the input. The typical input is a list of files, hosts, users, URLs, or tables. If you use xargs today, you will find GNU parallel very easy to use, as it is written to have the same options as xargs. If you write loops in the shell, you may find that GNU parallel can replace most of them and make them run faster by running several jobs in parallel. GNU parallel makes sure the output from the commands is the same as you would get had you run the commands sequentially. This makes it possible to use output from GNU parallel as input for other programs.
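
The property described above, running one job per input line concurrently while keeping the output in the order a sequential run would give, can be sketched in Python with the standard library. This is only an illustration of the idea, not how GNU parallel is implemented; `run_in_parallel` and `slow_upper` are hypothetical names, and the job here is a Python function rather than a shell command.

```python
import time
from concurrent.futures import ThreadPoolExecutor


def run_in_parallel(job, inputs, workers=4):
    """Run job(item) for every input item concurrently.

    Executor.map yields results in input order, not completion order,
    so the collected output matches what a sequential loop would produce.
    """
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(job, inputs))


def slow_upper(line):
    """Hypothetical job: shorter inputs sleep longer, so later jobs finish first."""
    time.sleep(max(0.0, 0.05 * (3 - len(line))))
    return line.upper()


if __name__ == "__main__":
    for result in run_in_parallel(slow_upper, ["a", "bb", "ccc"]):
        print(result)
```

Because the results come back in input order even though the last job finishes first, the output can be piped into another program exactly as if the loop had run sequentially.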

Sanzang (updated 04 Mar 2014)

Pop 313.22
Vit 10.56

Sanzang is a compact and simple cross-platform machine translation system. It is especially useful for translating from the CJK languages (Chinese, Japanese, and Korean), and it is very suitable for working with ancient and otherwise difficult texts. Unlike most other machine translation systems, Sanzang is small and approachable. Any user can develop his or her own translation rules, and these rules are simply stored in a text file and applied at runtime.
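
As a rough illustration of translation rules kept in a text file and applied at runtime, the Python sketch below loads a small rule table and substitutes the longest matching source term first. The `source|target` line format, the sample rules, and the function names are assumptions made for this example; they are not taken from Sanzang's documentation and may differ from its real table format.

```python
import re

# Hypothetical rule table: one "source|target" rule per line, "#" starts a comment.
RULES_TEXT = """\
# source|target
如是|thus
我聞|I have heard
我|I
"""


def load_rules(text, sep="|"):
    """Parse rule lines into a dict, skipping blank lines and comments."""
    rules = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        source, target = line.split(sep, 1)
        rules[source] = target
    return rules


def translate(text, rules):
    """Replace each source term with its target, preferring longer terms."""
    if not rules:
        return text
    # Longest terms come first in the alternation so they win over their prefixes.
    terms = sorted(rules, key=len, reverse=True)
    pattern = re.compile("|".join(re.escape(term) for term in terms))
    return pattern.sub(lambda m: rules[m.group(0)] + " ", text).strip()


if __name__ == "__main__":
    print(translate("如是我聞", load_rules(RULES_TEXT)))
```

Keeping the rules in plain text means a user can add or correct a term by editing one line, with no change to the program itself.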

Project Spotlight

tlve

A tag-length-value parser.

Project Spotlight

XOWA

An offline application for Wikipedia (and other wikis).