RSS 2132 projects tagged "Text Processing"

No download Website Updated 08 Apr 2014 TEA

Screenshot
Pop 570.26
Vit 196.22

TEA is a powerful and easy-to-use Qt4-based editor with many useful features for HTML, Docbook, and LaTeX editing. It features a small footprint, a tabbed layout engine, support for multiple encodings, code snippets, templates, customizable hotkeys, an "open at cursor" function for HTML files and images, miscellaneous HTML tools, preview in external browser, string manipulation functions, Morse-code tools, bookmarks, syntax highlighting, and more.

Download Website Updated 07 Apr 2014 Docx to Text Converter (docx2txt)

Screenshot
Pop 206.80
Vit 31.85

docx2txt is a tool that attempts to generate equivalent text files from Microsoft .docx documents, preserving some formatting and document information (which MS text conversion drops) along with appropriate character conversions for a good (ASCII) text experience. It is a platform independent solution consisting of (core) Perl and (wrapper) Unix/Windows shell scripts and a configuration file to control the output text appearance to fair extent. It can very conveniently be used to build a Web based docx document conversion service. Some Makefiles and Windows batch files are provided for easy installation of the scripts. With unzippers like CakeCmd that can deal with corrupt Zip archives, this tool can extract text from corrupt docx documents in many cases, where MS word processor fails to even open them.

Download Website Updated 07 Apr 2014 Verbiste

Screenshot
Pop 349.66
Vit 98.84

Verbiste is a French conjugation system implemented as a C++ library, a GNOME applet, and two command-line tools. It can conjugate verbs and analyze conjugated verbs to determine their mode, tense, and person. The knowledge base contains over 6700 verbs.

Download Website Updated 06 Apr 2014 Vrapper

Screenshot
Pop 306.02
Vit 52.88

Vrapper is an Eclipse plugin which acts as a wrapper for Eclipse text editors to provide a Vim-like input scheme for moving around and editing text. Unlike other plugins which embed Vim in Eclipse, Vrapper imitates the behavior of Vim while still using whatever editor you have opened in the workbench. The goal is to have the comfort and ease which comes with the different modes, complex commands, and count/operator/motion combinations which are the key features behind editing with Vim, while preserving the powerful features of the different Eclipse text editors, like code generation and refactoring.

Download Website Updated 05 Apr 2014 TXR

Screenshot
Pop 705.07
Vit 81.82

TXR is a new data munging language. TXR's special pattern language provides template-based matching of entire documents or large sections of documents. It also contains a language for functional and imperative programming. It is written in C and takes the form of a utility that is portable to Unix-like platforms and Windows.

Download Website Updated 04 Apr 2014 Terrier

Screenshot
Pop 214.72
Vit 39.23

Terrier is software for the rapid development of Web, intranet, and desktop search engines. More generally, it is a modular platform for building large-scale information retrieval applications, providing indexing and probabilistic retrieval functionalities. It comes with a desktop search application.

Download Website Updated 04 Apr 2014 PCRE

Screenshot
Pop 876.90
Vit 116.63

The PCRE library is a set of functions that implement regular expression pattern matching using the same syntax and semantics as Perl 5, with just a few differences. PCRE is used by many programs, including Exim, Postfix, and PHP.

Download Website Updated 02 Apr 2014 poppler

Screenshot
Pop 691.45
Vit 102.03

Poppler is a PDF rendering library derived from xpdf. It has been enhanced to utilize modern libraries, and new features have been added. It also provides basic command line utilities.

No download No website Updated 02 Apr 2014 Template Data Interface (TDI)

Screenshot
Pop 90.84
Vit 9.75

Template Data Interface (TDI, /ʹtedɪ/) is a markup templating system written in Python with (optional but recommended) speedup code written in C. Unlike most templating systems, TDI does not invent its own language to provide functionality. Instead, you simply mark the nodes you want to manipulate within the template document. The template is parsed, and the marked nodes are presented to your Python code, where they can be modified in any way you want.

Download Website Updated 02 Apr 2014 Barcode Writer in Pure Postscript

Screenshot
Pop 1,053.86
Vit 28.07

Barcode Writer in Pure Postscript implements the printing of many barcode formats entirely within PostScript, so that the process of converting the input string into the printed output is performed by the printer or print system. The project supports all major barcode formats including: EAN-13 (JAN-13), EAN-8 (JAN-8), UPC-A, UPC-E, EAN-5 & EAN-2 (EAN/UPC add-ons), ISBN (including legacy ISBN), ISMN (including legacy ISMN), ISSN, Code 128 (A, B & C), GS1-128, SSCC-18 (EAN-18, NVE), EAN-14, Code 39, Code 39 Extended, Code 93, Code 93 Extended, Code 32 (Italian Pharmacode), Pharmazentralnummer (PZN), Interleaved 2 of 5, ITF-14 (UPC SCS), GS1 DataBar (Omnidirectional, Stacked, Stacked Omnidirectional, Limited, Expanded, Expanded Stacked), Code 2 of 5 (Industrial, IATA, Matrix, Datalogic & COOP), Code 11 (USD-8), BC412, Codabar (NW-7), Pharmacode (including two-track), MSI, Plessey, Telepen, Channel Code, PosiCode, PDF417, Data Matrix (ECC200), QR Code (including Micro QR Code), and more.

Screenshot

Project Spotlight

OpenStego

A tool (with a GUI) for steganography.

Screenshot

Project Spotlight

LinOTP

A flexible one time password authentication system with pluggable OTP algorithms and userstorages.