RSS 2130 projects tagged "Text Processing"

Download Website Updated 11 Apr 2014 Vilistextum

Screenshot
Pop 76.99
Vit 4.94

Vilistextum is a small and fast HTML to text converter. It is quite fault-tolerant and deals well with badly-formed or otherwise quirky HTML. It has full support for different character sets (e.g. Unicode). It is able to optimize for ebook reading, collapse multiple blank lines, and create footnotes out of links. A GUI frontend using kaptain is included.

Download Website Updated 10 Apr 2014 TCPDF

Screenshot
Pop 1,555.91
Vit 620.09

TCPDF is a PHP class for generating PDF documents without requiring external extensions. TCPDF supports all ISO page formats and custom page formats, custom margins and units of measure, UTF-8 Unicode, RTL languages, HTML, barcodes, TrueTypeUnicode, TrueType, OpenType, Type1, and CID-0 fonts, images, graphic functions, clipping, bookmarks, JavaScript, forms, page compression, digital signatures, and encryption.

Download Website Updated 10 Apr 2014 OpenGrok

Screenshot
Pop 331.17
Vit 69.91

OpenGrok is a fast and usable source code search and cross reference engine. It helps you search, cross-reference, and navigate your source tree. It can understand various program file formats and version control histories like Mercurial, Bazaar, Git, ClearCase, Perforce, SCCS, RCS, CVS, or Subversion. In other words, it lets you grok (profoundly understand) the source.

Download Website Updated 09 Apr 2014 Highlight

Screenshot
Pop 1,155.10
Vit 287.87

Highlight is a universal converter from source code to HTML, XHTML, RTF, TeX, LaTeX, SVG, BBCode, and terminal escape sequences. (X)HTML and SVG output are formatted by Cascading Style Sheets. It supports more than 170 programming languages, and includes 80 highlighting color themes. The configuration files are Lua scripts with plug-in support. The converter includes some features to provide a consistent layout of the output code.

Download Website Updated 08 Apr 2014 AutoLaTeX

Screenshot
Pop 647.15
Vit 153.24

AutoLaTeX is a tool for managing small to large LaTeX documents. It detects which files which are used to build the document (included TeX files, BibTeX, figures, etc.), and launches the various different tools (latex, bibtex, makeindex) when the sources files have been changed. It provides translation rules which automatically generate figures in EPS, PNG, or PDF formats from different types of sources (dia, xfig, svg, astah, source code, etc.) AutoLaTeX also provides graphical user interfaces, a plugin for the editors Gedit and Sublime Text, and a standalone Gtk application.

No download Website Updated 08 Apr 2014 TEA

Screenshot
Pop 564.45
Vit 283.55

TEA is a powerful and easy-to-use Qt4-based editor with many useful features for HTML, Docbook, and LaTeX editing. It features a small footprint, a tabbed layout engine, support for multiple encodings, code snippets, templates, customizable hotkeys, an "open at cursor" function for HTML files and images, miscellaneous HTML tools, preview in external browser, string manipulation functions, Morse-code tools, bookmarks, syntax highlighting, and more.

Download Website Updated 07 Apr 2014 Docx to Text Converter (docx2txt)

Screenshot
Pop 187.95
Vit 44.43

docx2txt is a tool that attempts to generate equivalent text files from Microsoft .docx documents, preserving some formatting and document information (which MS text conversion drops) along with appropriate character conversions for a good (ASCII) text experience. It is a platform independent solution consisting of (core) Perl and (wrapper) Unix/Windows shell scripts and a configuration file to control the output text appearance to fair extent. It can very conveniently be used to build a Web based docx document conversion service. Some Makefiles and Windows batch files are provided for easy installation of the scripts. With unzippers like CakeCmd that can deal with corrupt Zip archives, this tool can extract text from corrupt docx documents in many cases, where MS word processor fails to even open them.

Download Website Updated 07 Apr 2014 Verbiste

Screenshot
Pop 336.12
Vit 135.35

Verbiste is a French conjugation system implemented as a C++ library, a GNOME applet, and two command-line tools. It can conjugate verbs and analyze conjugated verbs to determine their mode, tense, and person. The knowledge base contains over 6700 verbs.

Download Website Updated 06 Apr 2014 Vrapper

Screenshot
Pop 302.35
Vit 71.86

Vrapper is an Eclipse plugin which acts as a wrapper for Eclipse text editors to provide a Vim-like input scheme for moving around and editing text. Unlike other plugins which embed Vim in Eclipse, Vrapper imitates the behavior of Vim while still using whatever editor you have opened in the workbench. The goal is to have the comfort and ease which comes with the different modes, complex commands, and count/operator/motion combinations which are the key features behind editing with Vim, while preserving the powerful features of the different Eclipse text editors, like code generation and refactoring.

Download Website Updated 05 Apr 2014 TXR

Screenshot
Pop 745.93
Vit 108.54

TXR is a new data munging language. TXR's special pattern language provides template-based matching of entire documents or large sections of documents. It also contains a language for functional and imperative programming. It is written in C and takes the form of a utility that is portable to Unix-like platforms and Windows.

Screenshot

Project Spotlight

gslapt

A GTK version of slapt-get.

Screenshot

Project Spotlight

Catharsis.NET.Web.Widgets

An ASP.NET MVC tag library with social media widgets.