RSS 222 projects tagged "Text Processing"

Download Website Updated 09 Apr 2014 Highlight

Screenshot
Pop 1,018.42
Vit 250.37

Highlight is a universal converter from source code to HTML, XHTML, RTF, TeX, LaTeX, SVG, BBCode, and terminal escape sequences. (X)HTML and SVG output are formatted by Cascading Style Sheets. It supports more than 170 programming languages, and includes 80 highlighting color themes. The configuration files are Lua scripts with plug-in support. The converter includes some features to provide a consistent layout of the output code.

No download Website Updated 08 Apr 2014 TEA

Screenshot
Pop 567.36
Vit 251.27

TEA is a powerful and easy-to-use Qt4-based editor with many useful features for HTML, Docbook, and LaTeX editing. It features a small footprint, a tabbed layout engine, support for multiple encodings, code snippets, templates, customizable hotkeys, an "open at cursor" function for HTML files and images, miscellaneous HTML tools, preview in external browser, string manipulation functions, Morse-code tools, bookmarks, syntax highlighting, and more.

Download Website Updated 07 Apr 2014 Verbiste

Screenshot
Pop 341.39
Vit 122.54

Verbiste is a French conjugation system implemented as a C++ library, a GNOME applet, and two command-line tools. It can conjugate verbs and analyze conjugated verbs to determine their mode, tense, and person. The knowledge base contains over 6700 verbs.

Download Website Updated 02 Apr 2014 poppler

Screenshot
Pop 686.33
Vit 120.04

Poppler is a PDF rendering library derived from xpdf. It has been enhanced to utilize modern libraries, and new features have been added. It also provides basic command line utilities.

Download Website Updated 24 Mar 2014 SeaMonkey

Screenshot
Pop 2,360.35
Vit 132.42

The SeaMonkey project is a community effort to develop an all-in-one Internet application suite. It contains an Internet browser, email and newsgroup client with an included Web feed reader, HTML editor, IRC chat, and Web development tools, and is sure to appeal to advanced users, Web developers, and corporate users. It uses much of the Mozilla source code powering such successful siblings as Firefox, Thunderbird, Camino, Sunbird, and Miro.

Download Website Updated 24 Jan 2014 LyX

Screenshot
Pop 1,226.91
Vit 57.78

LyX is a document processor that encourages an approach to writing based on the structure of your documents, not their appearance. It is intended for people people who write and want their writing to look great without tinkering with formatting details, font attributes, or page boundaries. On screen, it looks like any word processor, but it uses the TeX engine for printed output and producing richly cross-referenced PDFs. It is stable and fully featured.

No download Website Updated 07 Jan 2014 SILVERCODERS DocToText

Screenshot
Pop 189.80
Vit 15.83

SILVERCODERS DocToText is a powerful utility which can convert documents in many formats to plain text. It includes a console application and C/C++ library, which allows embedding text extraction mechanisms into other applications. It supports MS Office binary formats (MS Word (DOC), MS Excel (XLS, XLSB), MS PowerPoint (PPT), and Rich Text Format (RTF)), OpenDocument formats (text documents (ODT), spreadsheets (ODS), presentations (ODP) and graphics (ODG)), Office Open XML formats (MS Word (DOCX), MS Excel (XLSX), and MS PowerPoint (PPTX)), iWork formats (PAGES, NUMBERS, KEYNOTE), OpenDocument Flat XML formats (FODP, FODS, FODT), Portable Document Format (PDF), Email files (EML), and HyperText Markup Language (HTML). DocToText can extract text not only from the document body but also from annotations (comments) embedded in odt, doc, docx, or rtf files and read metadata like author, last modification date, or number of pages. It can be used as a fast console viewer, and is able to convert corrupted OpenDocument and Office Open XML documents. It can be used to recover text even if other recovery methods failed.

Download Website Updated 06 Jan 2014 HTMLDOC

Screenshot
Pop 751.91
Vit 40.05

HTMLDOC converts HTML files and Web pages into indexed HTML, PostScript, and PDF files suitable for online viewing and printing. It can be used as a standalone GUI application, in a batch document processing environment, as a Web-based report generation application, or in embedded environments to support printing of HTML content. It runs on all Unix platforms as well as Mac OS X and Windows 2000 and higher.

Download Website Updated 21 Dec 2013 GNU TeXmacs

Screenshot
Pop 836.91
Vit 90.54

GNU TeXmacs is a free wysiwyw (what you see is what you want) editing platform with special features for scientists. The software aims to provide a unified and user friendly framework for editing structured documents with different types of content: text, mathematics, graphics, interactive content. TeXmacs can also be used as an interface to many external systems for computer algebra, numerical analysis, and statistics. New presentation styles can be written by the user and new features can be added to the editor using Scheme.

No download Website Updated 13 Dec 2013 Time Based Text

Screenshot
Pop 47.18
Vit 5.06

Time Based Text allows the user to include more information in written text by saving the time delta between keystrokes and offering a way to reproduce it exactly how it has been written. It offers a protocol and reference implementation that is easily embeddable in applications using text-based human communication. The idea behind it is that email systems as well poetry and literature may benefit from a time-based approach to text. It comes with a portable C++ reference implementation to generate TBT messages and save them in HTML and DokuWiki (JSON), a Website to upload and exchange TBT poetry, plus various advanced TBT implementations in Javascript, Python, and Perl.

Screenshot

Project Spotlight

BaseX

An XML database and XQuery processor.

Screenshot

Project Spotlight

Unflattener

A tool that makes normal maps for 2D art.