50 projects tagged "Text Processing"

Download Website Updated 10 Jan 2014 pyPEG

Pop 202.55
Vit 27.13

pyPEG is a quick and easy solution for creating a parser in Python programs. pyPEG uses a PEG language expressed in Python data structures to parse, so it can be used dynamically to parse nearly every context-free language. The output is a plain Python data structure called pyAST, or, as an alternative, XML.

No download Website Updated 07 Jan 2014 SILVERCODERS DocToText

Pop 194.73
Vit 15.98

SILVERCODERS DocToText is a powerful utility which can convert documents in many formats to plain text. It includes a console application and a C/C++ library, which allows embedding text extraction mechanisms into other applications. It supports MS Office binary formats (MS Word (DOC), MS Excel (XLS, XLSB), MS PowerPoint (PPT), and Rich Text Format (RTF)), OpenDocument formats (text documents (ODT), spreadsheets (ODS), presentations (ODP), and graphics (ODG)), Office Open XML formats (MS Word (DOCX), MS Excel (XLSX), and MS PowerPoint (PPTX)), iWork formats (PAGES, NUMBERS, KEYNOTE), OpenDocument Flat XML formats (FODP, FODS, FODT), Portable Document Format (PDF), email files (EML), and HyperText Markup Language (HTML). DocToText can extract text not only from the document body but also from annotations (comments) embedded in ODT, DOC, DOCX, or RTF files, and it can read metadata such as author, last modification date, or number of pages. It can be used as a fast console viewer, and it is able to convert corrupted OpenDocument and Office Open XML documents, so it can recover text even when other recovery methods have failed.

Download Website Updated 12 Dec 2013 RedNotebook

Pop 458.58
Vit 32.74

RedNotebook is a graphical diary and journal for keeping track of notes and thoughts throughout the day. It includes calendar navigation, customizable templates for each day, export functionality, and a keyword search and cloud.

Download No website Updated 30 Nov 2013 Simple DocBook Processor

Pop 81.91
Vit 14.70

SDoP (Simple DocBook Processor) reads a DocBook XML file, processes it into typeset pages, and outputs the result as PostScript (which can easily be converted to a PDF). It is "simple" because it supports only a subset of DocBook, and also because it does not make use of a DTD or stylesheets or any other heavyweight apparatus. It is a single program. SDoP is used to format the Exim reference manual.

No download Website Updated 29 Oct 2013 Wicked

Pop 92.90
Vit 13.92

Wicked is a Wiki for the Horde framework. It uses PEAR's Text_Wiki package for markup rules, parsing, and rendering.

Download Website Updated 22 Aug 2013 DokuWiki

Pop 6,834.30
Vit 28.05

DokuWiki is a standards-compliant, simple-to-use Wiki mainly aimed at creating documentation of any kind. It is targeted at developer teams, workgroups, and small companies. It has a simple but powerful syntax which makes sure the data files remain readable outside the Wiki, and eases the creation of structured texts. All data is stored in plain text files, so no database is needed.

Download No website Updated 24 Jul 2013 Ascii Design

Pop 93.19
Vit 5.27

Ascii Design is an ASCII art program based on the FIGlet engine. You can create text-based art for many types of decorations for Web sites, email, text files, etc.

Download Website Updated 30 May 2013 John the Ripper

Pop 1,489.52
Vit 27.11

John the Ripper is a fast password cracker, currently available for many flavors of Unix, Windows, DOS, BeOS, and OpenVMS. Its primary purpose is to detect weak Unix passwords. It supports several crypt(3) password hash types commonly found on Unix systems, as well as Windows LM hashes. On top of this, lots of other hashes and ciphers are added in the community-enhanced version (-jumbo), and some are added in John the Ripper Pro.

Download Website Updated 29 Apr 2013 ocrodjvu

Pop 110.50
Vit 14.06

ocrodjvu is a wrapper for OCR systems that allows you to perform OCR on DjVu files.

Download Website Updated 10 Apr 2013 YML

Pop 152.80
Vit 19.41

YML (Why a Markup Language?!) is an easy language to compile into XML. YSLT is an easy language for code generation, automating your software development tasks.
