10 projects tagged "Text Processing"

Download Website Updated 14 Aug 2008 TYM Project

Pop 26.31
Vit 1.70

TYM (Typo Manager) is software for managing fonts in formats like .OTF (OpenType), .TTF (TrueType), and .PFA/.PFB (Type 1). It allows you to add or "link" fonts, activate or deactivate them, and delete them. It also handles the "group font" function and stores several fonts inside one file.

Download Website Updated 10 Aug 2006 htmloptim

Pop 18.17
Vit 1.00

htmloptim reduces the size of an HTML file by removing unnecessary characters like spaces, tabs, line feeds, and blank lines.
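
As a rough illustration of this kind of whitespace stripping (not htmloptim's actual code), the following Python sketch removes whitespace between adjacent tags and collapses the remaining runs of spaces, tabs, and line feeds. The function name `strip_html_whitespace` is hypothetical, and the approach is deliberately naive: it assumes the input has no `<pre>`, `<script>`, or `<textarea>` content where whitespace matters, and removing whitespace between inline tags can change rendering.

```python
import re


def strip_html_whitespace(html: str) -> str:
    """Naively shrink HTML by dropping redundant whitespace.

    Assumption: the document contains no whitespace-sensitive
    elements such as <pre>, <script>, or <textarea>.
    """
    # Drop whitespace (including blank lines) between adjacent tags.
    html = re.sub(r">\s+<", "><", html)
    # Collapse any remaining run of spaces, tabs, or line feeds to one space.
    html = re.sub(r"\s+", " ", html)
    return html.strip()


if __name__ == "__main__":
    sample = "<p>\n  Hello   world\n</p>\n\n<p>x</p>"
    print(strip_html_whitespace(sample))
```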

Download No website Updated 07 Apr 2007 dvdmenuauthor

Pop 31.37
Vit 1.00

dvdmenuauthor makes it easy and efficient to author a DVD with menus in an indirect (non-WYSIWYG) way. An XML project file drives the DVD authoring, from which both menus and a dvdauthor XML file are generated. dvdauthor and spumux are then used to author the DVD filesystem. Menu items (buttons and static items such as images and text) can be specified concisely in the project XML file with LaTeX markup (to be processed by pdfLaTeX and rendered by xpdf).

No download Website Updated 13 Jun 2007 WriteTarget

Pop 16.79
Vit 1.00

WriteTarget is a universal text generator based on Bash text substitution. It can be used to generate text in any programming or markup language. The generator does not define its own language; instead, it defines several functions that make it possible to use Bash for creating simple or sophisticated templates.

No download Website Updated 10 Nov 2007 ZhuaShuShell

Pop 16.37
Vit 2.91

ZhuaShuShell is a set of scripts to crawl a collection of online e-books (in HTML format) from certain Chinese e-book sites and save the data to your local machine formatted as a single text book.

Download Website Updated 05 Oct 2011 Vee

Pop 27.20
Vit 1.74

Vee is a command-line blog tool that is very portable across Unix systems. It provides both an interactive and a batch interface for maintaining a log of entries. Formatting is done using a modular architecture that allows a high degree of customization. There are minimal flags and no setup is required.

No download No website Updated 22 Jul 2010 uWiki

Pop 31.18
Vit 1.00

uWiki is a minimalistic wiki engine. All actions are implemented in external scripts. These scripts are wikified, and thus the wiki is extensible by itself. All dynamic access is protected through ACLs. Wiki content and Web content can be mixed in the same directory hierarchy. Markup engines and revision control are pluggable. Currently, asciidoc as the markup engine and git as the revision control backend are provided. Subdirectories can form independent sub-wikis with their own revision control. Features like distributed pages that synchronize between wikis, spam protection, and batch jobs to schedule mirroring of other content (bittorrent, git, rsync, and wget) are planned.

Download Website Updated 07 Apr 2014 Docx to Text Converter (docx2txt)

Pop 194.54
Vit 39.93

docx2txt is a tool that attempts to generate equivalent text files from Microsoft .docx documents, preserving some formatting and document information (which MS text conversion drops) along with appropriate character conversions for a good (ASCII) text experience. It is a platform-independent solution consisting of (core) Perl and (wrapper) Unix/Windows shell scripts and a configuration file to control the output text appearance to a fair extent. It can very conveniently be used to build a Web-based docx document conversion service. Some Makefiles and Windows batch files are provided for easy installation of the scripts. With unzippers like CakeCmd that can deal with corrupt Zip archives, this tool can extract text from corrupt docx documents in many cases where the MS word processor fails to even open them.
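
For readers unfamiliar with the format: a .docx file is a Zip archive whose main body is stored as XML in `word/document.xml`, which is why an unzipper is part of the pipeline. The Python sketch below illustrates that general idea only; it is not docx2txt's Perl implementation and performs none of its formatting or character conversions. The function name `docx_to_text` is hypothetical, and the sketch assumes a well-formed archive with simple paragraphs.

```python
import xml.etree.ElementTree as ET
import zipfile

# WordprocessingML namespace used by elements in word/document.xml.
W_NS = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"


def docx_to_text(source) -> str:
    """Return the plain text of a .docx file, one line per paragraph.

    `source` may be a path or a binary file-like object.
    Assumption: the archive is intact; tables, headers, footnotes,
    and nested text boxes get no special handling.
    """
    with zipfile.ZipFile(source) as archive:
        xml_bytes = archive.read("word/document.xml")

    root = ET.fromstring(xml_bytes)
    paragraphs = []
    for para in root.iter(f"{{{W_NS}}}p"):
        # A paragraph's text is split across <w:t> elements inside runs.
        runs = [node.text or "" for node in para.iter(f"{{{W_NS}}}t")]
        paragraphs.append("".join(runs))
    return "\n".join(paragraphs)
```

Calling `docx_to_text("report.docx")` on a hypothetical file would return its paragraphs joined by newlines.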

Download Website Updated 21 Aug 2008 smupcheck

Pop 13.04
Vit 1.00

smupcheck, which stands for Smart Update Checker, checks Web sites for updates automatically, even if they don't offer an RSS feed. It is a very basic tool, and does not offer advanced features such as checking password-protected Web sites, highlighting changes, or filtering results.

Download Website Updated 17 Oct 2008 CRUSH

Pop 38.88
Vit 1.00

CRUSH (Custom Reporting Utilities for SHell) is a collection of tools for processing delimited-text data from the command line or in shell scripts. It provides utilities for aggregating, merging, filtering, and formatting your data.
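
To give a sense of what aggregating delimited-text data means in practice, here is a small Python sketch that sums one column grouped by another. It does not use or reproduce any CRUSH utility; the function name `sum_by_key`, the column names, and the tab delimiter are all assumptions chosen for illustration.

```python
import csv
import io
from collections import defaultdict


def sum_by_key(text: str, key_col: str, value_col: str, delimiter: str = "\t") -> dict:
    """Sum `value_col` for each distinct value of `key_col`.

    Assumption: the first line of `text` is a header row and every
    value in `value_col` parses as a number.
    """
    totals = defaultdict(float)
    reader = csv.DictReader(io.StringIO(text), delimiter=delimiter)
    for row in reader:
        totals[row[key_col]] += float(row[value_col])
    return dict(totals)


if __name__ == "__main__":
    data = "region\tsales\neast\t10\nwest\t5\neast\t2.5\n"
    print(sum_by_key(data, "region", "sales"))
```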
