2138 projects tagged "Text Processing"
siconv (StreamICONV) filters its stdin to stdout while converting from one character set to another. It does this via the iconv(3) function provided by glibc (but not well documented under Linux). All you need to do is cat your data through it and redirect stdout to your desired file or program. The iconv command line function that comes with glibc appears to read everything into a single buffer before converting. If you need to translate 1GB of data, that obviously wouldn't work; thus, this program.
The Perl Embedding Engine is a Perl-based template engine that allows embedded Perl code in HTML templates similar to PHP, JSP, etc. It is simple, flexible, and easy-to-use, and supports 2 modes of operation: offline generation and dynamic interpretation. Unlike many template engines, it only has 3 special tags so parsing is a breeze and it generates plain Perl code.
The Guava tools are a set of Perl scripts for HTML pre-processing. You can create multi-page documents with contents tables, or use templates to give a consistent look to a set of pages. All output is passed through the C preprocessor, so you can use directives such as #include, #define and #if. There are also built-in macros for producing dates, cross references, etc.
Xprintutil is a package of 4 printing utilities: xdvips (a frontend to dvips), xenscript (a frontend to enscript), xa2tex (a frontend to printing text files through TeX), and printer (a printer manager). All of these parse the /etc/printcap file and let the user interactively select the printer as well as most of the command line options of dvips and enscript.