402 projects tagged "Text Processing"
AutoGen is a tool designed for generating program files that contain repetitive text with varied substitutions. Its goal is to simplify the maintenance of programs that contain large amounts of repetitious text. This is especially valuable if there are several blocks of such text that must be kept synchronized. Output is specified with a Scheme-enhanced output template. Input, if required by your template, may come from AutoGen definitions, CGI data, or XML files.
Bluefish is a programmer's Web development editor written using GTK, designed to save the experienced webmaster some keystrokes. It features a multiple file editor, multiple toolbars, custom menus, image and thumbnail dialogs, open from the Web, CSS dialogs, PHP, HTML, Java, C, and XML support, external program integration (tidy, weblint, make, javac), and lots of wizards.
Boxes is a text filter that can draw any kind of box around its input text. Box design choices range from simple boxes to complex ASCII art. A box can also be removed and repaired, even if it has been badly damaged by editing of the text inside. Since the generated boxes may be open on any side, the program can also be used to create regional comments in any programming language. New box designs of all sorts can easily be added and shared by appending to a free format configuration file. In addition to being a command line tool, Boxes integrates well with any text editor that supports filters.
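As a rough illustration of the filter idea (text in, boxed text out), here is a minimal Python sketch. It is not Boxes' own code or configuration format; the function name `draw_box` and the fixed `+`, `-`, `|` design are assumptions made for this example only.

```python
def draw_box(text: str) -> str:
    """Surround text with a plain ASCII box (hypothetical helper, not part of Boxes)."""
    lines = text.splitlines() or [""]
    width = max(len(line) for line in lines)
    # Top and bottom border: one dash per column plus one space of padding per side.
    border = "+" + "-" * (width + 2) + "+"
    # Pad every line to the same width so the right edge lines up.
    body = [f"| {line.ljust(width)} |" for line in lines]
    return "\n".join([border, *body, border])


if __name__ == "__main__":
    print(draw_box("Boxes draws\nframes around text"))
```

Reading from stdin and writing to stdout instead of taking a string would turn the same function into an editor filter.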
Catdoc is an MS Word file decoding tool. It does not attempt to analyze file formatting (it just extracts readable text), but it can handle all versions of Word and convert character encodings. A Tcl/Tk graphical viewer is also included. It can also read RTF files and convert Excel and PowerPoint files.
ccp can copy stdin or a file to stdout, a file to a file, or files and directories into a directory. While copying, it applies one or more recodings. The program includes recoding tables for converting DOS text files into Unix text files, for filtering out control characters, and for conversions between Russian encodings (koi8-r, alt (cp866), win (cp1251), iso-8859-5, LaTeX T2A). You can easily write new recoding tables.
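The notion of a recoding table can be sketched as a 256-entry byte-to-byte mapping applied while the data is copied. The Python below is only an illustration of that idea, assuming Python's built-in `koi8_r` and `cp1251` codecs; it does not reproduce ccp's table format, and the names `build_table`, `recode`, and `dos_to_unix` are hypothetical.

```python
def build_table(src: str, dst: str) -> bytes:
    """Build a 256-byte table mapping each byte of encoding src to encoding dst.

    Characters that do not exist in dst are mapped to b"?".
    """
    table = bytearray(256)
    for b in range(256):
        ch = bytes([b]).decode(src, errors="replace")
        table[b] = ch.encode(dst, errors="replace")[0]
    return bytes(table)


def recode(data: bytes, table: bytes, delete: bytes = b"") -> bytes:
    """Apply a recoding table, optionally dropping the bytes listed in delete."""
    return data.translate(table, delete)


def dos_to_unix(data: bytes) -> bytes:
    """Drop carriage returns, turning CRLF line endings into LF."""
    # Passing None as the table leaves all remaining bytes unchanged.
    return data.translate(None, b"\r")


if __name__ == "__main__":
    koi_to_win = build_table("koi8_r", "cp1251")
    sample = "Привет\r\n".encode("koi8_r")
    converted = dos_to_unix(recode(sample, koi_to_win))
    print(converted.decode("cp1251"), end="")
```

Because each recoding is just a table, several of them can be chained by applying `recode` repeatedly.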
Ciao is a complete Prolog system subsuming ISO-Prolog, with a novel modular design that allows both restricting and extending the language. Ciao extensions currently include feature terms (records), higher-order, functions, constraints, objects, persistent predicates, a good base for distributed execution (agents), and concurrency. Libraries also support WWW programming, sockets, and external interfaces (C, Java, Tcl/Tk, relational databases, etc.). An Emacs-based environment, a stand-alone compiler, and a toplevel shell are also provided.
Cyrprint converts PostScript files generated by Netscape (original pages must be in koi8 encoding). It can be used as a pass-through filter. It adds Cyrillic fonts to the beginning of the file and substitutes font names in the original PS document. The substitution and addition of fonts can be controlled with styles.
Data::Locations is a virtual file manager which allows you to write and read data (text and binary) to and from virtual files (think of bubbles). Moreover, this manager allows you to (recursively) define "magic" insertion points in these virtual files (bubbles inside other bubbles) which can be filled in (inflated) later (through a "straw", i.e., the object's reference), at any convenient time and in any order you like. Since this software works purely in memory, there is no slowdown from costly file input/output (i.e., no temporary files).
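The "bubbles inside bubbles" idea can be illustrated with a small in-memory buffer that holds either text or nested buffers. This is a conceptual Python sketch, not the Perl module's API; the class `Location` and its methods `write`, `insertion_point`, and `render` are hypothetical names chosen for the example.

```python
class Location:
    """In-memory buffer whose parts are strings or nested Location objects."""

    def __init__(self) -> None:
        self.parts = []

    def write(self, text: str) -> None:
        """Append text at the end of this buffer."""
        self.parts.append(text)

    def insertion_point(self) -> "Location":
        """Reserve a spot at the current position and return a handle to it."""
        child = Location()
        self.parts.append(child)
        return child

    def render(self) -> str:
        """Flatten the buffer, expanding nested buffers in place, recursively."""
        return "".join(
            part.render() if isinstance(part, Location) else part
            for part in self.parts
        )


if __name__ == "__main__":
    doc = Location()
    doc.write("header\n")
    body = doc.insertion_point()   # reserved now, filled in later
    doc.write("footer\n")
    body.write("filled in later\n")
    print(doc.render(), end="")
```

The handle returned by `insertion_point` plays the role of the "straw": it can be written to at any time, in any order, and the final output still appears in document order.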