reposurgeon enables risky operations that version-control systems don't want to let you do, such as editing past comments and metadata and removing commits. It works with any version control system that can export and import git fast-import streams, including git, hg, bzr, CVS, and RCS. It can also read Subversion dump files directly and can thus be used to script production of very high-quality conversions from Subversion to any supported DVCS.
ascii lists ASCII idiomatic names and octal/decimal code-point forms. It provides easy conversion between various byte representations and the American Standard Code for Information Interchange (ASCII) character table. It knows about a wide variety of hex, binary, octal, Teletype mnemonic, ISO/ECMA code point, slang names, XML entity names, and other representations. Given any one on the command line, it will try to display all others. Called with no arguments, it displays a small, handy ASCII chart.
doclifter helps with lifting documents with nroff markup to XML-DocBook. Lifting documents from presentation level to semantic level is hard, and a really good job requires human polishing. This tool aims to do everything that can be mechanized, and to preserve any troff-level information that might have structural implications in XML comments. TBL tables are translated into DocBook table markup, PIC into SVG, and EQN into MathML (relying on pic2svg and GNU eqn for the last two).
Running sccs2rcs in a directory with an SCCS subdirectory will convert all the SCCS files to corresponding RCS master files in a new RCS directory; the SCCS data itself is accessed read-only. It will complain and terminate if any of the SCCS files are checked out, or if the RCS directory already exists and is non-empty. If the directory contains snapshot auxiliary data produced by Emacs VC mode for tagging, that will be used too.
pg_repack is a PostgreSQL extension which lets you remove bloat from tables and indexes, and optionally restore the physical order of clustered indexes. Unlike CLUSTER and VACUUM FULL, it works online, without holding an exclusive lock on the processed tables during processing. pg_repack is efficient, with performance comparable to using CLUSTER directly.
Changing directories in bash can be tedious if you have long names or nested paths. Creating aliases or adding to the CDPATH can help, but can be improved on. Bashcd adds 6 new commands to make changing directories a bit easier. This commands use find, the locate database, the mdfind database, or other contextual information to make it easier to change to other directories.
HtmlCleaner is an HTML parser. HTML found on the Web is usually dirty, ill-formed, and unsuitable for further processing. For any serious consumption of such documents, it is necessary to first clean up the mess and bring order to the tags, attributes, and ordinary text. For a given HTML document, HtmlCleaner reorders individual elements and produces well-formed XML. By default, it follows rules similar to those which most Web browsers use to create a Document Object Model. However, the user may provide custom tag and rule sets for tag filtering and balancing.