doclifter helps with lifting documents with nroff markup to XML-DocBook. Lifting documents from presentation level to semantic level is hard, and a really good job requires human polishing. This tool aims to do everything that can be mechanized, and to preserve any troff-level information that might have structural implications in XML comments. TBL tables are translated into DocBook table markup, PIC into SVG, and EQN into MathML (relying on pic2svg and GNU eqn for the last two).
reposurgeon is a tool for editing version control repository history. It enables risky operations that version control systems don't want to let you do, such as editing past comments and metadata and removing commits. It works with any version control system that can export and import git fast-import streams, including git, hg, bzr, CVS, and RCS. It can also read Subversion dump files directly and can thus be used to script production of very high-quality conversions from Subversion to any supported DVCS.
uterus is a codec library for financial tick data with an emphasis on market data integrity and maintainability. It comes with a set of tools to convert (mux) and print (demux) data from some sources, and to perform standard tasks like selecting instruments, creating snapshots and candles from tick data, etc. Special care is taken to provide longevity and consistence. All timestamps are internally converted to coordinated time, and price and quantity quotes are converted to a monetary datatype which doesn't suffer from rounding errors. Most importantly, meta data is stored along with the payload data in an inseparable unit, to provide self-contained and self-documenting files or network streams.
Plgen (Playlist Generator) is a tool that sorts files for MP3 players in a user defined order. Cheap MP3 players often come with no playlist capabilities. In addition these players often make use of the FAT16/32 file system, leading to confusion when the playing order is not alphabetical on the device even though it is shown as alphabetical in the file explorer.
HtmlCleaner is an HTML parser. HTML found on the Web is usually dirty, ill-formed, and unsuitable for further processing. For any serious consumption of such documents, it is necessary to first clean up the mess and bring order to the tags, attributes, and ordinary text. For a given HTML document, HtmlCleaner reorders individual elements and produces well-formed XML. By default, it follows rules similar to those which most Web browsers use to create a Document Object Model. However, the user may provide custom tag and rule sets for tag filtering and balancing.
Dateutils are a bunch of tools that revolve around fiddling with dates and times in the command line, with a strong focus on use cases that arise when dealing with large amounts of financial data. Their target market is shell scripts that need date calculations or calendar conversions, and as such they are highly pipe-able and modeled after their well-known cousins (e.g. dtest vs. test, or dgrep vs. grep).