HTTrack is an easy-to-use offline browser utility. It allows you to download a Web site from the Internet to a local directory, building recursively all directories, getting HTML, images, and other files from the server to your computer. HTTrack arranges the original site's relative link-structure. Simply open a page of the mirrored Web site in your browser, and you can browse the site from link to link, as if you were viewing it online. HTTrack can also update an existing mirrored site, and resume interrupted downloads. WebHTTrack is a Web-based GUI for HTTrack.
makeself is a small shell script that generates a self-extractable compressed TAR archive from a directory. The resulting file appears as a shell script, and can be launched as is. The archive will then uncompress itself to a temporary directory and an arbitrary command will be executed (for example, an installation script). This is pretty similar to archives generated with WinZip Self-Extractor in the Windows world.
mkat is a set of command line tools for burning and cataloging data CD/DVD/BD and audio CDs. The catalog can later be searched for certain files or an audio CD. Tags can be used to categorize CD/DVD/BDs, e.g. comedy, drama, kids, sports. mkat is configured to use wodim/growisofs and genisoimage to do the real work, but it should be possible to configure it to use arbitrary programs.
SILVERCODERS DocToText is a powerful utility which can convert documents in many formats to plain text. It includes a console application and C/C++ library, which allows embedding text extraction mechanisms into other applications. It supports MS Office binary formats (MS Word (DOC), MS Excel (XLS), MS PowerPoint (PPT), and Rich Text Format (RTF)), OpenDocument formats (text documents (ODT), spreadsheets (ODS), and presentations (ODP)), Office Open XML formats (MS Word (DOCX), MS Excel (XLSX), and MS PowerPoint (PPTX)), and HyperText Markup Language (HTML). DocToText can extract text not only from the document body but also from annotations (comments) embedded in odt, doc, docx, or rtf files and read metadata like author, last modification date, or number of pages. It can be used as a fast console viewer, and is able to convert corrupted OpenDocument and Office Open XML documents. It can be used to recover text even if other recovery methods failed.
deco is a command-line program that, together with the deco-archive package, allows you to extract dozens of popular archive file formats. It has a consistent interface ("deco 1.tar.bz2 2.rar 3.zip 4.flac 5.deb") and consistent behavior (never deleting archives after extraction and extracting relative to the current working directory, just verbosely enough, all unless explicitly requested otherwise). It creates an extraction directory if there is more than one file or directory at the archive top level, and it is able to fix strange permissions.
Many project are mirrored worldwide. Mirmon helps in monitoring these mirrors. In a concise graphic format, mirmon shows each site's history of the last two weeks, making it easy to spot stale or dead mirrors. Mirmon quietly probes a subset of the sites in a given list, writes the results in the "state" file, and generates a Web page with the results. The package includes a Perl module called Mirmon.pm; this makes it easy to write applications related to mirmon.