CUPS is a standards-based printing system for Mac OS X and other Unix-like operating systems. It provides the System V and Berkeley command line interfaces, and uses the Internet Printing Protocol ("IPP") as the basis for managing print jobs and queues. The Line Printer Daemon (LPD) Server Message Block (SMB), and AppSocket (a.k.a. JetDirect) protocols are also supported with reduced functionality. CUPS adds network printer browsing and PostScript Printer Description ("PPD") based printing options to support real world printing.
PdfParser is a standalone PHP library that provides various tools for extracting data from PDF files. It loads and parses objects and headers, extracts meta data, and extracts text from ordered pages. It supports compressed PDF, MAC OS Roman charset encoding, hex and octal encoding in text sections, and is compliant with PSR-0 (autoloader) and PSR-1 (code styling). Currently, secured documents are not supported.
PDFtk Server is a simple commandline tool for doing everyday things with PDF documents. You can use it to merge PDF documents or collate PDF page scans, split PDF pages into a new document, rotate PDF documents or pages, decrypt input as necessary (password required), encrypt output as desired, fill PDF forms with X/FDF data and/or flatten forms, generate FDF data stencils from PDF forms, apply a background watermark or a foreground stamp, report PDF metrics, bookmarks, and metadata, add/update PDF bookmarks or metadata, attach files to PDF pages or the PDF document, unpack PDF attachments, burst a PDF document into single pages, uncompress and re-compress page streams, and repair corrupted PDF files (where possible).
Aspose.Pdf is a .NET PDF component to write PDF documents without using Adobe Acrobat. It supports form field creation, document, text and page properties, color space, text, heading, and attachment settings. It lets you create PDF documents by using its API with XML templates and XSL-FO files. It also converts HTML, XSL-FO, and MS Word to PDF. Other features include image formats and security features, hyperlinks, the ability to add footnotes, automatic fitting to content in a table, decimal Tab stops, HTML tags, and keeping paragraphs together when breaking pages.
mdocml is a suite of tools for compiling "-mdoc", the roff macro package of choice for BSD manual pages, and "-man", the predominant historical package for UNIX manuals. The mission of mdocml is to deprecate groff, the GNU roff implementation, for displaying -mdoc pages whilst providing token support for -man.
getxbook is a collection of tools to download books from websites. There are tools to download from Google Books' "book preview", Amazon's "look inside the book", and Barnes and Noble's "book viewer". There is an optional GUI written in Tcl/Tk, and some shell scripts using OCR to create plain text or searchable PDFs and DjVu files from the downloaded books.