Solr-Connector-Files crawls and indexes directories and files from your filesystem (whatever is mountable to Linux) into Apache Solr. It extracts file contents with Tika, which pulls metadata and text from many document and file formats. It also integrates automatic text recognition (OCR) for images, photos, and PDFs using Tesseract OCR.
Apitron PDF Rasterizer is a .NET component that performs high-quality conversion from PDF files to images. It supports complex PDF content including text (with embedded, externally linked, standard, simple, and composite fonts), images (including masked ones), complex paths and fills, PDF Forms, annotation objects of various types, all blending modes, tiling patterns, shading patterns (function-based, axial, radial), transparency groups, masked content (stencil masks, colorkey masks, soft masks), all colorspaces specified by the PDF standard, and files created with Adobe Illustrator. It also supports PDF bookmarks and page navigation, as well as text search and highlighting (including non-Latin alphabets).
WallCalendar is a set of LaTeX wall calendar templates and utilities for managing them. Users can customize templates or create their own. It includes a template that produces Interior and Covers, which are ready to be uploaded to lulu.com for printing on demand. It also contains a calendar Web site example (generated using the txt2site utility) that integrates with a paper calendar via QR codes.
Aspose.Pdf for Android is a PDF document creation and manipulation component that makes it possible for Android applications to read, write, and manipulate PDF documents without using any other third-party applications. It supports PDF compression options, table creation and manipulation, graph objects, extended security controls, custom font handling, bookmarks, table of contents, attachments and annotations, PDF form data, printing, and much more.
PdfParser is a standalone PHP library that provides various tools for extracting data from PDF files. It loads and parses objects and headers, extracts metadata, and extracts text from ordered pages. It supports compressed PDF, Mac OS Roman charset encoding, and hex and octal encoding in text sections, and it is compliant with PSR-0 (autoloader) and PSR-1 (code styling). Currently, secured documents are not supported.
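To illustrate the hex and octal encodings mentioned above, here is a minimal Python sketch of how PDF string objects can be decoded. It is not PdfParser's own code (the library is written in PHP); the function names `decode_hex_string` and `decode_octal_escapes` are hypothetical, and only the basic rules are covered: other backslash escapes such as `\n` or `\(` are left untouched.

```python
import re


def decode_hex_string(token: str) -> bytes:
    """Decode a PDF hexadecimal string such as '<48656C6C6F>'."""
    # Drop the angle brackets and any whitespace between hex digits.
    digits = re.sub(r"\s+", "", token.strip()[1:-1])
    # An odd number of digits is read as if the last digit were followed by 0.
    if len(digits) % 2:
        digits += "0"
    return bytes.fromhex(digits)


def decode_octal_escapes(body: str) -> bytes:
    """Replace \\ddd octal escapes (1 to 3 digits) in a literal string body.

    Assumes the rest of the body contains only Latin-1 characters.
    """
    def replace(match: re.Match) -> str:
        # High-order overflow is ignored, so the value is masked to one byte.
        return chr(int(match.group(1), 8) & 0xFF)

    return re.sub(r"\\([0-7]{1,3})", replace, body).encode("latin-1")
```

For example, `decode_hex_string("<48656C6C6F>")` and `decode_octal_escapes(r"\110ello")` both yield the bytes for `Hello`. Mapping those bytes to text still depends on the font encoding in use, which this sketch does not attempt.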
pdf_gantt is a TCPDF wrapper class for rendering Gantt charts as part of a generated PDF document. It lets you adjust the printing area, the font colors for different text blocks, the background and reference arrow colors, and other parameters. It automatically adjusts the start dates of dependent tasks so that they start right after their "parent" tasks. A list of the people responsible for each task can be printed. The class can be used standalone or as a PrintFormPDF plugin.
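The start-date adjustment for dependent tasks can be sketched as follows. This is an illustrative Python sketch, not pdf_gantt's implementation (which is a PHP class): the `Task` structure, the `adjust_starts` function, the single-parent model, and the choice of "the day after the parent's last day" are all assumptions made for the example.

```python
from dataclasses import dataclass
from datetime import date, timedelta
from typing import List, Optional


@dataclass
class Task:
    name: str
    start: date
    days: int                      # duration in calendar days (assumed >= 1)
    parent: Optional[str] = None   # name of the task this one depends on

    @property
    def end(self) -> date:
        # Last day of the task, counting the start day as day 1.
        return self.start + timedelta(days=self.days - 1)


def adjust_starts(tasks: List[Task]) -> List[Task]:
    """Move each dependent task so it starts the day after its parent ends.

    Assumes every parent name exists and the dependencies contain no cycles.
    """
    by_name = {task.name: task for task in tasks}
    done = set()

    def visit(task: Task) -> None:
        if task.name in done:
            return
        done.add(task.name)
        if task.parent is not None:
            parent = by_name[task.parent]
            visit(parent)  # settle the parent's dates first
            task.start = parent.end + timedelta(days=1)

    for task in tasks:
        visit(task)
    return tasks
```

Resolving parents before children means a chain of dependencies is shifted consistently regardless of the order in which the tasks are listed.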
printformPdf is a wrapper for the TCPDF class that allows you to create documents from a PDF template by populating it with user data. It has features for printing "data grids" and repeating data on the page (which is useful for business cards, for example), and for drawing bar codes, QR codes, polygons, and images from source picture files. Data printing parameters (position, font name/size, color, rotation) can be saved in an XML configuration file.