Projects / PDF OCR X

PDF OCR X

PDF OCR is a simple drag-and-drop utility that converts PDFs and images into text documents. It uses advanced OCR (optical character recognition) technology to extract the text of the PDF or image. This is particularly useful for dealing with PDFs and images that were created via a scan-to-PDF function in a scanner or photo copier. It uses the Tesseract engine to perform OCR, and currently supports over 20 languages for OCR.

Tags
Licenses
Operating Systems
Implementation
Translations

RSS Recent releases

  •  23 Jan 2013 02:36

    Release Notes: This release fixes an issue which caused files with non-Latin characters in their filenames to fail to convert.

    •  15 Aug 2012 21:28

      Release Notes: Improved compatibility with OS X 10.5.

      •  13 Aug 2012 21:29

        Release Notes: This release adds improved support for rotated PDFs. It also fixes a bug that caused crashing in OS X 10.5.

        •  13 Aug 2012 21:29

          Release Notes: This release fixes handling of some cropped PDFs.

          •  07 Aug 2012 20:51

            Release Notes: This release fixes a compatibility issue with Leopard (10.5.x) that was introduced in the 1.9.27 release.

            Screenshot

            Project Spotlight

            PHPlot

            Dynamic plots, charts, and graphs in PHP.

            Screenshot

            Project Spotlight

            Debian GNU/Linux

            The Universal Operating System.