Projects / PDF OCR X

PDF OCR X

PDF OCR is a simple drag-and-drop utility that converts PDFs and images into text documents. It uses advanced OCR (optical character recognition) technology to extract the text of the PDF or image. This is particularly useful for dealing with PDFs and images that were created via a scan-to-PDF function in a scanner or photo copier. It uses the Tesseract engine to perform OCR, and currently supports over 20 languages for OCR.

Tags
Licenses
Operating Systems
Implementation
Translations

RSS Recent releases

  •  05 Dec 2011 22:22

    Release Notes: This release adds improved handling of PDFs which have a rotation built-in and fixes minor bugs related to OS X Lion compatibility.

    •  30 Jul 2011 04:04

      Release Notes: This release added options for overwriting original PDFs with the searchable result, specifying a custom extension for searchable PDF output, and disabling the auto-opening of the converted PDF files when the conversion is complete.

      •  12 Apr 2011 21:08

        Release Notes: Improvements to searchable PDF output so that it can handle text inside brackets .

        •  22 Dec 2010 22:24

          Release Notes: Improvements to the searchable PDF output option. A fix for a bug that caused special characters not to be output (e.g. é, áô, etc.).

          •  18 Dec 2010 07:41

            Release Notes: A problem with dragging PDFs onto the PDF OCR X panel for conversion that affected some installs of Mac OS X 10.6.5 was fixed.

            Screenshot

            Project Spotlight

            CheckPoint Debugger

            A CheckPoint troubleshooting tool.

            Screenshot

            Project Spotlight

            phlyMail

            A groupware, advanced webmail, and PIM client.