RSS 18 projects tagged "OCR"

Download Website Updated 24 Mar 2014 GNU Ocrad

Screenshot
Pop 658.80
Vit 78.14

GNU Ocrad is an OCR (Optical Character Recognition) program and library based on a feature extraction method. It reads images in pbm (bitmap), pgm (greyscale), or ppm (color) formats and produces text in byte (8-bit) or UTF-8 formats. It also includes a layout analyzer that is able to separate the columns or blocks of text normally found on printed pages. Ocrad can be used as a stand-alone console application, or as a backend to other programs.

No download Website Updated 19 Dec 2011 OCRFeeder

Screenshot
Pop 110.13
Vit 1.47

OCRFeeder is a document layout analysis and optical character recognition application. It is able to automatically outline a document image's contents, distinguish between graphics and text and perform OCR over the latter. It can export to several formats, its main one being ODT. OCRFeeder has a GTK+ graphical user interface that allows the user to control the application and, for example, edit and correct the automatic recognition. It can also be used from the command line for automation.

No download Website Updated 04 Jul 2009 FormReturn OMR

Screenshot
Pop 33.70
Vit 41.85

FormReturn is OMR (Optical Mark Recognition) software that has many features and is easy to use. It gives anyone the ability to design printable forms and distribute, capture, and automatically grade/analyze handwritten multiple choice response information instantly. All you need is the FormReturn Application, a printer, and a document scanner, and you can process hundreds of forms within minutes.

Download Website Updated 16 Nov 2013 PDF OCR X

Screenshot
Pop 271.74
Vit 24.68

PDF OCR is a simple drag-and-drop utility that converts PDFs and images into text documents. It uses advanced OCR (optical character recognition) technology to extract the text of the PDF or image. This is particularly useful for dealing with PDFs and images that were created via a scan-to-PDF function in a scanner or photo copier. It uses the Tesseract engine to perform OCR, and currently supports over 20 languages for OCR.

Download Website Updated 23 Jan 2010 FuzzyOcr

Screenshot
Pop 47.06
Vit 1.01

FuzzyOcr is a plugin for SpamAssassin that can be used on image spam. It supports optical character recognition using different engines and settings, a fuzzy word matching algorithm applied to OCR results, an image hashing system to learn the unique properties of known spam images, dimension, size, and integrity checking of images, and content-type verification for the containing email message.

Download Website Updated 21 Feb 2014 OCRKit

Screenshot
Pop 276.41
Vit 19.18

OCRKit uses OCR to recognize the text in a graphic, which is particular useful for PDFs received via email, created by DTP, office applications, or images obtained from a scanner, copier, or digital still camera.

Download Website Updated 03 Sep 2010 Paperless Office

Screenshot
Pop 86.74
Vit 1.00

Paperless Office is a document management and electronic filing system. It is similar to Paperport, but adds many new features, such as automatic document classification, synchronization with your filing cabinet, date extraction, semantic Web integration, and sophisticated natural language processing, such as extracting todo lists from documents, spam detection, urgency classification, along with planning, scheduling, and execution features. You can set due dates and interdependencies for documents and tasks, so it has workflow support.

Download Website Updated 18 Dec 2010 Eye

Screenshot
Pop 130.71
Vit 3.28

Eye is an experimental OCR application incorporating innovative recognition algorithms.

Download Website Updated 04 Nov 2012 tesseract-ocr

Screenshot
Pop 155.83
Vit 2.70

tesseract-ocr is an OCR engine originally developed by Hewlett Packard and now sponsored by Google. It is highly accurate and will read a binary, gray, or color image and output text.

No download No website Updated 04 Mar 2011 OCR2DATA

Screenshot
Pop 19.21
Vit 33.82

OCR2DATA is a full OCR stack for document digitization analysis and OCR. It provides external connection by way of an API, standard document exchange formats, and a database.

Screenshot

Project Spotlight

DB Solo

A database development and management tool for developers and administrators.

Screenshot

Project Spotlight

Mantle Business Artifacts

Business artifacts including data model and service library for common business processes.