GNU Ocrad

GNU Ocrad is an OCR (Optical Character Recognition) program and library based on a feature extraction method. It reads images in PBM (bitmap), PGM (greyscale), or PPM (color) format and produces text in byte (8-bit) or UTF-8 format. It also includes a layout analyzer that can separate the columns or blocks of text normally found on printed pages. Ocrad can be used as a stand-alone console application or as a backend to other programs.
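Because Ocrad only accepts PBM, PGM, or PPM input, images from other sources have to be converted first. The following is a minimal sketch, not part of Ocrad itself, of what the simplest of these formats looks like on disk: it writes a plain (ASCII, magic number "P1") PBM file and reads back the header of any PNM file. The helper names write_plain_pbm and read_pnm_header are hypothetical, and the header reader assumes the width and height appear within the first few lines, which holds for typical files.

```python
def write_plain_pbm(path, rows):
    """Write rows of 0/1 pixels as a plain PBM file (magic 'P1'); 1 means black.

    Hypothetical helper for illustration. Each row is written on one line,
    so this is only suitable for narrow images (the format recommends
    lines of at most 70 characters).
    """
    height = len(rows)
    width = len(rows[0]) if rows else 0
    if any(len(row) != width for row in rows):
        raise ValueError("all rows must have the same width")
    with open(path, "w", encoding="ascii") as f:
        f.write(f"P1\n{width} {height}\n")
        for row in rows:
            f.write(" ".join(str(int(bool(pixel))) for pixel in row) + "\n")


def read_pnm_header(path):
    """Return (magic, width, height) from a PNM header, skipping '#' comments.

    Hypothetical helper for illustration. It stops as soon as three tokens
    have been collected, so it does not read the maxval field of PGM/PPM.
    """
    tokens = []
    with open(path, "rb") as f:
        for line in f:
            line = line.split(b"#", 1)[0]  # drop the comment part of the line
            tokens.extend(line.split())
            if len(tokens) >= 3:
                break
    if len(tokens) < 3:
        raise ValueError("not a PNM file: header is incomplete")
    return tokens[0].decode("ascii"), int(tokens[1]), int(tokens[2])
```

For example, write_plain_pbm("tiny.pbm", [[0, 1, 0], [1, 1, 1]]) produces a 3x2 bitmap whose header read_pnm_header reports as ("P1", 3, 2). A file of this kind can then be passed to the ocrad command as an input file, although an image this small contains nothing recognizable as text.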

Recent releases

  •  06 Jun 2014 13:58

    Release Notes: Two new filters have been added: "upper_num" and "upper_num_only". The description of "OCRAD_result_blocks" in the manual has been fixed.

  •  24 Mar 2014 11:43

    Release Notes: Character recognition has been improved (L vs Z). The filters "letters_only" and "numbers_only" now remove leading whitespace. "ocrad.texinfo" has been renamed to "ocrad.texi".

  •  06 Sep 2013 10:24

    Release Notes: Character recognition has been improved (L vs Z).

  •  12 Jul 2013 11:22

    Release Notes: Scaling and smoothing are now performed before thresholding. Character recognition has been improved (D-O, H-N, O-Q, V-Y, merged TT). The new library function "OCRAD_set_utf8_format" has been added. Small improvements have been made in the manual and in the man page. Quote characters in messages have been changed as advised by the GNU Coding Standards.

  •  25 Jun 2013 15:29

    Release Notes: Character recognition has been improved (D vs O).
