moz-hocr-edit


hOCR is a file format for representing the output of Optical Character Recognition (OCR) programs such as OCRopus. OCR programs are not perfect at recognizing text, so human editing is often necessary. moz-hocr-edit provides a line-by-line user interface for people to edit and proofread hOCR documents.
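To make the idea of line-by-line editing concrete, here is a minimal sketch of how the text lines in an hOCR document could be listed for proofreading. It is not taken from moz-hocr-edit. It relies only on the hOCR convention that a line is an HTML element with class `ocr_line` whose `title` attribute may carry a `bbox x0 y0 x1 y1` property. The sample markup and the names `parse_bbox`, `LineExtractor`, and `extract_lines` are hypothetical and chosen for illustration.

```python
from html.parser import HTMLParser

# Hypothetical sample input: two recognized lines, the first with an OCR error.
SAMPLE_HOCR = """
<div class="ocr_page" title="bbox 0 0 800 600">
  <span class="ocr_line" title="bbox 10 20 400 50">Helo world</span>
  <span class="ocr_line" title="bbox 10 60 400 90">second line</span>
</div>
"""


def parse_bbox(title):
    """Return (x0, y0, x1, y1) from an hOCR title string, or None if absent."""
    for prop in title.split(";"):
        parts = prop.split()
        if len(parts) == 5 and parts[0] == "bbox":
            return tuple(int(p) for p in parts[1:])
    return None


class LineExtractor(HTMLParser):
    """Collects (bbox, text) for every element whose class includes ocr_line."""

    def __init__(self):
        super().__init__()
        self.lines = []
        self._depth = 0      # nesting depth inside the current ocr_line element
        self._bbox = None
        self._text = ""

    def handle_starttag(self, tag, attrs):
        if self._depth > 0:
            # Nested element (for example a word-level span) inside a line.
            self._depth += 1
            return
        attrs = dict(attrs)
        if "ocr_line" in (attrs.get("class") or "").split():
            self._depth = 1
            self._bbox = parse_bbox(attrs.get("title") or "")
            self._text = ""

    def handle_data(self, data):
        if self._depth > 0:
            self._text += data

    def handle_endtag(self, tag):
        if self._depth > 0:
            self._depth -= 1
            if self._depth == 0:
                self.lines.append((self._bbox, self._text.strip()))


def extract_lines(hocr):
    """Parse an hOCR string and return a list of (bbox, text) tuples."""
    parser = LineExtractor()
    parser.feed(hocr)
    return parser.lines


if __name__ == "__main__":
    for bbox, text in extract_lines(SAMPLE_HOCR):
        print(bbox, repr(text))
```

A line without a `bbox` property is returned with `None` in place of the box, so it can still be listed and corrected. The depth counter assumes every nested start tag has a matching end tag, so unclosed void elements such as a bare `<br>` inside a line would need extra handling.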

Recent releases

  •  03 Jul 2009 00:38

    Release Notes: This version is the first to support documents with multiple pages. It also has the ability to scale down high-resolution text images before displaying them.

  •  06 Apr 2009 15:16

    Release Notes: Users can now edit documents without line-by-line bounding box information. In addition, it is now possible to edit where paragraph breaks occur. Many improvements to the UI were made.

  •  19 Mar 2009 13:18

    Release Notes: User interface improvements were made. Editing over HTTP is supported.

  •  13 Mar 2009 23:23

    No changes have been submitted for this release.

