Not registered? - Request an account here

Tesseract OCR to PAGE

Tesseract OCR to PAGE

Download the latest version

Overview

Tesseract to PAGE is a command line tool to analyse document page images using the open source OCR engine Tesseract and save the results to PAGE (Page Analysis and Ground truth Elements) XML format. Version 1.4 is based on the latest release of Tesseract (3.04).

For more information on Tesseract see: http://code.google.com/p/tesseract-ocr

Download the latest version

Alternative download