Extractor / Exporter

Download the latest version

Overview

The PAGE Extractor/Exporter is a Windows command-line tool that works on PAGE XML files. It can extract snippets (image and layout description), export the text content of a PAGE file by serialising it according to the reading order, and export training data in the Gamera XML format for training the open-source Gamera OCR engine.
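
To illustrate what serialising text according to the reading order means, here is a minimal Python sketch. It is not the tool's own implementation and does not reflect its command-line options. It assumes a PAGE file in which a `ReadingOrder` element lists `RegionRefIndexed` entries (with `index` and `regionRef` attributes) and each `TextRegion` carries its text in `TextEquiv/Unicode`. The function names and the sample document are hypothetical and exist only for this example.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample: regions are stored in the order r2, r1,
# but the reading order says r1 comes first.
SAMPLE = """<PcGts xmlns="http://schema.primaresearch.org/PAGE/gts/pagecontent/2013-07-15">
  <Page imageFilename="page.tif" imageWidth="100" imageHeight="100">
    <ReadingOrder>
      <OrderedGroup id="g0">
        <RegionRefIndexed index="0" regionRef="r1"/>
        <RegionRefIndexed index="1" regionRef="r2"/>
      </OrderedGroup>
    </ReadingOrder>
    <TextRegion id="r2"><TextEquiv><Unicode>Second</Unicode></TextEquiv></TextRegion>
    <TextRegion id="r1"><TextEquiv><Unicode>First</Unicode></TextEquiv></TextRegion>
  </Page>
</PcGts>"""


def local_name(tag):
    """Strip the XML namespace so the sketch works across schema versions."""
    return tag.rsplit("}", 1)[-1]


def region_text(region):
    """Return the region-level text (direct TextEquiv/Unicode child), or ''."""
    for child in region:
        if local_name(child.tag) == "TextEquiv":
            for item in child:
                if local_name(item.tag) == "Unicode":
                    return item.text or ""
    return ""


def export_text(page_xml):
    """Join the text of all referenced regions in reading order."""
    root = ET.fromstring(page_xml)

    # Map each text region id to its text content.
    texts = {}
    for element in root.iter():
        if local_name(element.tag) == "TextRegion":
            texts[element.get("id")] = region_text(element)

    # Collect (index, region id) pairs from the reading order and sort them.
    refs = []
    for element in root.iter():
        if local_name(element.tag) == "RegionRefIndexed":
            refs.append((int(element.get("index")), element.get("regionRef")))
    refs.sort()

    # Regions that the reading order does not reference are skipped here.
    return "\n".join(texts[ref] for _, ref in refs if ref in texts)


if __name__ == "__main__":
    print(export_text(SAMPLE))
```

The sketch only handles a single flat ordered group and region-level text. Nested groups, unordered groups, and line- or word-level text would need extra handling.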
