Since the mid-1990s we have advised a number of organisations, and our software is used in industry.
We carry out real-world research in document digitisation, character recognition (OCR) and other image analysis applications.
Our ongoing and completed projects range from the processing and recognition of historical books and newspapers to the analysis of web documents.
We develop and make available a number of document analysis tools, ranging from ground truth production to performance evaluation.
We have created several publicly available datasets, ranging from historical books and newspapers to contemporary documents.
We have created and maintain PAGE, an XML-based representation framework for document analysis and recognition results.
PRImA at DAS2018
(09/04/2018)
ICFHR2018 Competition on Recognition of Historical Arabic Scientific Manuscripts - RASM2018
(05/02/2018)
ICDAR2017 Competition Videos
(05/04/2017)
ICDAR2017 Competitions - Registration Open
(27/03/2017)
Aletheia 3.2 released
(07/03/2017)
Aletheia 3.1 released
(30/08/2016)