Not registered? - Request an account here

Europeana Newspapers Project Dataset Newspapers from Europe's major libraries

Access the dataset »

Europeana Newspapers Project Dataset

Overview

This online repository is the main point of reference for all activities related to evaluation within the scope of the Europeana Newspapers project. Its main goal is to provide a representative collection of all the types of newspapers which are and/or might be subject of ongoing or future digitisation activities. As such, it is hosting scanned images, metadata and ground truth (a representation of the ideal result of a processing step like OCR or layout analysis) on the level of individual newspaper pages.

Supported by

EU ENP

Related Publications

A survey of OCR evaluation tools and metrics

C. Neudecker, K. Baierer, C. Clausner, A. Antonacopoulos, S. Pletschacher

In The 6th International Workshop on Historical Document Imaging and Processing (HIP '21). Association for Computing Machinery, New York, NY, USA, 13–18.

Details »  Download PDF 


Ontology and Framework for Semantic Labelling of Document Data and Software Methods

C. Clausner, A. Antonacopoulos

Proceedings of the 13th IAPR International Workshop on Document Analysis Systems (DAS2018), Vienna, Austria, April 24-27, 2018, pp. 73-78

Details »  Download PDF 


Quality Prediction System for Large-Scale Digitisation Workflows

C. Clausner, S. Pletschacher, A. Antonacopoulos

Proceedings of the 12th IAPR International Workshop on Document Analysis Systems (DAS2016), Santorini, Greece, April 11-14, 2016

Details »  Download PDF 


The ENP Image and Ground Truth Dataset of Historical Newspapers

C. Clausner, C. Papadopoulos, S. Pletschacher, A. Antonacopoulos

Proceedings of the 13th International Conference on Document Analysis and Recognition (ICDAR2015), Nancy, France, August 2015, pp. 931-935

Details »  Download PDF 


Access the dataset »