SEE Building
E-mail: S.Pletschacher (append "@primaresearch.org")
University of Salford
School of Computing, Science & Engineering
Salford
Greater Manchester, M5 4WT
United Kingdom
Tel: +44-(0)161-295 5654
Stefan Pletschacher is a Lecturer in Computer Science at the University of Salford and a member of the Pattern Recognition and Image Analysis (PRImA) Research Lab. In the past he has held positions as Research Fellow at the University of Salford and Research Assistant at the Institute for Print and Media Technology at Chemnitz University of Technology. Besides his academic career, he has worked as a freelance software developer and as a consultant for digitisation projects. Stefan has been a developer, technical advisor, and work package leader in large-scale international projects related to media production, digitisation, and Optical Character Recognition (OCR), such as SELEAC, IMPACT, SUCCEED, Europeana Newspapers, and eMOP. As technical lead, he has overseen the development of numerous open source and commercial software projects implemented by the PRImA Research Lab.
Refereed Papers
A survey of OCR evaluation tools and metrics
In The 6th International Workshop on Historical Document Imaging and Processing (HIP '21). Association for Computing Machinery, New York, NY, USA, 13–18.
Flexible character accuracy measure for reading-order-independent evaluation
Pattern Recognition Letters, Volume 131, March 2020, Pages 390-397
A cloud-hosted MapReduce architecture for syntactic parsing
Proceedings of EUROMICRO 45th Conference on Software Engineering and Advanced Applications (SEAA)
Efficient and Effective OCR Engine Training
International Journal on Document Analysis and Recognition (IJDAR), 23(1), 73-88
ICDAR2019 Competition on Recognition of Early Indian Printed Documents – REID2019
Proceedings of the 15th International Conference on Document Analysis and Recognition (ICDAR2019), Sydney, Australia, September 2019, pp. 1527-1532
ICDAR2019 Competition on Recognition of Documents with Complex Layouts – RDCL2019
Proceedings of the 15th International Conference on Document Analysis and Recognition (ICDAR2019), Sydney, Australia, September 2019, pp. 1521-1526
Effective Geometric Restoration of Distorted Historical Document for Large-Scale Digitization
IET Image Processing
ICDAR2017 Competition on Recognition of Early Indian Printed Documents – REID2017
Proceedings of the 14th International Conference on Document Analysis and Recognition (ICDAR2017), Kyoto, Japan, November 2017, pp. 1411-1416
ICDAR2017 Competition on Recognition of Documents with Complex Layouts – RDCL2017
Proceedings of the 14th International Conference on Document Analysis and Recognition (ICDAR2017), Kyoto, Japan, November 2017, pp. 1404-1410
Creating a Complete Workflow for Digitising Historical Census Documents: Considerations and Evaluation
Proceedings of the 2017 Workshop on Historical Document Imaging and Processing (HIP2017), Kyoto, Japan, November 2017, pp. 83-88
Unearthing the Recent Past: Digitising and Understanding Statistical Information from Census Tables
Proceedings of Second International Conference on Digital Access to Textual Cultural Heritage (DATeCH 2017), Goettingen, Germany, 01 - 02 June 2017
Quality Prediction System for Large-Scale Digitisation Workflows
Proceedings of the 12th IAPR International Workshop on Document Analysis Systems (DAS2016), Santorini, Greece, April 11-14, 2016
The ENP Image and Ground Truth Dataset of Historical Newspapers
Proceedings of the 13th International Conference on Document Analysis and Recognition (ICDAR2015), Nancy, France, August 2015, pp. 931-935
ICDAR2015 Competition on Recognition of Documents with Complex Layouts – RDCL2015
Proceedings of the 13th International Conference on Document Analysis and Recognition (ICDAR2015), Nancy, France, August 2015, pp. 1151-1155
Europeana Newspapers OCR Workflow Evaluation
Proceedings of the 2015 Workshop on Historical Document Imaging and Processing (HIP2015), Nancy, France, August 2015, pp. 39-46
Document Representation Refinement for Precise Region Description
Proceedings of the First International Conference on Digital Access to Textual Cultural Heritage (DATeCH2014), Madrid, Spain, May 2014, pp. 9-13
Efficient OCR Training Data Generation with Aletheia
Short Paper Booklet of the 11th International Association for Pattern Recognition (IAPR) Workshop on Document Analysis Systems (DAS2014), Tours, France, April 2014, pp. 19-20
The Significance of Reading Order in Document Recognition and its Evaluation
Proceedings of the 12th International Conference on Document Analysis and Recognition (ICDAR2013), Washington DC, USA, August 2013, pp. 688-692
ICDAR2013 Competition on Historical Newspaper Layout Analysis – HNLA2013
Proceedings of the 12th International Conference on Document Analysis and Recognition (ICDAR2013), Washington DC, USA, August 2013, pp. 1486-1490
ICDAR2013 Competition on Historical Book Recognition – HBR2013
Proceedings of the 12th International Conference on Document Analysis and Recognition (ICDAR2013), Washington DC, USA, August 2013, pp. 1491-1495
The IMPACT Dataset of Historical Document Images
Proceedings of the 2013 Workshop on Historical Document Imaging and Processing (HIP2013), Washington DC, USA, August 2013, pp. 123-130
A robust hybrid approach for text line segmentation in historical documents
Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012), Tsukuba, Japan, November 11-15, 2012, IEEE-CS Press, pp. 335-338
Scenario Driven In-Depth Performance Evaluation of Document Layout Analysis Methods
Proceedings of the 11th International Conference on Document Analysis and Recognition (ICDAR2011), Beijing, China, September 2011, pp. 1404-1408
Aletheia - An Advanced Document Layout and Text Ground-Truthing System for Production Environments
Proceedings of the 11th International Conference on Document Analysis and Recognition (ICDAR2011), Beijing, China, September 2011, pp. 48-52
Historical Document Layout Analysis Competition
Proceedings of the 11th International Conference on Document Analysis and Recognition (ICDAR2011), Beijing, China, September 2011, pp. 1516-1520
Grid-Based Modelling and Correction of Arbitrarily Warped Historical Document Images for Large-Scale Digitisation
Proceedings of the 2011 Workshop on Historical Document Imaging and Processing (HIP2011), Beijing, China, September 2011, pp. 106-111
The PAGE (Page Analysis and Ground-Truth Elements) Format Framework
Proceedings of the 20th International Conference on Pattern Recognition (ICPR2010), Istanbul, Turkey, August 23-26, 2010, IEEE-CS Press, pp. 257-260
A New Framework for Recognition of Heavily Degraded Characters in Historical Typewritten Documents Based on Semi-Supervised Clustering
Proceedings of the 10th International Conference on Document Analysis and Recognition (ICDAR2009), Barcelona, Spain, July 2009, pp. 506-510
A Self-Adaptive Method for Extraction of Document-Specific Alphabets
Proceedings of the 10th International Conference on Document Analysis and Recognition (ICDAR2009), Barcelona, Spain, July 2009, pp. 656-660
A Realistic Dataset for Performance Evaluation of Document Layout Analysis
Proceedings of the 10th International Conference on Document Analysis and Recognition (ICDAR2009), Barcelona, Spain, July 2009, pp. 296-300
ICDAR2009 Page Segmentation Competition
Proceedings of the 10th International Conference on Document Analysis and Recognition (ICDAR2009), Barcelona, Spain, July 2009, pp. 1370-1374
Representation of Digitized Documents Using Document Specific Alphabets and Fonts
Proceedings of the 5th IS&T Archiving Conference, Bern, Switzerland, June 2008, pp. 198-202
Vectorization of Glyphs and Their Representation in SVG for XML-Based Processing
Digital Spectrum: Integrating Technology and Culture, Proceedings of the 10th International Conference on Electronic Publishing, Bansko, Bulgaria, June 2006, pp. 299-308
OCR Alternatives for Electronic Publishing of Digitised Documents
From Author to Reader: Challenges for the Digital Content Chain, Proceedings of the 9th ICCC International Conference on Electronic Publishing, Leuven-Heverlee, Belgium, June 2005, pp. 35-41