Not registered? - Request an account here
Room 211, Newton Building
E-mail: A.Antonacopoulos (append "@primaresearch.org")
http://www.primaresearch.org/people/aa
School of Science, Engineering & Environment
Newton Building
University of Salford
Greater Manchester
M5 4WT
United Kingdom
Tel: +44-(0)161-295 2653 (direct)
Apostolos Antonacopoulos leads the Pattern Recognition and Image Analysis (PRImA) research Lab at the School of Science, Engineering & Environment at the University of Salford, UK where he currently holds the post of Professor of Pattern Recognition. He received his PhD from the University of Manchester, Institute of Science and Technology (UMIST), UK in 1995. From 1995 to 2004 he worked as Lecturer in the Department of Computer Science at the University of Liverpool where he founded the PRImA Lab. In 2005, he joined the University of Salford as Senior Lecturer and the PRImA Lab was established and strengthened at Salford. In the same year, he received the IAPR/ICDAR Young Investigator Award for "Outstanding service to the ICDAR community and his innovative research in historical document processing applications."
Professor Antonacopoulos has worked and published extensively on various problems in Document Analysis and Understanding (Image Enhancement, Segmentation, Recognition, Performance Evaluation) as well as on other applications of Pattern Recognition and Image Analysis. He has co-edited the first Special Issue (IJDAR) on Historical Document Analysis as well as the first book on Web Document Analysis. He has served as a member of the Editorial Boards of the International Journal on Document Analysis and Recognition and of the Electronic Letters on Computer Vision and Image Analysis as well as an Associate Editor of Cultural Heritage Digitisation.
He is currently serving on the Executive Committee of the International Association for Pattern Recognition (IAPR) as Past President, having also held the posts of President, 1st and 2nd Vice President, and Treasurer. He has also chaired or served as a member of a number of IAPR and other professional committees. He has given a number of invited talks and tutorials and has held engagements as a technical advisor to libraries and archives, among which are the British Library and the Wellcome Library. He has been active in the organisation of conferences and workshops and is a member of the programme committees of most conferences in his field. He has significant experience in leading and participating in national, European (H2020 and earlier) and industry-sponsored projects. Recent significant project involvement includes the €4M Europeana Newspapers EU-funded project (extraction and recognition of text in newspapers for the European Digital Library), the €1.8 SUCCEED EU-funded support action for the Centre of Competence in Digitisation, the US$734K Early Modern OCR project (EMOP) funded by the Andrew W. Mellon Foundation, and the £250K 1961 Census small area statistics digitisation project funded by the Office of National Statistics (ONS). Currently, he is leading a £500K project funded by ONS to digitise and model the data in the published census reports from 1921, 1931, 1951 and 1961.
In the most recent Research Excellence Framework (REF2021) - the research quality evaluation of UK Universities - the research impact case study he lead on "Enabling Digital Transformation through Effective Digitisation" received the maximum rating of "Outstanding" (4*) in terms of both significance and reach.
Listed by conference series:
Refereed Papers
A survey of OCR evaluation tools and metrics
In The 6th International Workshop on Historical Document Imaging and Processing (HIP '21). Association for Computing Machinery, New York, NY, USA, 13–18.
Flexible character accuracy measure for reading-order-independent evaluation
Pattern Recognition Letters, Volume 131, March 2020, Pages 390-397
Efficient and Effective OCR Engine Training
International Journal on Document Analysis and Recognition (IJDAR), 23(1), 73-88
ICDAR2019 Competition on Recognition of Early Indian Printed Documents – REID2019
Proceedings of the 15th International Conference on Document Analysis and Recognition (ICDAR2019), Sydney, Australia, September 2019, pp. 1527-1532
ICDAR2019 Competition on Recognition of Documents with Complex Layouts – RDCL2019
Proceedings of the 15th International Conference on Document Analysis and Recognition (ICDAR2019), Sydney, Australia, September 2019, pp. 1521-1526
Crowdsourcing Historical Tabular Data – 1961 Census of England and Wales
Proceedings of the 2019 Workshop on Historical Document Imaging and Processing (HIP2019), Sydney, Australia, September 2019, pp. 42-47
Highlights of the novel Dewaterability Estimation Test (DET) Device
Environmental Technology
Towards the Extraction of Statistical Information from Digitised Numerical Tables - The Medical Officer of Health Reports Scoping Study
Proceedings of Third International Conference on Digital Access to Textual Cultural Heritage (DATeCH 2019), Brussels, Belgium, 08 - 10 May 2019
ICFHR 2018 Competition on Recognition of Historical Arabic Scientific Manuscripts - RASM2018
Proceedings of the 17th International Workshop on Frontiers in Handwriting Recognition (ICFHR2018), Niagara Falls, USA, August 2018, pp. 471-476
Ontology and Framework for Semantic Labelling of Document Data and Software Methods
Proceedings of the 13th IAPR International Workshop on Document Analysis Systems (DAS2018), Vienna, Austria, April 24-27, 2018, pp. 73-78
Continuous Competition on Recognition of Documents with Complex Layouts - RDCL
Short Paper Booklet of the 13th International Association for Pattern Recognition (IAPR) Workshop on Document Analysis Systems (DAS2018), Vienna, Austria, April 2018, pp. 19-20
Effective Geometric Restoration of Distorted Historical Document for Large-Scale Digitization
IET Image Processing
ICDAR2017 Competition on Recognition of Early Indian Printed Documents – REID2017
Proceedings of the 14th International Conference on Document Analysis and Recognition (ICDAR2017), Kyoto, Japan, November 2017, pp. 1411-1416
ICDAR2017 Competition on Recognition of Documents with Complex Layouts – RDCL2017
Proceedings of the 14th International Conference on Document Analysis and Recognition (ICDAR2017), Kyoto, Japan, November 2017, pp. 1404-1410
Creating a Complete Workflow for Digitising Historical Census Documents: Considerations and Evaluation
Proceedings of the 2017 Workshop on Historical Document Imaging and Processing (HIP2017), Kyoto, Japan, November 2017, pp. 83-88
Unearthing the Recent Past: Digitising and Understanding Statistical Information from Census Tables
Proceedings of Second International Conference on Digital Access to Textual Cultural Heritage (DATeCH 2017), Goettingen, Germany, 01 - 02 June 2017
Making Europe’s Historical Newspapers Searchable
Proceedings of the 12th IAPR International Workshop on Document Analysis Systems (DAS2016), Santorini, Greece, April 11-14, 2016.
Quality Prediction System for Large-Scale Digitisation Workflows
Proceedings of the 12th IAPR International Workshop on Document Analysis Systems (DAS2016), Santorini, Greece, April 11-14, 2016
The ENP Image and Ground Truth Dataset of Historical Newspapers
Proceedings of the 13th International Conference on Document Analysis and Recognition (ICDAR2015), Nancy, France, August 2015, pp. 931-935
ICDAR2015 Competition on Recognition of Documents with Complex Layouts – RDCL2015
Proceedings of the 13th International Conference on Document Analysis and Recognition (ICDAR2015), Nancy, France, August 2015, pp. 1151-1155
Historical Typewritten Document Recognition Using Minimal User Interaction
Proceedings of the 2015 Workshop on Historical Document Imaging and Processing (HIP2015), Nancy, France, August 2015, pp. 31-38
Europeana Newspapers OCR Workflow Evaluation
Proceedings of the 2015 Workshop on Historical Document Imaging and Processing (HIP2015), Nancy, France, August 2015, pp. 39-46
Navigating the Storm: IMPACT, eMOP, and Agile Steering Standards
Digital Scholarship in the Humanities, 2015.
Document Representation Refinement for Precise Region Description
Proceedings of the First International Conference on Digital Access to Textual Cultural Heritage (DATeCH2014), Madrid, Spain, May 2014, pp. 9-13
Efficient OCR Training Data Generation with Aletheia
Short Paper Booklet of the 11th International Association for Pattern Recognition (IAPR) Workshop on Document Analysis Systems (DAS2014), Tours, France, April 2014, pp. 19-20
Distinction between handwritten and machine-printed text based on the bag of visual words model
Pattern Recognition, Available online 20 September 2013, ISSN 0031-3203
The Significance of Reading Order in Document Recognition and its Evaluation
Proceedings of the 12th International Conference on Document Analysis and Recognition (ICDAR2013), Washington DC, USA, August 2013, pp. 688-692
ICDAR2013 Competition on Historical Newspaper Layout Analysis – HNLA2013
Proceedings of the 12th International Conference on Document Analysis and Recognition (ICDAR2013), Washington DC, USA, August 2013, pp. 1486-1490
ICDAR2013 Competition on Historical Book Recognition – HBR2013
Proceedings of the 12th International Conference on Document Analysis and Recognition (ICDAR2013), Washington DC, USA, August 2013, pp. 1491-1495
The IMPACT Dataset of Historical Document Images
Proceedings of the 2013 Workshop on Historical Document Imaging and Processing (HIP2013), Washington DC, USA, August 2013, pp. 123-130
A robust hybrid approach for text line segmentation in historical documents
Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012), Tsukuba, Japan, November 11-15, 2012, IEEE-CS Press, pp. 335-338
Handwritten and Machine Printed Text Separation in Document Images using the Bag of Visual Words Paradigm
Proceedings of the 13th International Conference on Frontiers in Handwriting Recognition (ICFHR2012), Bari, Italy, September 2012, pp. 103-108
Restoration of Arbitrarily Warped Historical Document Images Using Flow Lines
Proceedings of the 11th International Conference on Document Analysis and Recognition (ICDAR2011), Beijing, China, September 2011, pp. 905-909
Scenario Driven In-Depth Performance Evaluation of Document Layout Analysis Methods
Proceedings of the 11th International Conference on Document Analysis and Recognition (ICDAR2011), Beijing, China, September 2011, pp. 1404-1408
Aletheia - An Advanced Document Layout and Text Ground-Truthing System for Production Environments
Proceedings of the 11th International Conference on Document Analysis and Recognition (ICDAR2011), Beijing, China, September 2011, pp. 48-52
Historical Document Layout Analysis Competition
Proceedings of the 11th International Conference on Document Analysis and Recognition (ICDAR2011), Beijing, China, September 2011, pp. 1516-1520
Grid-Based Modelling and Correction of Arbitrarily Warped Historical Document Images for Large-Scale Digitisation
Proceedings of the 2011 Workshop on Historical Document Imaging and Processing (HIP2011), Beijing, China, September 2011, pp. 106-111
Large-Scale Digitisation and Recognition of Opportunities for Image Processing and Analysis Historical Documents: Challenges and Analysis
Keynote presentation given at the Swedish Symposium on Image Analysis 2010 (SSBA2010), Uppsala, Sweden, March 11-12, 2010
The PAGE (Page Analysis and Ground-Truth Elements) Format Framework
Proceedings of the 20th International Conference on Pattern Recognition (ICPR2010), Istanbul, Turkey, August 23-26, 2010, IEEE-CS Press, pp. 257-260
A New Framework for Recognition of Heavily Degraded Characters in Historical Typewritten Documents Based on Semi-Supervised Clustering
Proceedings of the 10th International Conference on Document Analysis and Recognition (ICDAR2009), Barcelona, Spain, July 2009, pp. 506-510
Word-Based Adaptive OCR for Historical Books
Proceedings of the 10th International Conference on Document Analysis and Recognition (ICDAR2009), Barcelona, Spain, July 2009, pp. 501-505
A Realistic Dataset for Performance Evaluation of Document Layout Analysis
Proceedings of the 10th International Conference on Document Analysis and Recognition (ICDAR2009), Barcelona, Spain, July 2009, pp. 296-300
ICDAR2009 Page Segmentation Competition
Proceedings of the 10th International Conference on Document Analysis and Recognition (ICDAR2009), Barcelona, Spain, July 2009, pp. 1370-1374
A Geometric Approach for Accurate and Efficient Performance Evaluation of Layout Analysis Methods
Proceedings of the 19th International Conference on Pattern Recognition (ICPR2008), Tampa, Florida, USA, December 7-11, 2008, IEEE-CS Press
Colour text segmentation in web images based on human perception
Image and Vision Computing, Volume 25, Issue 5, Elsevier, May 2007, pp. 564-577
ICDAR2007 Handwriting Segmentation Contest
Proceedings of the 9th International Conference on Document Analysis and Recognition (ICDAR2007), Curitiba, Brazil, September 2007, pp. 1284-1288
Performance Analysis Framework for Layout Analysis Methods
Proceedings of the 9th International Conference on Document Analysis and Recognition (ICDAR2007), Curitiba, Brazil, September 2007, pp. 1258-1262
ICDAR2007 Page Segmentation Competition
Proceedings of the 9th International Conference on Document Analysis and Recognition (ICDAR2007), Curitiba, Brazil, September 2007, pp. 1279-1283
Flexible Text Recovery from Degraded Typewritten Historical Documents
Proceedings of the 18th International Conference on Pattern Recognition (ICPR2006), Hong Kong, August 20-24, 2006, IEEE-CS Press, pp. 1062-1065
Ground Truth for Layout Analysis Performance Evaluation
Document Analysis Systems VII: Proceedings of the 7th International Association for Pattern Recognition (IAPR) Workshop on Document Analysis Systems (DAS2006), H. Bunke and A.L. Spitz (Eds.), Springer Lecture Notes in Computer Science, LNCS 3872, Nelson, New Zealand, February 2006, pp. 302-311
Semantics-Based Content Extraction in Typewritten Historical Documents
Proceedings of the 8th International Conference on Document Analysis and Recognition (ICDAR2005), Seoul, South Korea, August/September 2005, pp. 48-53
ICDAR2005 Page Segmentation Competition
Proceedings of the 8th International Conference on Document Analysis and Recognition (ICDAR2005), Seoul, South Korea, August/September 2005, pp. 75-79
Text Extraction from Web Images Based on A Split-and-Merge Segmentation Method Using Color Perception
Proceedings of the 17th International Conference on Pattern Recognition (ICPR2004), Cambridge, UK, August 23-26, 2004, IEEE-CS Press, pp. 634-637
The Lifecycle of a Digital Historical Document: Structure and Content
Proceedings of the ACM Symposium on Document Engineering (DocEng2004), ACM Press, Milwaukee, Wisconsin, 28-30 October 2004, pp. 147-154
Document Image Analysis for World War II Personal Records
Proceedings of the International Workshop on Document Image Analysis for Libraries (DIAL2004), Palo Alto Research Center (PARC), USA, January 23-24, IEEE Computer Society Press, 2004, pp. 336-341
A Complete Approach to the Conversion of Typewritten Historical Documents for Digital Archives
Document Analysis Systems VI: Proceedings of the 6th International Association for Pattern Recognition (IAPR) Workshop on Document Analysis Systems (DAS2004), S. Marinai and A.R. Dengel (Eds.), Springer Lecture Notes in Computer Science, LNCS 3163, Florence, Italy, September 2004, pp. 90-101
A Robust Braille Recognition System
Document Analysis Systems VI: Proceedings of the 6th International Association for Pattern Recognition (IAPR) Workshop on Document Analysis Systems (DAS2004), S. Marinai and A.R. Dengel (Eds.), Springer Lecture Notes in Computer Science, LNCS 3163, Florence, Italy, September 2004, pp. 533-545
Two Approaches for Text Segmentation in Web Images
Proceedings of the 7th International Conference on Document Analysis and Recognition (ICDAR2003), Edinburgh, UK, August 2003, pp. 131-136
ICDAR2003 Page Segmentation Competition
Proceedings of the 7th International Conference on Document Analysis and Recognition (ICDAR2003), Edinburgh, UK, August 2003, pp. 688-692
Exploiting Human Colour Perception to Segment Complex Colour Images
Proceedings of Visual Representations and Interpretations (VRI2002), Liverpool, UK, September 2002
An Automated Tachograph Chart Analysis System
Document Analysis Systems V: Proceedings of the 5th International Association for Pattern Recognition (IAPR) Workshop on Document Analysis Systems (DAS2002), D. Lopresti, J. Hu and R. Kashi (Eds.), Springer Lecture Notes in Computer Science, LNCS 2423, Princeton NJ, USA, August 2002, pp. 544-555
A Ground-Truthing Tool for Layout Analysis Performance Evaluation
Document Analysis Systems V: Proceedings of the 5th International Association for Pattern Recognition (IAPR) Workshop on Document Analysis Systems (DAS2002), D. Lopresti, J. Hu and R. Kashi (Eds.), Springer Lecture Notes in Computer Science, LNCS 2423, Princeton NJ, USA, August 2002, pp. 263-244
Fuzzy Segmentation of Characters in Web Images Based on Human Colour Perception
Document Analysis Systems V: Proceedings of the 5th International Association for Pattern Recognition (IAPR) Workshop on Document Analysis Systems (DAS2002), D. Lopresti, J. Hu and R. Kashi (Eds.), Springer Lecture Notes in Computer Science, LNCS 2423, Princeton NJ, USA, August 2002, pp. 295-306
Text Extraction from Web Images Based on Human Perception and Fuzzy Inference
Proceedings of the First International Workshop on Web Document Analysis (WDA2001), Seattle, USA, September 2001, pp. 35-38
Accessing Textual Information Embedded in Internet Images
Proceedings of SPIE, Internet Imaging II, San Jose, USA, January 2001, Vol. 4311, pp. 198-205
Information Extraction from Complex Circular Charts
Proceedings of the 6th International Conference on Document Analysis and Recognition (ICDAR2001), Seattle, USA, September 2001, pp. 784-787
First International Newspaper Page Segmentation Contest
Proceedings of the 6th International Conference on Document Analysis and Recognition (ICDAR2001), Seattle, USA, September 2001, pp. 1190-1194
An Anthropocentric Approach to Text Extraction from WWW Images
Proceedings of the 4th IAPR International Workshop on Document Analysis Systems (DAS2000), Rio de Janeiro, Brazil, December 2000, pp. 515-525
A New Framework for Efficient and Flexible Analysis of the Performance of Document Image Analysis Subsystems
Proceedings of 7th International Conference on Image Processing and its Applications (IPA1999), Manchester, UK, July 1999, pp. 417-420
Region Description and Comparative Analysis Using a Tesseral Representation
Proceedings of 5th International Conference on Document Image Analysis and Recognition (ICDAR1999), Bangalore, India, September 1999, pp. 193-196
Methodology for Flexible and Efficient Analysis of the Performance of Page Segmentation Algorithms
Proceedings of 5th International Conference on Document Image Analysis and Recognition (ICDAR1999), Bangalore, India, September 1999, pp. 451-454
Performance Analysis of Document Image Analysis Subsystems
Digest of IEE Colloquium on Document Image Processing and Multimedia (DIPM), Manchester, UK, 1999, pp. 15/1-15/4
Document Image Compression by Structural Decomposition
Proceedings of the International Conference on Telecommunications 1998 (ICT1998), Chalkidiki, Greece, June 1998, pp. 166-171
Identification of Airfield Runways in Synthetic Aperture Radar Images
Proceedings of the 14th International Conference on Pattern Recognition (ICPR1998), Brisbane, Australia, August 17-20, 1998, IEEE-CS Press, Vol. II, pp. 1633-1636
Page Segmentation Using the Description of the Background
Computer Vision and Image Understanding, Special Issue on Document Analysis and Retrieval, Volume 70, Issue 3, June 1998, pp. 350-369
A Structural Approach for Smoothing Noisy Peak-Shaped Analytical Signals
Chemometrics and Intelligent Laboratory Systems, Volume 41, Issue 1, July 1998, pp. 31-42
Local Skew Angle Estimation from Background Space in Text Regions
Proceedings of the 4th International Conference on Document Analysis and Recognition (ICDAR1997), Ulm, Germany, August 1997, pp. 684-688
Presenting Legislation as Hypertext
Proceedings of the Fifth National/First European Conference on Law, Computers and Artificial Intelligence: EUCLID, Exeter, April 1996, pp. 1-9, ISBN 095278730X
Representation and Classification of Complex-Shaped Printed Regions Using White Tiles
Proceedings of the 3rd International Conference on Document Analysis and Recognition (ICDAR1995), Montreal, Canada, August 1995, pp. 1132-1135
Segmentation and Classification of Document Images Using the Background
Proceedings of the 5th Hellenic Conference on Informatics, Athens, Greece, December 7-9, 1995, pp. 927-937 (invited contribution)
Segmentation and Classification of Document Images
Digest of the IEE Colloquium on Document Image Processing and Multimedia Environments, The Institution of Electrical Engineers, November 1995, pp. 16/1-16/7, ISSN 0963-3308
Flexible Page Segmentation Using the Background
Proceedings of the IAPR 12th International Conference on Pattern Recognition (ICPR1994), Jerusalem, Israel, October 9-12, 1994, IEEE-CS Press, pp. 339-344
Segmentation of Layouts with Non-Rectangular Regions
Proceedings of the International Association of Pattern Recognition Workshop on Document Analysis Systems (DAS1994), Kaiserslautern, Germany, 18-20 October 1994, pp. 3-13
Edited Conference Proceedings
Proceedings of the 2010 ACM Symposium on Document Engineering (DocEng2010)
288 pages, ACM, September 21-24, 2010, ISBN: 978-1-4503-0231-9
Web Document Analysis II: Proceedings of the 2nd International Workshop on Web Document Analysis
PRImA, 2003, ISBN: 0-9541148-1-7
Proceedings of the 7th International Conference on Document Analysis and Recognition (ICDAR2003)
1304 pages, 2 volumes, IEEE Computer Society Press, August 2003, ISBN: 0-7695-1960-1
Web Document Analysis: Proceedings of the 1st International Workshop on Web Document Analysis
PRImA, 2001, ISBN: 0-9541148-0-9
Book Chapters
The Analysis of Web Documents
in the book Digital Document Processing: Major Directions and Recent Advances, B.B. Chaudhuri (Ed.), Springer, Advances in Pattern Recognition Series, December 2006, ISBN: 978-1-84628-501-1, pp. 407-419
Visual Representation of Text in Web Documents and Its interpretation
in the book Studies in Multidisciplinarity: Multidisciplinary Approaches to Visual Representations and Interpretations, G.R. Malcolm (Ed.), Volume 2, Elsevier, 2005, pp. 181-196
A Fuzzy Approach to Text Segmentation in Web Images Based on Human Colour Perception
in the book: Web Document Analysis: Challenges and Opportunities, A. Antonacopoulos and J. Hu (Eds.), Series in Machine Perception and Artificial Intelligence, World Scientific Publishing Company, 2003, pp. 203-222
Automated Interpretation of Visual Representations: Extracting Textual Information from WWW Images
in the book Visual Representations and Interpretations, R. Paton and I. Neilson (Eds.), Springer, London, 1999, pp. 88-93
Analysis of Scanned Braille Documents
in the book Document Analysis Systems, A. Dengel and A.L. Spitz (Eds.), World Scientific Publishing Co., 1995, pp. 413-421
Books
Web Document Analysis: Challenges and Opportunities
Series in Machine Perception and Artificial Intelligence, World Scientific Publishing Company, November 2003, ISBN: 978-981-238-582-6