Not registered? - Request an account here
A New Deep Wavefront Based Model for Text Localization in 3D Video
IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), Volume: 32, Issue: 6, June 2022.
A survey of OCR evaluation tools and metrics
In The 6th International Workshop on Historical Document Imaging and Processing (HIP '21). Association for Computing Machinery, New York, NY, USA, 13–18.
Flexible character accuracy measure for reading-order-independent evaluation
Pattern Recognition Letters, Volume 131, March 2020, Pages 390-397
A cloud-hosted MapReduce architecture for syntactic parsing
Proceedings of EUROMICRO 45th Conference on Software Engineering and Advanced Applications (SEAA)
Efficient and Effective OCR Engine Training
International Journal on Document Analysis and Recognition (IJDAR), 23(1), 73-88
ICDAR2019 Competition on Recognition of Early Indian Printed Documents – REID2019
Proceedings of the 15th International Conference on Document Analysis and Recognition (ICDAR2019), Sydney, Australia, September 2019, pp. 1527-1532
ICDAR2019 Competition on Recognition of Documents with Complex Layouts – RDCL2019
Proceedings of the 15th International Conference on Document Analysis and Recognition (ICDAR2019), Sydney, Australia, September 2019, pp. 1521-1526
Crowdsourcing Historical Tabular Data – 1961 Census of England and Wales
Proceedings of the 2019 Workshop on Historical Document Imaging and Processing (HIP2019), Sydney, Australia, September 2019, pp. 42-47
Highlights of the novel Dewaterability Estimation Test (DET) Device
Environmental Technology
Towards the Extraction of Statistical Information from Digitised Numerical Tables - The Medical Officer of Health Reports Scoping Study
Proceedings of Third International Conference on Digital Access to Textual Cultural Heritage (DATeCH 2019), Brussels, Belgium, 08 - 10 May 2019
ICFHR 2018 Competition on Recognition of Historical Arabic Scientific Manuscripts - RASM2018
Proceedings of the 17th International Workshop on Frontiers in Handwriting Recognition (ICFHR2018), Niagara Falls, USA, August 2018, pp. 471-476
Ontology and Framework for Semantic Labelling of Document Data and Software Methods
Proceedings of the 13th IAPR International Workshop on Document Analysis Systems (DAS2018), Vienna, Austria, April 24-27, 2018, pp. 73-78
Continuous Competition on Recognition of Documents with Complex Layouts - RDCL
Short Paper Booklet of the 13th International Association for Pattern Recognition (IAPR) Workshop on Document Analysis Systems (DAS2018), Vienna, Austria, April 2018, pp. 19-20
Effective Geometric Restoration of Distorted Historical Document for Large-Scale Digitization
IET Image Processing
ICDAR2017 Competition on Recognition of Early Indian Printed Documents – REID2017
Proceedings of the 14th International Conference on Document Analysis and Recognition (ICDAR2017), Kyoto, Japan, November 2017, pp. 1411-1416
ICDAR2017 Competition on Recognition of Documents with Complex Layouts – RDCL2017
Proceedings of the 14th International Conference on Document Analysis and Recognition (ICDAR2017), Kyoto, Japan, November 2017, pp. 1404-1410
Creating a Complete Workflow for Digitising Historical Census Documents: Considerations and Evaluation
Proceedings of the 2017 Workshop on Historical Document Imaging and Processing (HIP2017), Kyoto, Japan, November 2017, pp. 83-88
Unearthing the Recent Past: Digitising and Understanding Statistical Information from Census Tables
Proceedings of Second International Conference on Digital Access to Textual Cultural Heritage (DATeCH 2017), Goettingen, Germany, 01 - 02 June 2017
Making Europe’s Historical Newspapers Searchable
Proceedings of the 12th IAPR International Workshop on Document Analysis Systems (DAS2016), Santorini, Greece, April 11-14, 2016.
Quality Prediction System for Large-Scale Digitisation Workflows
Proceedings of the 12th IAPR International Workshop on Document Analysis Systems (DAS2016), Santorini, Greece, April 11-14, 2016
The ENP Image and Ground Truth Dataset of Historical Newspapers
Proceedings of the 13th International Conference on Document Analysis and Recognition (ICDAR2015), Nancy, France, August 2015, pp. 931-935
ICDAR2015 Competition on Recognition of Documents with Complex Layouts – RDCL2015
Proceedings of the 13th International Conference on Document Analysis and Recognition (ICDAR2015), Nancy, France, August 2015, pp. 1151-1155
Historical Typewritten Document Recognition Using Minimal User Interaction
Proceedings of the 2015 Workshop on Historical Document Imaging and Processing (HIP2015), Nancy, France, August 2015, pp. 31-38
Europeana Newspapers OCR Workflow Evaluation
Proceedings of the 2015 Workshop on Historical Document Imaging and Processing (HIP2015), Nancy, France, August 2015, pp. 39-46
Navigating the Storm: IMPACT, eMOP, and Agile Steering Standards
Digital Scholarship in the Humanities, 2015.
Document Representation Refinement for Precise Region Description
Proceedings of the First International Conference on Digital Access to Textual Cultural Heritage (DATeCH2014), Madrid, Spain, May 2014, pp. 9-13
Efficient OCR Training Data Generation with Aletheia
Short Paper Booklet of the 11th International Association for Pattern Recognition (IAPR) Workshop on Document Analysis Systems (DAS2014), Tours, France, April 2014, pp. 19-20
Distinction between handwritten and machine-printed text based on the bag of visual words model
Pattern Recognition, Available online 20 September 2013, ISSN 0031-3203
The Significance of Reading Order in Document Recognition and its Evaluation
Proceedings of the 12th International Conference on Document Analysis and Recognition (ICDAR2013), Washington DC, USA, August 2013, pp. 688-692
ICDAR2013 Competition on Historical Newspaper Layout Analysis – HNLA2013
Proceedings of the 12th International Conference on Document Analysis and Recognition (ICDAR2013), Washington DC, USA, August 2013, pp. 1486-1490
ICDAR2013 Competition on Historical Book Recognition – HBR2013
Proceedings of the 12th International Conference on Document Analysis and Recognition (ICDAR2013), Washington DC, USA, August 2013, pp. 1491-1495
The IMPACT Dataset of Historical Document Images
Proceedings of the 2013 Workshop on Historical Document Imaging and Processing (HIP2013), Washington DC, USA, August 2013, pp. 123-130
A robust hybrid approach for text line segmentation in historical documents
Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012), Tsukuba, Japan, November 11-15, 2012, IEEE-CS Press, pp. 335-338
Handwritten and Machine Printed Text Separation in Document Images using the Bag of Visual Words Paradigm
Proceedings of the 13th International Conference on Frontiers in Handwriting Recognition (ICFHR2012), Bari, Italy, September 2012, pp. 103-108
Restoration of Arbitrarily Warped Historical Document Images Using Flow Lines
Proceedings of the 11th International Conference on Document Analysis and Recognition (ICDAR2011), Beijing, China, September 2011, pp. 905-909
Scenario Driven In-Depth Performance Evaluation of Document Layout Analysis Methods
Proceedings of the 11th International Conference on Document Analysis and Recognition (ICDAR2011), Beijing, China, September 2011, pp. 1404-1408
Aletheia - An Advanced Document Layout and Text Ground-Truthing System for Production Environments
Proceedings of the 11th International Conference on Document Analysis and Recognition (ICDAR2011), Beijing, China, September 2011, pp. 48-52
Historical Document Layout Analysis Competition
Proceedings of the 11th International Conference on Document Analysis and Recognition (ICDAR2011), Beijing, China, September 2011, pp. 1516-1520
Grid-Based Modelling and Correction of Arbitrarily Warped Historical Document Images for Large-Scale Digitisation
Proceedings of the 2011 Workshop on Historical Document Imaging and Processing (HIP2011), Beijing, China, September 2011, pp. 106-111
Large-Scale Digitisation and Recognition of Opportunities for Image Processing and Analysis Historical Documents: Challenges and Analysis
Keynote presentation given at the Swedish Symposium on Image Analysis 2010 (SSBA2010), Uppsala, Sweden, March 11-12, 2010
The PAGE (Page Analysis and Ground-Truth Elements) Format Framework
Proceedings of the 20th International Conference on Pattern Recognition (ICPR2010), Istanbul, Turkey, August 23-26, 2010, IEEE-CS Press, pp. 257-260
A New Framework for Recognition of Heavily Degraded Characters in Historical Typewritten Documents Based on Semi-Supervised Clustering
Proceedings of the 10th International Conference on Document Analysis and Recognition (ICDAR2009), Barcelona, Spain, July 2009, pp. 506-510
A Self-Adaptive Method for Extraction of Document-Specific Alphabets
Proceedings of the 10th International Conference on Document Analysis and Recognition (ICDAR2009), Barcelona, Spain, July 2009, pp. 656-660
Word-Based Adaptive OCR for Historical Books
Proceedings of the 10th International Conference on Document Analysis and Recognition (ICDAR2009), Barcelona, Spain, July 2009, pp. 501-505
A Realistic Dataset for Performance Evaluation of Document Layout Analysis
Proceedings of the 10th International Conference on Document Analysis and Recognition (ICDAR2009), Barcelona, Spain, July 2009, pp. 296-300
ICDAR2009 Page Segmentation Competition
Proceedings of the 10th International Conference on Document Analysis and Recognition (ICDAR2009), Barcelona, Spain, July 2009, pp. 1370-1374
A Geometric Approach for Accurate and Efficient Performance Evaluation of Layout Analysis Methods
Proceedings of the 19th International Conference on Pattern Recognition (ICPR2008), Tampa, Florida, USA, December 7-11, 2008, IEEE-CS Press
Representation of Digitized Documents Using Document Specific Alphabets and Fonts
Proceedings of the 5th IS&T Archiving Conference, Bern, Switzerland, June 2008, pp. 198-202
Colour text segmentation in web images based on human perception
Image and Vision Computing, Volume 25, Issue 5, Elsevier, May 2007, pp. 564-577
ICDAR2007 Handwriting Segmentation Contest
Proceedings of the 9th International Conference on Document Analysis and Recognition (ICDAR2007), Curitiba, Brazil, September 2007, pp. 1284-1288
Performance Analysis Framework for Layout Analysis Methods
Proceedings of the 9th International Conference on Document Analysis and Recognition (ICDAR2007), Curitiba, Brazil, September 2007, pp. 1258-1262
ICDAR2007 Page Segmentation Competition
Proceedings of the 9th International Conference on Document Analysis and Recognition (ICDAR2007), Curitiba, Brazil, September 2007, pp. 1279-1283
Flexible Text Recovery from Degraded Typewritten Historical Documents
Proceedings of the 18th International Conference on Pattern Recognition (ICPR2006), Hong Kong, August 20-24, 2006, IEEE-CS Press, pp. 1062-1065
Vectorization of Glyphs and Their Representation in SVG for XML-Based Processing
Digital Spectrum: Integrating Technology and Culture, Proceedings of the 10th International Conference on Electronic Publishing, Bansko, Bulgaria, June 2006, pp. 299-308
Ground Truth for Layout Analysis Performance Evaluation
Document Analysis Systems VII: Proceedings of the 7th International Association for Pattern Recognition (IAPR) Workshop on Document Analysis Systems (DAS2006), H. Bunke and A.L. Spitz (Eds.), Springer Lecture Notes in Computer Science, LNCS 3872, Nelson, New Zealand, February 2006, pp. 302-311
Semantics-Based Content Extraction in Typewritten Historical Documents
Proceedings of the 8th International Conference on Document Analysis and Recognition (ICDAR2005), Seoul, South Korea, August/September 2005, pp. 48-53
ICDAR2005 Page Segmentation Competition
Proceedings of the 8th International Conference on Document Analysis and Recognition (ICDAR2005), Seoul, South Korea, August/September 2005, pp. 75-79
OCR Alternatives for Electronic Publishing of Digitised Documents
From Author to Reader: Challenges for the Digital Content Chain, Proceedings of the 9th ICCC International Conference on Electronic Publishing, Leuven-Heverlee, Belgium, June 2005, pp. 35-41
Text Extraction from Web Images Based on A Split-and-Merge Segmentation Method Using Color Perception
Proceedings of the 17th International Conference on Pattern Recognition (ICPR2004), Cambridge, UK, August 23-26, 2004, IEEE-CS Press, pp. 634-637
The Lifecycle of a Digital Historical Document: Structure and Content
Proceedings of the ACM Symposium on Document Engineering (DocEng2004), ACM Press, Milwaukee, Wisconsin, 28-30 October 2004, pp. 147-154
Document Image Analysis for World War II Personal Records
Proceedings of the International Workshop on Document Image Analysis for Libraries (DIAL2004), Palo Alto Research Center (PARC), USA, January 23-24, IEEE Computer Society Press, 2004, pp. 336-341
A Complete Approach to the Conversion of Typewritten Historical Documents for Digital Archives
Document Analysis Systems VI: Proceedings of the 6th International Association for Pattern Recognition (IAPR) Workshop on Document Analysis Systems (DAS2004), S. Marinai and A.R. Dengel (Eds.), Springer Lecture Notes in Computer Science, LNCS 3163, Florence, Italy, September 2004, pp. 90-101
A Robust Braille Recognition System
Document Analysis Systems VI: Proceedings of the 6th International Association for Pattern Recognition (IAPR) Workshop on Document Analysis Systems (DAS2004), S. Marinai and A.R. Dengel (Eds.), Springer Lecture Notes in Computer Science, LNCS 3163, Florence, Italy, September 2004, pp. 533-545
The Effects Of Inter and Intra Speaker Variability on Pathological Voice Quality Assessment
Proceedings of the 3rd International Workshop on Models and Analysis of Vocal Emissions for Biomedical Applications, Firenze, Italy 2003
Two Approaches for Text Segmentation in Web Images
Proceedings of the 7th International Conference on Document Analysis and Recognition (ICDAR2003), Edinburgh, UK, August 2003, pp. 131-136
ICDAR2003 Page Segmentation Competition
Proceedings of the 7th International Conference on Document Analysis and Recognition (ICDAR2003), Edinburgh, UK, August 2003, pp. 688-692
Exploiting Human Colour Perception to Segment Complex Colour Images
Proceedings of Visual Representations and Interpretations (VRI2002), Liverpool, UK, September 2002
Intelligent Classification and Staging of Acoustic Voice Data in Cancer of the Larynx Patients
Proceedings of the IEE Medical Applications of Signal Processing, London Oct. 2002
An Automated Tachograph Chart Analysis System
Document Analysis Systems V: Proceedings of the 5th International Association for Pattern Recognition (IAPR) Workshop on Document Analysis Systems (DAS2002), D. Lopresti, J. Hu and R. Kashi (Eds.), Springer Lecture Notes in Computer Science, LNCS 2423, Princeton NJ, USA, August 2002, pp. 544-555
A Ground-Truthing Tool for Layout Analysis Performance Evaluation
Document Analysis Systems V: Proceedings of the 5th International Association for Pattern Recognition (IAPR) Workshop on Document Analysis Systems (DAS2002), D. Lopresti, J. Hu and R. Kashi (Eds.), Springer Lecture Notes in Computer Science, LNCS 2423, Princeton NJ, USA, August 2002, pp. 263-244
Fuzzy Segmentation of Characters in Web Images Based on Human Colour Perception
Document Analysis Systems V: Proceedings of the 5th International Association for Pattern Recognition (IAPR) Workshop on Document Analysis Systems (DAS2002), D. Lopresti, J. Hu and R. Kashi (Eds.), Springer Lecture Notes in Computer Science, LNCS 2423, Princeton NJ, USA, August 2002, pp. 295-306
Text Extraction from Web Images Based on Human Perception and Fuzzy Inference
Proceedings of the First International Workshop on Web Document Analysis (WDA2001), Seattle, USA, September 2001, pp. 35-38
Accessing Textual Information Embedded in Internet Images
Proceedings of SPIE, Internet Imaging II, San Jose, USA, January 2001, Vol. 4311, pp. 198-205
Information Extraction from Complex Circular Charts
Proceedings of the 6th International Conference on Document Analysis and Recognition (ICDAR2001), Seattle, USA, September 2001, pp. 784-787
First International Newspaper Page Segmentation Contest
Proceedings of the 6th International Conference on Document Analysis and Recognition (ICDAR2001), Seattle, USA, September 2001, pp. 1190-1194
An Anthropocentric Approach to Text Extraction from WWW Images
Proceedings of the 4th IAPR International Workshop on Document Analysis Systems (DAS2000), Rio de Janeiro, Brazil, December 2000, pp. 515-525
A New Framework for Efficient and Flexible Analysis of the Performance of Document Image Analysis Subsystems
Proceedings of 7th International Conference on Image Processing and its Applications (IPA1999), Manchester, UK, July 1999, pp. 417-420
Region Description and Comparative Analysis Using a Tesseral Representation
Proceedings of 5th International Conference on Document Image Analysis and Recognition (ICDAR1999), Bangalore, India, September 1999, pp. 193-196
Methodology for Flexible and Efficient Analysis of the Performance of Page Segmentation Algorithms
Proceedings of 5th International Conference on Document Image Analysis and Recognition (ICDAR1999), Bangalore, India, September 1999, pp. 451-454
Performance Analysis of Document Image Analysis Subsystems
Digest of IEE Colloquium on Document Image Processing and Multimedia (DIPM), Manchester, UK, 1999, pp. 15/1-15/4
Document Image Compression by Structural Decomposition
Proceedings of the International Conference on Telecommunications 1998 (ICT1998), Chalkidiki, Greece, June 1998, pp. 166-171
Identification of Airfield Runways in Synthetic Aperture Radar Images
Proceedings of the 14th International Conference on Pattern Recognition (ICPR1998), Brisbane, Australia, August 17-20, 1998, IEEE-CS Press, Vol. II, pp. 1633-1636
Page Segmentation Using the Description of the Background
Computer Vision and Image Understanding, Special Issue on Document Analysis and Retrieval, Volume 70, Issue 3, June 1998, pp. 350-369
A Structural Approach for Smoothing Noisy Peak-Shaped Analytical Signals
Chemometrics and Intelligent Laboratory Systems, Volume 41, Issue 1, July 1998, pp. 31-42
Local Skew Angle Estimation from Background Space in Text Regions
Proceedings of the 4th International Conference on Document Analysis and Recognition (ICDAR1997), Ulm, Germany, August 1997, pp. 684-688
Presenting Legislation as Hypertext
Proceedings of the Fifth National/First European Conference on Law, Computers and Artificial Intelligence: EUCLID, Exeter, April 1996, pp. 1-9, ISBN 095278730X
Representation and Classification of Complex-Shaped Printed Regions Using White Tiles
Proceedings of the 3rd International Conference on Document Analysis and Recognition (ICDAR1995), Montreal, Canada, August 1995, pp. 1132-1135
Segmentation and Classification of Document Images Using the Background
Proceedings of the 5th Hellenic Conference on Informatics, Athens, Greece, December 7-9, 1995, pp. 927-937 (invited contribution)
Segmentation and Classification of Document Images
Digest of the IEE Colloquium on Document Image Processing and Multimedia Environments, The Institution of Electrical Engineers, November 1995, pp. 16/1-16/7, ISSN 0963-3308
Flexible Page Segmentation Using the Background
Proceedings of the IAPR 12th International Conference on Pattern Recognition (ICPR1994), Jerusalem, Israel, October 9-12, 1994, IEEE-CS Press, pp. 339-344
Segmentation of Layouts with Non-Rectangular Regions
Proceedings of the International Association of Pattern Recognition Workshop on Document Analysis Systems (DAS1994), Kaiserslautern, Germany, 18-20 October 1994, pp. 3-13
Proceedings of the 2010 ACM Symposium on Document Engineering (DocEng2010)
288 pages, ACM, September 21-24, 2010, ISBN: 978-1-4503-0231-9
Web Document Analysis II: Proceedings of the 2nd International Workshop on Web Document Analysis
PRImA, 2003, ISBN: 0-9541148-1-7
Proceedings of the 7th International Conference on Document Analysis and Recognition (ICDAR2003)
1304 pages, 2 volumes, IEEE Computer Society Press, August 2003, ISBN: 0-7695-1960-1
Web Document Analysis: Proceedings of the 1st International Workshop on Web Document Analysis
PRImA, 2001, ISBN: 0-9541148-0-9
The Analysis of Web Documents
in the book Digital Document Processing: Major Directions and Recent Advances, B.B. Chaudhuri (Ed.), Springer, Advances in Pattern Recognition Series, December 2006, ISBN: 978-1-84628-501-1, pp. 407-419
Visual Representation of Text in Web Documents and Its interpretation
in the book Studies in Multidisciplinarity: Multidisciplinary Approaches to Visual Representations and Interpretations, G.R. Malcolm (Ed.), Volume 2, Elsevier, 2005, pp. 181-196
A Fuzzy Approach to Text Segmentation in Web Images Based on Human Colour Perception
in the book: Web Document Analysis: Challenges and Opportunities, A. Antonacopoulos and J. Hu (Eds.), Series in Machine Perception and Artificial Intelligence, World Scientific Publishing Company, 2003, pp. 203-222
Automated Interpretation of Visual Representations: Extracting Textual Information from WWW Images
in the book Visual Representations and Interpretations, R. Paton and I. Neilson (Eds.), Springer, London, 1999, pp. 88-93
Analysis of Scanned Braille Documents
in the book Document Analysis Systems, A. Dengel and A.L. Spitz (Eds.), World Scientific Publishing Co., 1995, pp. 413-421
Web Document Analysis: Challenges and Opportunities
Series in Machine Perception and Artificial Intelligence, World Scientific Publishing Company, November 2003, ISBN: 978-981-238-582-6