Cookie Name	Cookie Description	When not logged in	When logged in
prima_cookies	Remembers whether you have already closed this message.	Yes	Yes
prima_notice	Remembers if you have alreaded viewed any notice/warning message(s). Such a message is used to inform users of potential downtime or issues that might affect the normal operation of the website. It is set to expire after the date when such notice is obsolete (eg after an expected downtime/error is fixed).	Yes	Yes
PHPSESSID	The ID of your session.	Yes	Yes
__utma	This is set by Google Analytics. It stores each user's amount of visits, and the time of the first visit, the previous visit, and the current visit.	Yes	Yes
__utmb, __utmc	These are set by Google Analytics. They are used to check approximately how long you stay on a site (when a visit starts, and approximately ends).	Yes	Yes
__utmz	This is set by Google Analytics. It stores where a visitor came from (search engine, search keyword, link).	Yes	Yes

Document Representation Refinement for Precise Region Description

C. Clausner, S. Pletschacher, A. Antonacopoulos

Proceedings of the First International Conference on Digital Access to Textual Cultural Heritage (DATeCH2014), Madrid, Spain, May 2014, pp. 9-13

Abstract

Precise description of layout entities (content regions on a page) is crucial for all but the most trivial document analysis and recognition applications. The output of layout analysis methods and state-of-the-art OCR systems varies significantly, from bounding boxes (e.g. Tesseract) to stacks of text line rectangles (e.g. ABBYY FineReader). There is a clear need for a consistent and accurate representation of regions (e.g. text paragraphs, graphics entities etc.) for further processing, correction and performance evaluation (comparison of segmentation results with ground truth regions). This paper describes a method for refinement of document representations by fitting polygons around lower-level layout objects (such as text lines, words and glyphs) in a systematic way that reconstructs region outlines and preserves the fine details of complex layouts. Experimental results on a standard dataset demonstrate the validity and usefulness of the proposed approach.

Citation

C. Clausner, S. Pletschacher, A. Antonacopoulos , "Document Representation Refinement for Precise Region Description", Proceedings of the First International Conference on Digital Access to Textual Cultural Heritage (DATeCH2014), Madrid, Spain, May 2014, pp. 9-13

DOI

10.1145/2595188.2595198

Full Paper

Download PDF

Related Projects

"") { $image_src = Constants::SITE_ROOT().'/www/media/projects/no_image.png'; $image_alt = "No image."; } else $image_src = Constants::SITE_ROOT().'/www/media/projects/no_image.png'; ?>
Warning: Undefined variable $image_src in /media/PrimaStorage/wwwroot/www/www/views/publication_details.phtml on line 63
"") { $image_src = Constants::SITE_ROOT().'/www/media/projects/no_image.png'; $image_alt = "No image."; } else $image_src = Constants::SITE_ROOT().'/www/media/projects/no_image.png'; ?>
Warning: Undefined variable $image_src in /media/PrimaStorage/wwwroot/www/www/views/publication_details.phtml on line 63

PRImA

Document Representation Refinement for Precise Region Description

Abstract

Citation

DOI

Full Paper

Related Projects