Cookie Name	Cookie Description	When not logged in	When logged in
prima_cookies	Remembers whether you have already closed this message.	Yes	Yes
prima_notice	Remembers if you have alreaded viewed any notice/warning message(s). Such a message is used to inform users of potential downtime or issues that might affect the normal operation of the website. It is set to expire after the date when such notice is obsolete (eg after an expected downtime/error is fixed).	Yes	Yes
PHPSESSID	The ID of your session.	Yes	Yes
__utma	This is set by Google Analytics. It stores each user's amount of visits, and the time of the first visit, the previous visit, and the current visit.	Yes	Yes
__utmb, __utmc	These are set by Google Analytics. They are used to check approximately how long you stay on a site (when a visit starts, and approximately ends).	Yes	Yes
__utmz	This is set by Google Analytics. It stores where a visitor came from (search engine, search keyword, link).	Yes	Yes

ICDAR2017 Competition on Recognition of Documents with Complex Layouts – RDCL2017

C. Clausner, A. Antonacopoulos, S. Pletschacher

Proceedings of the 14th International Conference on Document Analysis and Recognition (ICDAR2017), Kyoto, Japan, November 2017, pp. 1404-1410

Abstract

This paper presents an objective comparative evaluation of page segmentation and region classification methods for documents with complex layouts. It describes the competition (modus operandi, dataset and evaluation methodology) held in the context of ICDAR2017, presenting the results of the evaluation of seven methods – five submitted, two state-of-the-art systems (commercial and open-source). Three scenarios are reported in this paper, one evaluating the ability of methods to accurately segment regions and two evaluating both segmentation and region classification (one focusing only on text regions). For the first time, nested region content (table cells, chart labels etc.) are evaluated in addition to the top-level page content. Text recognition was a bonus challenge and was not taken up by all participants. The results indicate that an innovative approach has a clear advantage but there is still a considerable need to develop robust methods that deal with layout challenges, especially with the non-textual content.

Citation

C. Clausner, A. Antonacopoulos, S. Pletschacher , "ICDAR2017 Competition on Recognition of Documents with Complex Layouts – RDCL2017", Proceedings of the 14th International Conference on Document Analysis and Recognition (ICDAR2017), Kyoto, Japan, November 2017, pp. 1404-1410

DOI

10.1109/ICDAR.2017.229

Full Paper

Download PDF

PRImA

ICDAR2017 Competition on Recognition of Documents with Complex Layouts – RDCL2017

Abstract

Citation

DOI

Full Paper