Not registered? - Request an account here
Method | Authors and Description |
---|---|
BINYAS | Showmik Bhowmik, Soumyadeep Kundu, Ram Sarkar - Department of Computer Science and Engineering, Jadavpur University, India. Mainly based on connected component analysis and morphology. |
BKZA | Duc Nguyen, Cuong Ha - Ho Chi Minh City Uni-versity of Technology. Using deep learning to segment page and heuristic algorithms for post-processing. |
DSPH | Tan Lu, Ann Dooms - Vrije Universiteit Brussel. Document Segmentation with Probabilistic Homogeneity, using: binarization, text / non-text classification, text region extraction, non-text structure extraction. |
JBM | Klára Janoušková, Michal Bušta and Jiří Matas - Czech Technical University, Prague. The method uses following steps: Obtaining segmentation maps from a CNN, post-processing based on polygons, OCR (CNN-based). |
LingDIAR | Dou Haobin - Lingban Tech Co., Ltd. Based on a multi-task deep network model trained with synthetic document images. |
MHS | Tuan Anh Trana, Nam Quan Nguyena, Quoc Thang Nguyena, Hai Duong Nguyenb, Soo Hyung Kimc - (a): HoChiMinh National University City - HCMUT, Viet Nam & Cinnamon AI, (b): Concor-dia University, Canada, (c): Chonnam National University, Gwangju, Republic of Korea. Based on following steps: Negative/positive image detection, binarization, text / non-text classification, text segmentation, image classification, region refinement + labelling, OCR. |
MICS | Yassine Ouali, Céline Hudelot - MICS, Centrale-Supélec, France. Method similar to Pyramid Scene Parsing Network in combination with training on augmented data and final inference / post-processing steps. |
TAQ | Nam Quan Nguyen, Tran Hai Anh Vo, Quoc Thang Nguyen - Cinnamon AI Lab Inc. Based on following steps: text / non-text classification, text region improvements / smoothing, non-text classification. |
ZLCW | Chendi Zang, Hui Li, Xinfeng Chang, Yaqiang Wu - Lenovo Research. Using FCN(fully convolutional networks) trained with 15 original training images, augmented to 3960 images (using cropping, gaussian blur, adding random noise as well as colour jittering). |
FRE11 | ABBYY FineReader Engine 11 with PRImA FineReader-to-PAGE wrapper. Not trained or optimised for this competition. |
FRE12 | ABBYY FineReader Engine 12 with PRImA FineReader-to-PAGE wrapper. Not trained or optimised for this competition. |
Tess.4 | Tesseract 4 with PRImA Tesseract-to-PAGE wrap-per. Not trained or optimised for this competition. |