DOI: 10.46298/jdmdh.9806 ISSN: 2416-5999
You Actually Look Twice At it (YALTAi): using an object detection approach instead of region segmentation within the Kraken engine
Thibault Clérice - General Earth and Planetary Sciences
- General Engineering
- General Environmental Science
Layout Analysis (the identification of zones and their classification) is the first step along line segmentation in Optical Character Recognition and similar tasks. The ability of identifying main body of text from marginal text or running titles makes the difference between extracting the work full text of a digitized book and noisy outputs. We show that most segmenters focus on pixel classification and that polygonization of this output has not been used as a target for the latest competition on historical document (ICDAR 2017 and onwards), despite being the focus in the early 2010s. We propose to shift, for efficiency, the task from a pixel classification-based polygonization to an object detection using isothetic rectangles. We compare the output of Kraken and YOLOv5 in terms of segmentation and show that the later severely outperforms the first on small datasets (1110 samples and below). We release two datasets for training and evaluation on historical documents as well as a new package, YALTAi, which injects YOLOv5 in the segmentation pipeline of Kraken 4.1.
More from our Archive
-
DOI: 10.1029/2023gl105332 2023
Two Competing Drivers of the Recent Walker Circulation Trend Masahiro Watanabe, Tomoki Iwakiri, Yue Dong, Sarah M. Kang
-
DOI: 10.3390/rs15235552 2023
MFTSC: A Semantically Constrained Method for Urban Building Height Estimation Using Multiple Source Images Yuhan Chen, Qingyun Yan, Weimin Huang
-
DOI: 10.1093/femsmc/xtad021 2023
A targeted approach to enrich host-associated bacteria for metagenomic sequencing Ashley M Dungan, Kshitij Tandon, Vanta Jameson, Cecilie Ravn Gotze, Linda L Blackall, Madeleine J H van Oppen
-
DOI: 10.1029/2023gl105435 2023
Dynamic Response to Ice Shelf Basal Meltwater Relevant to Explain Observed Sea Ice Trends Near the Antarctic Continental Shelf Wilma G. C. Huneke, William R. Hobbs, Andreas Klocker, Kaitlin A. Naughten
-
DOI: 10.46298/jdmdh.9806 2023
You Actually Look Twice At it (YALTAi): using an object detection approach instead of region segmentation within the Kraken engine Thibault Clérice
-
DOI: 10.1029/2023gl105948 2023
Winds and Meltwater Together Lead to Southern Ocean Surface Cooling and Sea Ice Expansion Lettie A. Roach, Kenneth D. Mankoff, Anastasia Romanou, Edward Blanchard‐Wrigglesworth, Thomas W. N. Haine, Gavin. A. Schmidt
-
DOI: 10.1029/2023gl105755 2023
Measuring Carbon Dioxide Emissions From Liquefied Natural Gas (LNG) Terminals With Imaging Spectroscopy Zhan Zhang, Daniel H. Cusworth, Alana K. Ayasse, Evan D. Sherwin, Adam R. Brandt
-
DOI: 10.3390/rs15235575 2023
ERF-RTMDet: An Improved Small Object Detection Method in Remote Sensing Images Shuo Liu, Huanxin Zou, Yazhe Huang, Xu Cao, Shitian He, Meilin Li, Yuqing Zhang
-
DOI: 10.3390/rs15235580 2023
Remote Sensing Application in Chinese Medicinal Plant Identification and Acreage Estimation—A Review Jihua Meng, Xinyan You, Xiaobo Zhang, Tingting Shi, Lei Zhang, Xingfeng Chen, Hailan Zhao, Meng Xu
-
DOI: 10.3390/rs15235579 2023
Enhancing Path Planning Efficiency for Underwater Gravity Matching Navigation with a Novel Three-Dimensional Along-Path Obstacle Profiling Algorithm Xiaocong Zhou, Wei Zheng, Zhaowei Li, Panlong Wu, Yongjin Sun