DOI: 10.46298/jdmdh.9806 ISSN: 2416-5999  
  
You Actually Look Twice At it (YALTAi): using an object detection approach instead of region segmentation within the Kraken engine
 Thibault Clérice       - General Earth and Planetary Sciences
- General Engineering
- General Environmental Science
    Layout Analysis (the identification of zones and their classification) is the first step along line segmentation in Optical Character Recognition and similar tasks. The ability of identifying main body of text from marginal text or running titles makes the difference between extracting the work full text of a digitized book and noisy outputs. We show that most segmenters focus on pixel classification and that polygonization of this output has not been used as a target for the latest competition on historical document (ICDAR 2017 and onwards), despite being the focus in the early 2010s. We propose to shift, for efficiency, the task from a pixel classification-based polygonization to an object detection using isothetic rectangles. We compare the output of Kraken and YOLOv5 in terms of segmentation and show that the later severely outperforms the first on small datasets (1110 samples and below). We release two datasets for training and evaluation on historical documents as well as a new package, YALTAi, which injects YOLOv5 in the segmentation pipeline of Kraken 4.1.      
    More from our Archive
   -    DOI: 10.1029/2023gl105332 2023  Two Competing Drivers of the Recent Walker Circulation TrendMasahiro Watanabe, Tomoki Iwakiri, Yue Dong, Sarah M. Kang 
-    DOI: 10.3390/rs15235552 2023  MFTSC: A Semantically Constrained Method for Urban Building Height Estimation Using Multiple Source ImagesYuhan Chen, Qingyun Yan, Weimin Huang 
-    DOI: 10.1093/femsmc/xtad021 2023  A targeted approach to enrich host-associated bacteria for metagenomic sequencingAshley M Dungan, Kshitij Tandon, Vanta Jameson, Cecilie Ravn Gotze, Linda L Blackall, Madeleine J H van Oppen 
-    DOI: 10.1029/2023gl105435 2023  Dynamic Response to Ice Shelf Basal Meltwater Relevant to Explain Observed Sea Ice Trends Near the Antarctic Continental ShelfWilma G. C. Huneke, William R. Hobbs, Andreas Klocker, Kaitlin A. Naughten 
-    DOI: 10.46298/jdmdh.9806 2023  You Actually Look Twice At it (YALTAi): using an object detection approach instead of region segmentation within the Kraken engineThibault Clérice 
-    DOI: 10.1029/2023gl105948 2023  Winds and Meltwater Together Lead to Southern Ocean Surface Cooling and Sea Ice ExpansionLettie A. Roach, Kenneth D. Mankoff, Anastasia Romanou, Edward Blanchard‐Wrigglesworth, Thomas W. N. Haine, Gavin. A. Schmidt 
-    DOI: 10.1029/2023gl105755 2023  Measuring Carbon Dioxide Emissions From Liquefied Natural Gas (LNG) Terminals With Imaging SpectroscopyZhan Zhang, Daniel H. Cusworth, Alana K. Ayasse, Evan D. Sherwin, Adam R. Brandt 
-    DOI: 10.3390/rs15235575 2023  ERF-RTMDet: An Improved Small Object Detection Method in Remote Sensing ImagesShuo Liu, Huanxin Zou, Yazhe Huang, Xu Cao, Shitian He, Meilin Li, Yuqing Zhang 
-    DOI: 10.3390/rs15235580 2023  Remote Sensing Application in Chinese Medicinal Plant Identification and Acreage Estimation—A ReviewJihua Meng, Xinyan You, Xiaobo Zhang, Tingting Shi, Lei Zhang, Xingfeng Chen, Hailan Zhao, Meng Xu 
-    DOI: 10.3390/rs15235579 2023  Enhancing Path Planning Efficiency for Underwater Gravity Matching Navigation with a Novel Three-Dimensional Along-Path Obstacle Profiling AlgorithmXiaocong Zhou, Wei Zheng, Zhaowei Li, Panlong Wu, Yongjin Sun