A New Semi-automatic Annotation Model via Semantic Boundary Estimation for Scene Text Detection

Zhenzhou Zhuang, Zonghao Liu, Kin Man Lam, Shuangping Huang, Gang Dai

Research output: Chapter in book / Conference proceedingConference article published in proceeding or bookAcademic researchpeer-review

Abstract

Manually annotating a data set for scene text detection is extremely time-consuming. In this paper, we propose a new semi-automatic annotation model to produce tight polygonal annotations for text instances in scene images, based on the input of manually annotated text center lines. Our approach first generates multiple candidate boundaries, which share the same input center line. Then, by training a fastidious content recognizer, optimal boundary selection is performed. The bounded text region, which achieves the smallest recognition loss, is selected as the tightest of the text. As this optimal boundary estimation is guided by semantic recognition, our method is called Semantic Boundary Estimation. Experiment results show that only half clicks compared to manually annotated polygon, are input to annotate center line, and precise polygon text region annotation is automatically produced. A high recall of more than 95% at IoU > 0.5 and 80% at IoU > 0.7 is achieved, demonstrating the high agreement with the original ground truth. In addition, using the generated annotations on benchmarks, such as Total-Text, CTW1500 and ICDAR2015, to train state-of-the-art detectors can achieve similar performance to those trained with manual annotations. This further verifies the good annotation performance. A annotation toolkit based on the proposed model is available at CenterlineAnnotation.

Original languageEnglish
Title of host publicationDocument Analysis and Recognition - ICDAR 2021 - 16th International Conference, Proceedings
EditorsJosep Lladós, Daniel Lopresti, Seiichi Uchida
PublisherSpringer Science and Business Media Deutschland GmbH
Pages257-273
Number of pages17
ISBN (Print)9783030863333
DOIs
Publication statusPublished - Sep 2021
Event16th International Conference on Document Analysis and Recognition, ICDAR 2021 - Lausanne, Switzerland
Duration: 5 Sep 202110 Sep 2021

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume12823 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference16th International Conference on Document Analysis and Recognition, ICDAR 2021
Country/TerritorySwitzerland
CityLausanne
Period5/09/2110/09/21

Keywords

  • Scene text detection
  • Semantic boundary estimation
  • Semi-automatic annotation algorithm

ASJC Scopus subject areas

  • Theoretical Computer Science
  • Computer Science(all)

Cite this