Skew detection and correction in document images based on straight-line fitting

Yang Cao, Shuhua Wang, Heng Li

Research output: Journal article publicationJournal articleAcademic researchpeer-review

40 Citations (Scopus)

Abstract

During document scanning, skew is inevitably introduced into the incoming document image. Since the algorithms for layout analysis and character recognition are generally very sensitive to the page skew, skew detection and correction in document images are the critical steps before layout analysis. In this paper, a novel skew detection method based on straight-line fitting is proposed. And a concept of eigen-point is introduced. After the relations between the successive eigen-points in every text line within a suitable sub-region were analyzed, the eigen-points most possibly laid on the baselines are selected as samples for the straight-line fitting. The average of these baseline directions is computed, which corresponds to the degree of skew of the whole document image. Then a fast skew correction method based on the scanning line model is also presented. Experiments prove that the proposed approaches are fast and accurate.
Original languageEnglish
Pages (from-to)1871-1879
Number of pages9
JournalPattern Recognition Letters
Volume24
Issue number12
DOIs
Publication statusPublished - 1 Jan 2003

Keywords

  • Connected component
  • Document analysis
  • Eigen-point
  • Skew correction
  • Skew detection

ASJC Scopus subject areas

  • Software
  • Signal Processing
  • Computer Vision and Pattern Recognition
  • Artificial Intelligence

Cite this