Abstract
During document scanning, skew is inevitably introduced into the incoming document image. Since the algorithms for layout analysis and character recognition are generally very sensitive to the page skew, skew detection and correction in document images are the critical steps before layout analysis. In this paper, a novel skew detection method based on straight-line fitting is proposed. And a concept of eigen-point is introduced. After the relations between the successive eigen-points in every text line within a suitable sub-region were analyzed, the eigen-points most possibly laid on the baselines are selected as samples for the straight-line fitting. The average of these baseline directions is computed, which corresponds to the degree of skew of the whole document image. Then a fast skew correction method based on the scanning line model is also presented. Experiments prove that the proposed approaches are fast and accurate.
Original language | English |
---|---|
Pages (from-to) | 1871-1879 |
Number of pages | 9 |
Journal | Pattern Recognition Letters |
Volume | 24 |
Issue number | 12 |
DOIs | |
Publication status | Published - 1 Jan 2003 |
Keywords
- Connected component
- Document analysis
- Eigen-point
- Skew correction
- Skew detection
ASJC Scopus subject areas
- Software
- Signal Processing
- Computer Vision and Pattern Recognition
- Artificial Intelligence