CGSANet: A Contour-Guided and Local Structure-Aware Encoder-Decoder Network for Accurate Building Extraction from Very High-Resolution Remote Sensing Imagery

Shanxiong Chen, Wenzhong Shi, Mingting Zhou, Min Zhang, Zhaoxin Xuan

Research output: Journal article publicationJournal articleAcademic researchpeer-review

2 Citations (Scopus)

Abstract

Extracting buildings accurately from very high-resolution (VHR) remote sensing imagery is challenging due to diverse building appearances, spectral variability, and complex background in VHR remote sensing images. Recent studies mainly adopt a variant of the fully convolutional network (FCN) with an encoder-decoder architecture to extract buildings, which has shown promising improvement over conventional methods. However, FCN-based encoder-decoder models still fail to fully utilize the implicit characteristics of building shapes. This adversely affects the accurate localization of building boundaries, which is particularly relevant in building mapping. A contour-guided and local structure-aware encoder-decoder network (CGSANet) is proposed to extract buildings with more accurate boundaries. CGSANet is a multitask network composed of a contour-guided (CG) and a multiregion-guided (MRG) module. The CG module is supervised by a building contour that effectively learns building contour-related spatial features to retain the shape pattern of buildings. The MRG module is deeply supervised by four building regions that further capture multiscale and contextual features of buildings. In addition, a hybrid loss function was designed to improve the structure learning ability of CGSANet. These three improvements benefit each other synergistically to produce high-quality building extraction results. Experimental results on the WHU and NZ32km2 building datasets demonstrate that compared with the tested algorithms, CGSANet can produce more accurate building extraction results and achieve the best intersection over union value 91.55% and 90.02%, respectively. Experiments on the INRIA building dataset further demonstrate the ability for generalization of the proposed framework, indicating great practical potential.

Original languageEnglish
Pages (from-to)1526-1542
Number of pages17
JournalIEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
Volume15
DOIs
Publication statusPublished - Dec 2021

Keywords

  • Building extraction
  • fully convolutional network (FCN)
  • hybrid loss function
  • multitask learning
  • very high resolution (VHR) remote sensing imagery

ASJC Scopus subject areas

  • Computers in Earth Sciences
  • Atmospheric Science

Fingerprint

Dive into the research topics of 'CGSANet: A Contour-Guided and Local Structure-Aware Encoder-Decoder Network for Accurate Building Extraction from Very High-Resolution Remote Sensing Imagery'. Together they form a unique fingerprint.

Cite this