Please use this identifier to cite or link to this item: http://hdl.handle.net/10553/121364
Title: SEG-ESRGAN: A Multi-Task Network for Super-Resolution and Semantic Segmentation of Remote Sensing Images
Authors: Salgueiro, Luis
Marcello Ruiz, Francisco Javier 
Vilaplana, Verónica
Keywords: Multi-task network
Super-resolution
Semantic segmentation
Sentinel-2
WorldView-2
Issue Date: 2022
Project: Procesado Avanzado de Datos de Teledetección Para la Monitorizacióny Gestión Sostenible de Recursos Marinosy Terrestres en Ecosistemas Vulnerables. 
Journal: Remote Sensing 
Abstract: The production of highly accurate land cover maps is one of the primary challenges in remote sensing, which depends on the spatial resolution of the input images. Sometimes, high-resolution imagery is not available or is too expensive to cover large areas or to perform multitemporal analysis. In this context, we propose a multi-task network to take advantage of the freely available Sentinel-2 imagery to produce a super-resolution image, with a scaling factor of 5, and the corresponding high-resolution land cover map. Our proposal, named SEG-ESRGAN, consists of two branches: the super-resolution branch, that produces Sentinel-2 multispectral images at 2 m resolution, and an encoder–decoder architecture for the semantic segmentation branch, that generates the enhanced land cover map. From the super-resolution branch, several skip connections are retrieved and concatenated with features from the different stages of the encoder part of the segmentation branch, promoting the flow of meaningful information to boost the accuracy in the segmentation task. Our model is trained with a multi-loss approach using a novel dataset to train and test the super-resolution stage, which is developed from Sentinel-2 and WorldView-2 image pairs. In addition, we generated a dataset with ground-truth labels for the segmentation task. To assess the super-resolution improvement, the PSNR, SSIM, ERGAS, and SAM metrics were considered, while to measure the classification performance, we used the IoU, confusion matrix and the F1-score. Experimental results demonstrate that the SEG-ESRGAN model outperforms different full segmentation and dual network models (U-Net, DeepLabV3+, HRNet and Dual_DeepLab), allowing the generation of high-resolution land cover maps in challenging scenarios using Sentinel-2 10 m bands.
URI: http://hdl.handle.net/10553/121364
ISSN: 2072-4292
DOI: 10.3390/rs14225862
Source: Remote Sensing [ISSN 2072-4292], v. 14 (22), 5862, (Noviembre 2022)
Appears in Collections:Artículos
Adobe PDF (19,56 MB)
Show full item record

Google ScholarTM

Check

Altmetric


Share



Export metadata



Items in accedaCRIS are protected by copyright, with all rights reserved, unless otherwise indicated.