SEG-ESRGAN: A Multi-Task Network for Super-Resolution and Semantic Segmentation of Remote Sensing Images

Salgueiro, Luis; Marcello Ruiz, Francisco Javier; Vilaplana, Verónica

Title:	SEG-ESRGAN: A Multi-Task Network for Super-Resolution and Semantic Segmentation of Remote Sensing Images
Authors:	Salgueiro, Luis; Marcello Ruiz, Francisco Javier; Vilaplana, Verónica
Keywords:	Multi-task network; Super-resolution; Semantic segmentation; Sentinel-2; WorldView-2
Publication date:	2022
Projects:	Procesado Avanzado de Datos de Teledetección Para la Monitorización y Gestión Sostenible de Recursos Marinos y Terrestres en Ecosistemas Vulnerables.
Journal:	Remote Sensing
Abstract:	The production of highly accurate land cover maps is one of the primary challenges in remote sensing, and it depends on the spatial resolution of the input images. Sometimes, high-resolution imagery is not available or is too expensive to cover large areas or to perform multitemporal analysis. In this context, we propose a multi-task network that takes advantage of the freely available Sentinel-2 imagery to produce a super-resolution image, with a scaling factor of 5, and the corresponding high-resolution land cover map. Our proposal, named SEG-ESRGAN, consists of two branches: the super-resolution branch, which produces Sentinel-2 multispectral images at 2 m resolution, and an encoder–decoder architecture for the semantic segmentation branch, which generates the enhanced land cover map. From the super-resolution branch, several skip connections are retrieved and concatenated with features from the different stages of the encoder part of the segmentation branch, promoting the flow of meaningful information to boost the accuracy of the segmentation task. Our model is trained with a multi-loss approach using a novel dataset to train and test the super-resolution stage, which is developed from Sentinel-2 and WorldView-2 image pairs. In addition, we generated a dataset with ground-truth labels for the segmentation task. To assess the super-resolution improvement, the PSNR, SSIM, ERGAS, and SAM metrics were considered, while to measure the classification performance, we used the IoU, the confusion matrix, and the F1-score. Experimental results demonstrate that the SEG-ESRGAN model outperforms different full segmentation and dual network models (U-Net, DeepLabV3+, HRNet and Dual_DeepLab), allowing the generation of high-resolution land cover maps in challenging scenarios using Sentinel-2 10 m bands.
URI:	https://accedacris.ulpgc.es/handle/10553/121364
ISSN:	2072-4292
DOI:	10.3390/rs14225862
Source:	Remote Sensing [ISSN 2072-4292], v. 14 (22), 5862, (November 2022)
Collection:	Articles
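
The abstract names several evaluation metrics without defining them. As a rough illustration only, the sketch below implements textbook definitions of three of them (PSNR, SAM, and per-class IoU with F1-score) in plain Python. The function names, the flat-list inputs, and the `max_value` parameter are assumptions made for this example; the paper's own implementation and any averaging over bands or classes may differ.

```python
import math


def psnr(reference, estimate, max_value=1.0):
    """Peak signal-to-noise ratio (dB) between two equal-length pixel lists."""
    if len(reference) != len(estimate):
        raise ValueError("inputs must have the same length")
    mse = sum((r - e) ** 2 for r, e in zip(reference, estimate)) / len(reference)
    if mse == 0:
        return math.inf  # identical images
    return 10.0 * math.log10(max_value ** 2 / mse)


def sam_degrees(reference, estimate):
    """Spectral angle (degrees) between the band vectors of a single pixel."""
    dot = sum(r * e for r, e in zip(reference, estimate))
    norm = math.sqrt(sum(r * r for r in reference)) * math.sqrt(
        sum(e * e for e in estimate)
    )
    # Clamp to [-1, 1] to guard against floating-point rounding.
    cosine = max(-1.0, min(1.0, dot / norm))
    return math.degrees(math.acos(cosine))


def iou_and_f1(truth, predicted, label):
    """Intersection over union and F1-score for one class label."""
    pairs = list(zip(truth, predicted))
    tp = sum(1 for t, p in pairs if t == label and p == label)
    fp = sum(1 for t, p in pairs if t != label and p == label)
    fn = sum(1 for t, p in pairs if t == label and p != label)
    iou = tp / (tp + fp + fn) if (tp + fp + fn) else 0.0
    f1 = 2 * tp / (2 * tp + fp + fn) if (2 * tp + fp + fn) else 0.0
    return iou, f1
```

For instance, `iou_and_f1([1, 1, 0, 0], [1, 0, 1, 0], 1)` counts one true positive, one false positive, and one false negative for class 1. Higher PSNR, IoU, and F1 values indicate better results, whereas a lower SAM angle indicates better spectral fidelity.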

Adobe PDF (19,56 MB)
