Empirical Study of Multi-Task Hourglass Model for Semantic Segmentation Task

The semantic segmentation (SS) task aims to create a dense classification by labeling at the pixel level each object present on images. Convolutional neural network (CNN) approaches have been widely used, and exhibited the best results in this task. However, the loss of spatial precision on the resu...

Full description

Bibliographic Details
Main Authors: Darwin Saire, Adin Ramirez Rivera
Format: Article
Language:English
Published: IEEE 2021-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9444361/
id doaj-d5a6ce39da584f208203baddc7952ad3
record_format Article
spelling doaj-d5a6ce39da584f208203baddc7952ad32021-06-07T23:00:32ZengIEEEIEEE Access2169-35362021-01-019806548067010.1109/ACCESS.2021.30852189444361Empirical Study of Multi-Task Hourglass Model for Semantic Segmentation TaskDarwin Saire0https://orcid.org/0000-0001-9683-5405Adin Ramirez Rivera1https://orcid.org/0000-0002-4321-9075Institute of Computing, University of Campinas, Campinas, BrazilInstitute of Computing, University of Campinas, Campinas, BrazilThe semantic segmentation (SS) task aims to create a dense classification by labeling at the pixel level each object present on images. Convolutional neural network (CNN) approaches have been widely used, and exhibited the best results in this task. However, the loss of spatial precision on the results is a main drawback that has not been solved. In this work, we propose to use a multi-task approach by complementing the semantic segmentation task with edge detection, semantic contour, and distance transform tasks. We propose that by sharing a common latent space, the complementary tasks can produce more robust representations that can enhance the semantic labels. We explore the influence of contour-based tasks on latent space, as well as their impact on the final results of SS. We demonstrate the effectiveness of learning in a multi-task setting for hourglass models in the Cityscapes, CamVid, and Freiburg Forest datasets by improving the state-of-the-art without any refinement post-processing.https://ieeexplore.ieee.org/document/9444361/Explainable latent spacesmulti-task learningsemantic segmentation
collection DOAJ
language English
format Article
sources DOAJ
author Darwin Saire
Adin Ramirez Rivera
spellingShingle Darwin Saire
Adin Ramirez Rivera
Empirical Study of Multi-Task Hourglass Model for Semantic Segmentation Task
IEEE Access
Explainable latent spaces
multi-task learning
semantic segmentation
author_facet Darwin Saire
Adin Ramirez Rivera
author_sort Darwin Saire
title Empirical Study of Multi-Task Hourglass Model for Semantic Segmentation Task
title_short Empirical Study of Multi-Task Hourglass Model for Semantic Segmentation Task
title_full Empirical Study of Multi-Task Hourglass Model for Semantic Segmentation Task
title_fullStr Empirical Study of Multi-Task Hourglass Model for Semantic Segmentation Task
title_full_unstemmed Empirical Study of Multi-Task Hourglass Model for Semantic Segmentation Task
title_sort empirical study of multi-task hourglass model for semantic segmentation task
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2021-01-01
description The semantic segmentation (SS) task aims to create a dense classification by labeling at the pixel level each object present on images. Convolutional neural network (CNN) approaches have been widely used, and exhibited the best results in this task. However, the loss of spatial precision on the results is a main drawback that has not been solved. In this work, we propose to use a multi-task approach by complementing the semantic segmentation task with edge detection, semantic contour, and distance transform tasks. We propose that by sharing a common latent space, the complementary tasks can produce more robust representations that can enhance the semantic labels. We explore the influence of contour-based tasks on latent space, as well as their impact on the final results of SS. We demonstrate the effectiveness of learning in a multi-task setting for hourglass models in the Cityscapes, CamVid, and Freiburg Forest datasets by improving the state-of-the-art without any refinement post-processing.
topic Explainable latent spaces
multi-task learning
semantic segmentation
url https://ieeexplore.ieee.org/document/9444361/
work_keys_str_mv AT darwinsaire empiricalstudyofmultitaskhourglassmodelforsemanticsegmentationtask
AT adinramirezrivera empiricalstudyofmultitaskhourglassmodelforsemanticsegmentationtask
_version_ 1721391184215015424