Empirical Study of Multi-Task Hourglass Model for Semantic Segmentation Task
The semantic segmentation (SS) task aims to create a dense classification by labeling at the pixel level each object present on images. Convolutional neural network (CNN) approaches have been widely used, and exhibited the best results in this task. However, the loss of spatial precision on the resu...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2021-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/9444361/ |
id |
doaj-d5a6ce39da584f208203baddc7952ad3 |
---|---|
record_format |
Article |
spelling |
doaj-d5a6ce39da584f208203baddc7952ad32021-06-07T23:00:32ZengIEEEIEEE Access2169-35362021-01-019806548067010.1109/ACCESS.2021.30852189444361Empirical Study of Multi-Task Hourglass Model for Semantic Segmentation TaskDarwin Saire0https://orcid.org/0000-0001-9683-5405Adin Ramirez Rivera1https://orcid.org/0000-0002-4321-9075Institute of Computing, University of Campinas, Campinas, BrazilInstitute of Computing, University of Campinas, Campinas, BrazilThe semantic segmentation (SS) task aims to create a dense classification by labeling at the pixel level each object present on images. Convolutional neural network (CNN) approaches have been widely used, and exhibited the best results in this task. However, the loss of spatial precision on the results is a main drawback that has not been solved. In this work, we propose to use a multi-task approach by complementing the semantic segmentation task with edge detection, semantic contour, and distance transform tasks. We propose that by sharing a common latent space, the complementary tasks can produce more robust representations that can enhance the semantic labels. We explore the influence of contour-based tasks on latent space, as well as their impact on the final results of SS. We demonstrate the effectiveness of learning in a multi-task setting for hourglass models in the Cityscapes, CamVid, and Freiburg Forest datasets by improving the state-of-the-art without any refinement post-processing.https://ieeexplore.ieee.org/document/9444361/Explainable latent spacesmulti-task learningsemantic segmentation |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Darwin Saire Adin Ramirez Rivera |
spellingShingle |
Darwin Saire Adin Ramirez Rivera Empirical Study of Multi-Task Hourglass Model for Semantic Segmentation Task IEEE Access Explainable latent spaces multi-task learning semantic segmentation |
author_facet |
Darwin Saire Adin Ramirez Rivera |
author_sort |
Darwin Saire |
title |
Empirical Study of Multi-Task Hourglass Model for Semantic Segmentation Task |
title_short |
Empirical Study of Multi-Task Hourglass Model for Semantic Segmentation Task |
title_full |
Empirical Study of Multi-Task Hourglass Model for Semantic Segmentation Task |
title_fullStr |
Empirical Study of Multi-Task Hourglass Model for Semantic Segmentation Task |
title_full_unstemmed |
Empirical Study of Multi-Task Hourglass Model for Semantic Segmentation Task |
title_sort |
empirical study of multi-task hourglass model for semantic segmentation task |
publisher |
IEEE |
series |
IEEE Access |
issn |
2169-3536 |
publishDate |
2021-01-01 |
description |
The semantic segmentation (SS) task aims to create a dense classification by labeling at the pixel level each object present on images. Convolutional neural network (CNN) approaches have been widely used, and exhibited the best results in this task. However, the loss of spatial precision on the results is a main drawback that has not been solved. In this work, we propose to use a multi-task approach by complementing the semantic segmentation task with edge detection, semantic contour, and distance transform tasks. We propose that by sharing a common latent space, the complementary tasks can produce more robust representations that can enhance the semantic labels. We explore the influence of contour-based tasks on latent space, as well as their impact on the final results of SS. We demonstrate the effectiveness of learning in a multi-task setting for hourglass models in the Cityscapes, CamVid, and Freiburg Forest datasets by improving the state-of-the-art without any refinement post-processing. |
topic |
Explainable latent spaces multi-task learning semantic segmentation |
url |
https://ieeexplore.ieee.org/document/9444361/ |
work_keys_str_mv |
AT darwinsaire empiricalstudyofmultitaskhourglassmodelforsemanticsegmentationtask AT adinramirezrivera empiricalstudyofmultitaskhourglassmodelforsemanticsegmentationtask |
_version_ |
1721391184215015424 |