Image Object Extraction Based on Semantic Segmentation and Label Loss
Object extraction refers to the operation of obtaining an object area from an image based on a small amount of mark information given by users, which is a key step in image processing. In order to obtain a complete object profile, current methods usually require a large number of manual annotations,...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2020-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/9108274/ |
id |
doaj-4c31f6c9ca324357a08a6647843646b5 |
---|---|
record_format |
Article |
spelling |
doaj-4c31f6c9ca324357a08a6647843646b52021-03-30T02:35:58ZengIEEEIEEE Access2169-35362020-01-01810932510933410.1109/ACCESS.2020.29999429108274Image Object Extraction Based on Semantic Segmentation and Label LossXiaoru Wang0https://orcid.org/0000-0001-7171-2783Peirong Xu1https://orcid.org/0000-0001-9080-5803Zhihong Yu2https://orcid.org/0000-0002-6379-1976Fu Li3https://orcid.org/0000-0001-8819-0547Beijing Key Laboratory of Network System and Network Culture, Beijing University of Posts and Telecommunications, Beijing, ChinaBeijing Key Laboratory of Network System and Network Culture, Beijing University of Posts and Telecommunications, Beijing, ChinaIntel China Research Center, Beijing, ChinaDepartment of Electrical and Computer Engineering, Portland State University, Portland, OR, USAObject extraction refers to the operation of obtaining an object area from an image based on a small amount of mark information given by users, which is a key step in image processing. In order to obtain a complete object profile, current methods usually require a large number of manual annotations, especially for objects with irregular contours. Since traditional algorithms rely on low-level pixel features without semantic information, and are based on obvious mathematical assumptions (ie, strong inductive bias), it is difficult to completely identify objects. At present, in order to improve the integrity of object extraction, semantic segmentation-based methods increase the complexity and latancy by adding more pre-processing and post-processing steps. In this paper, we propose a novel model named IOEBSS, which includes a fast binary plane pre-processing, an improved Deeplab v3+ semantic segmentation model, and an auxiliary loss function named Label Loss. Through the fast binary plane pre-processing, the model can accelerate the transformation of interactive inputs. The improved semantic segmentation model makes the extracted results more semantically complete, and Label Loss is more conducive to gradient flow and accelerates training convergence. For the above reasons, IOEBSS can accurately and quickly identify objects with complex contours and colors. On Pascal VOC and COCO datasets, compared to current methods, IOEBSS has a significant improvement in accuracy, inference speed, and convergence speed.https://ieeexplore.ieee.org/document/9108274/Label lossobject extractionsemantic segmentation |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Xiaoru Wang Peirong Xu Zhihong Yu Fu Li |
spellingShingle |
Xiaoru Wang Peirong Xu Zhihong Yu Fu Li Image Object Extraction Based on Semantic Segmentation and Label Loss IEEE Access Label loss object extraction semantic segmentation |
author_facet |
Xiaoru Wang Peirong Xu Zhihong Yu Fu Li |
author_sort |
Xiaoru Wang |
title |
Image Object Extraction Based on Semantic Segmentation and Label Loss |
title_short |
Image Object Extraction Based on Semantic Segmentation and Label Loss |
title_full |
Image Object Extraction Based on Semantic Segmentation and Label Loss |
title_fullStr |
Image Object Extraction Based on Semantic Segmentation and Label Loss |
title_full_unstemmed |
Image Object Extraction Based on Semantic Segmentation and Label Loss |
title_sort |
image object extraction based on semantic segmentation and label loss |
publisher |
IEEE |
series |
IEEE Access |
issn |
2169-3536 |
publishDate |
2020-01-01 |
description |
Object extraction refers to the operation of obtaining an object area from an image based on a small amount of mark information given by users, which is a key step in image processing. In order to obtain a complete object profile, current methods usually require a large number of manual annotations, especially for objects with irregular contours. Since traditional algorithms rely on low-level pixel features without semantic information, and are based on obvious mathematical assumptions (ie, strong inductive bias), it is difficult to completely identify objects. At present, in order to improve the integrity of object extraction, semantic segmentation-based methods increase the complexity and latancy by adding more pre-processing and post-processing steps. In this paper, we propose a novel model named IOEBSS, which includes a fast binary plane pre-processing, an improved Deeplab v3+ semantic segmentation model, and an auxiliary loss function named Label Loss. Through the fast binary plane pre-processing, the model can accelerate the transformation of interactive inputs. The improved semantic segmentation model makes the extracted results more semantically complete, and Label Loss is more conducive to gradient flow and accelerates training convergence. For the above reasons, IOEBSS can accurately and quickly identify objects with complex contours and colors. On Pascal VOC and COCO datasets, compared to current methods, IOEBSS has a significant improvement in accuracy, inference speed, and convergence speed. |
topic |
Label loss object extraction semantic segmentation |
url |
https://ieeexplore.ieee.org/document/9108274/ |
work_keys_str_mv |
AT xiaoruwang imageobjectextractionbasedonsemanticsegmentationandlabelloss AT peirongxu imageobjectextractionbasedonsemanticsegmentationandlabelloss AT zhihongyu imageobjectextractionbasedonsemanticsegmentationandlabelloss AT fuli imageobjectextractionbasedonsemanticsegmentationandlabelloss |
_version_ |
1724184828328280064 |