Image Object Extraction Based on Semantic Segmentation and Label Loss

Object extraction refers to the operation of obtaining an object area from an image based on a small amount of mark information given by users, which is a key step in image processing. In order to obtain a complete object profile, current methods usually require a large number of manual annotations,...

Full description

Bibliographic Details
Main Authors: Xiaoru Wang, Peirong Xu, Zhihong Yu, Fu Li
Format: Article
Language:English
Published: IEEE 2020-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9108274/
id doaj-4c31f6c9ca324357a08a6647843646b5
record_format Article
spelling doaj-4c31f6c9ca324357a08a6647843646b52021-03-30T02:35:58ZengIEEEIEEE Access2169-35362020-01-01810932510933410.1109/ACCESS.2020.29999429108274Image Object Extraction Based on Semantic Segmentation and Label LossXiaoru Wang0https://orcid.org/0000-0001-7171-2783Peirong Xu1https://orcid.org/0000-0001-9080-5803Zhihong Yu2https://orcid.org/0000-0002-6379-1976Fu Li3https://orcid.org/0000-0001-8819-0547Beijing Key Laboratory of Network System and Network Culture, Beijing University of Posts and Telecommunications, Beijing, ChinaBeijing Key Laboratory of Network System and Network Culture, Beijing University of Posts and Telecommunications, Beijing, ChinaIntel China Research Center, Beijing, ChinaDepartment of Electrical and Computer Engineering, Portland State University, Portland, OR, USAObject extraction refers to the operation of obtaining an object area from an image based on a small amount of mark information given by users, which is a key step in image processing. In order to obtain a complete object profile, current methods usually require a large number of manual annotations, especially for objects with irregular contours. Since traditional algorithms rely on low-level pixel features without semantic information, and are based on obvious mathematical assumptions (ie, strong inductive bias), it is difficult to completely identify objects. At present, in order to improve the integrity of object extraction, semantic segmentation-based methods increase the complexity and latancy by adding more pre-processing and post-processing steps. In this paper, we propose a novel model named IOEBSS, which includes a fast binary plane pre-processing, an improved Deeplab v3+ semantic segmentation model, and an auxiliary loss function named Label Loss. Through the fast binary plane pre-processing, the model can accelerate the transformation of interactive inputs. The improved semantic segmentation model makes the extracted results more semantically complete, and Label Loss is more conducive to gradient flow and accelerates training convergence. For the above reasons, IOEBSS can accurately and quickly identify objects with complex contours and colors. On Pascal VOC and COCO datasets, compared to current methods, IOEBSS has a significant improvement in accuracy, inference speed, and convergence speed.https://ieeexplore.ieee.org/document/9108274/Label lossobject extractionsemantic segmentation
collection DOAJ
language English
format Article
sources DOAJ
author Xiaoru Wang
Peirong Xu
Zhihong Yu
Fu Li
spellingShingle Xiaoru Wang
Peirong Xu
Zhihong Yu
Fu Li
Image Object Extraction Based on Semantic Segmentation and Label Loss
IEEE Access
Label loss
object extraction
semantic segmentation
author_facet Xiaoru Wang
Peirong Xu
Zhihong Yu
Fu Li
author_sort Xiaoru Wang
title Image Object Extraction Based on Semantic Segmentation and Label Loss
title_short Image Object Extraction Based on Semantic Segmentation and Label Loss
title_full Image Object Extraction Based on Semantic Segmentation and Label Loss
title_fullStr Image Object Extraction Based on Semantic Segmentation and Label Loss
title_full_unstemmed Image Object Extraction Based on Semantic Segmentation and Label Loss
title_sort image object extraction based on semantic segmentation and label loss
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2020-01-01
description Object extraction refers to the operation of obtaining an object area from an image based on a small amount of mark information given by users, which is a key step in image processing. In order to obtain a complete object profile, current methods usually require a large number of manual annotations, especially for objects with irregular contours. Since traditional algorithms rely on low-level pixel features without semantic information, and are based on obvious mathematical assumptions (ie, strong inductive bias), it is difficult to completely identify objects. At present, in order to improve the integrity of object extraction, semantic segmentation-based methods increase the complexity and latancy by adding more pre-processing and post-processing steps. In this paper, we propose a novel model named IOEBSS, which includes a fast binary plane pre-processing, an improved Deeplab v3+ semantic segmentation model, and an auxiliary loss function named Label Loss. Through the fast binary plane pre-processing, the model can accelerate the transformation of interactive inputs. The improved semantic segmentation model makes the extracted results more semantically complete, and Label Loss is more conducive to gradient flow and accelerates training convergence. For the above reasons, IOEBSS can accurately and quickly identify objects with complex contours and colors. On Pascal VOC and COCO datasets, compared to current methods, IOEBSS has a significant improvement in accuracy, inference speed, and convergence speed.
topic Label loss
object extraction
semantic segmentation
url https://ieeexplore.ieee.org/document/9108274/
work_keys_str_mv AT xiaoruwang imageobjectextractionbasedonsemanticsegmentationandlabelloss
AT peirongxu imageobjectextractionbasedonsemanticsegmentationandlabelloss
AT zhihongyu imageobjectextractionbasedonsemanticsegmentationandlabelloss
AT fuli imageobjectextractionbasedonsemanticsegmentationandlabelloss
_version_ 1724184828328280064