DenseU-Net-Based Semantic Segmentation of Small Objects in Urban Remote Sensing Images

Class imbalance is a serious problem that plagues the semantic segmentation task in urban remote sensing images. Since large object classes dominate the segmentation task, small object classes are usually suppressed, so the solutions based on optimizing the overall accuracy are often unsatisfactory....

Bibliographic Details
Main Authors:	Rongsheng Dong, Xiaoquan Pan, Fengying Li
Format:	Article
Language:	English
Published:	IEEE 2019-01-01
Series:	IEEE Access
Subjects:	Class imbalance; deep convolutional neural networks; median frequency balancing; semantic segmentation; urban remote sensing images
Online Access:	https://ieeexplore.ieee.org/document/8718619/

id	doaj-02af183612034fc0a652d9a9cc9649cd
record_format	Article
spelling	doaj-02af183612034fc0a652d9a9cc9649cd | 2021-03-30T00:14:59Z | eng | IEEE | IEEE Access | 2169-3536 | 2019-01-01 | 7 | 65347 | 65356 | 10.1109/ACCESS.2019.2917952 | 8718619 | DenseU-Net-Based Semantic Segmentation of Small Objects in Urban Remote Sensing Images | Rongsheng Dong | 0 | https://orcid.org/0000-0002-0540-4659 | Xiaoquan Pan | 1 | Fengying Li | 2 | https://orcid.org/0000-0002-8531-0125 | Guangxi Key Laboratory of Trusted Software, Guilin University of Electronic Technology, Guilin, China | Guangxi Key Laboratory of Trusted Software, Guilin University of Electronic Technology, Guilin, China | Guangxi Key Laboratory of Trusted Software, Guilin University of Electronic Technology, Guilin, China | Class imbalance is a serious problem that plagues the semantic segmentation task in urban remote sensing images. Since large object classes dominate the segmentation task, small object classes are usually suppressed, so the solutions based on optimizing the overall accuracy are often unsatisfactory. In the light of the class imbalance of the semantic segmentation in urban remote sensing images, we developed the concept of the Down-sampling Block (DownBlock) for obtaining context information and the Up-sampling Block (UpBlock) for restoring the original resolution. We proposed an end-to-end deep convolutional neural network (DenseU-Net) architecture for pixel-wise urban remote sensing image segmentation. The main idea of the DenseU-Net is to connect convolutional neural network features through cascade operations and use its symmetrical structure to fuse the detail features in shallow layers and the abstract semantic features in deep layers. A focal loss function weighted by the median frequency balancing (MFB_Focal_loss) is proposed; the accuracy of the small object classes and the overall accuracy are improved effectively with our approach. Our experiments were based on the 2016 ISPRS Vaihingen 2D semantic labeling dataset and demonstrated the following outcomes. In the case where boundary pixels were considered (GT), MFB_Focal_loss achieved a good overall segmentation performance using the same U-Net model, and the F1-score of the small object class “car” was improved by 9.28% compared with the cross-entropy loss function. Using the same MFB_Focal_loss loss function, the overall accuracy of the DenseU-Net was better than that of U-Net, where the F1-score of the “car” class was 6.71% higher. Finally, without any post-processing, the DenseU-Net+MFB_Focal_loss achieved the overall accuracy of 85.63%, and the F1-score of the “car” class was 83.23%, which is superior to HSN+OI+WBP both numerically and visually. | https://ieeexplore.ieee.org/document/8718619/ | Class imbalance | deep convolutional neural networks | median frequency balancing | semantic segmentation | urban remote sensing images
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Rongsheng Dong, Xiaoquan Pan, Fengying Li
spellingShingle	Rongsheng Dong | Xiaoquan Pan | Fengying Li | DenseU-Net-Based Semantic Segmentation of Small Objects in Urban Remote Sensing Images | IEEE Access | Class imbalance | deep convolutional neural networks | median frequency balancing | semantic segmentation | urban remote sensing images
author_facet	Rongsheng Dong, Xiaoquan Pan, Fengying Li
author_sort	Rongsheng Dong
title	DenseU-Net-Based Semantic Segmentation of Small Objects in Urban Remote Sensing Images
title_sort	denseu-net-based semantic segmentation of small objects in urban remote sensing images
publisher	IEEE
series	IEEE Access
issn	2169-3536
publishDate	2019-01-01
description	Class imbalance is a serious problem that plagues the semantic segmentation task in urban remote sensing images. Since large object classes dominate the segmentation task, small object classes are usually suppressed, so the solutions based on optimizing the overall accuracy are often unsatisfactory. In the light of the class imbalance of the semantic segmentation in urban remote sensing images, we developed the concept of the Down-sampling Block (DownBlock) for obtaining context information and the Up-sampling Block (UpBlock) for restoring the original resolution. We proposed an end-to-end deep convolutional neural network (DenseU-Net) architecture for pixel-wise urban remote sensing image segmentation. The main idea of the DenseU-Net is to connect convolutional neural network features through cascade operations and use its symmetrical structure to fuse the detail features in shallow layers and the abstract semantic features in deep layers. A focal loss function weighted by the median frequency balancing (MFB_Focal_loss) is proposed; the accuracy of the small object classes and the overall accuracy are improved effectively with our approach. Our experiments were based on the 2016 ISPRS Vaihingen 2D semantic labeling dataset and demonstrated the following outcomes. In the case where boundary pixels were considered (GT), MFB_Focal_loss achieved a good overall segmentation performance using the same U-Net model, and the F1-score of the small object class “car” was improved by 9.28% compared with the cross-entropy loss function. Using the same MFB_Focal_loss loss function, the overall accuracy of the DenseU-Net was better than that of U-Net, where the F1-score of the “car” class was 6.71% higher. Finally, without any post-processing, the DenseU-Net+MFB_Focal_loss achieved the overall accuracy of 85.63%, and the F1-score of the “car” class was 83.23%, which is superior to HSN+OI+WBP both numerically and visually.
topic	Class imbalance; deep convolutional neural networks; median frequency balancing; semantic segmentation; urban remote sensing images
url	https://ieeexplore.ieee.org/document/8718619/
work_keys_str_mv	AT rongshengdong denseunetbasedsemanticsegmentationofsmallobjectsinurbanremotesensingimages AT xiaoquanpan denseunetbasedsemanticsegmentationofsmallobjectsinurbanremotesensingimages AT fengyingli denseunetbasedsemanticsegmentationofsmallobjectsinurbanremotesensingimages
_version_	1724188403367411712
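
The abstract describes a focal loss weighted by median frequency balancing but does not give its formula. The following is a minimal sketch under stated assumptions, not the authors' implementation: class frequencies are taken as simple pixel-count fractions over the label set, each class weight is the median frequency divided by that class's frequency, and the focal term is the standard (1 - p_t)^gamma factor. The function names, the `gamma` default, and the `eps` clamp are hypothetical choices made for illustration.

```python
import math
from collections import Counter
from statistics import median


def median_frequency_weights(labels, num_classes):
    """Return one weight per class: median(freq) / freq[c].

    Assumption: freq[c] is the fraction of all labeled pixels in class c.
    Classes that never occur get weight 0.0.
    """
    counts = Counter(labels)
    total = len(labels)
    freq = [counts.get(c, 0) / total for c in range(num_classes)]
    med = median(f for f in freq if f > 0)
    return [med / f if f > 0 else 0.0 for f in freq]


def mfb_focal_loss(probs, labels, weights, gamma=2.0, eps=1e-7):
    """Mean class-weighted focal loss over a flat list of pixels.

    probs  : list of per-pixel class probability lists (e.g. softmax output)
    labels : list of integer class ids, one per pixel
    weights: per-class weights, e.g. from median_frequency_weights
    """
    total = 0.0
    for p, y in zip(probs, labels):
        p_t = min(max(p[y], eps), 1.0)  # clamp to avoid log(0)
        total += -weights[y] * (1.0 - p_t) ** gamma * math.log(p_t)
    return total / len(labels)


# Toy example: class 1 (think "car") covers 2 of 10 pixels.
labels = [0] * 8 + [1] * 2
weights = median_frequency_weights(labels, num_classes=2)
probs = [[0.5, 0.5] for _ in labels]
loss = mfb_focal_loss(probs, labels, weights)
```

In this toy case the rare class receives a weight four times that of the frequent class, so misclassified pixels of the small class contribute more to the loss. Setting `gamma=0.0` removes the focal factor and leaves a class-weighted cross-entropy.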

Similar Items