Building Extraction Based on an Optimized Stacked Sparse Autoencoder of Structure and Training Samples Using LIDAR DSM and Optical Images

In this paper, a building extraction method is proposed based on a stacked sparse autoencoder with an optimized structure and training samples. Building extraction plays an important role in urban construction and planning. However, some negative effects will reduce the accuracy of extraction, such...

Full description

Bibliographic Details
Main Authors: Yiming Yan, Zhichao Tan, Nan Su, Chunhui Zhao
Format: Article
Language:English
Published: MDPI AG 2017-08-01
Series:Sensors
Subjects:
Online Access:https://www.mdpi.com/1424-8220/17/9/1957
id doaj-5e8c5271a5464ae980a093d0e18afb0e
record_format Article
spelling doaj-5e8c5271a5464ae980a093d0e18afb0e2020-11-24T23:08:34ZengMDPI AGSensors1424-82202017-08-01179195710.3390/s17091957s17091957Building Extraction Based on an Optimized Stacked Sparse Autoencoder of Structure and Training Samples Using LIDAR DSM and Optical ImagesYiming Yan0Zhichao Tan1Nan Su2Chunhui Zhao3Department of information engineering, Harbin Engineering University, Harbin 150001, ChinaDepartment of information engineering, Harbin Engineering University, Harbin 150001, ChinaDepartment of information engineering, Harbin Engineering University, Harbin 150001, ChinaDepartment of information engineering, Harbin Engineering University, Harbin 150001, ChinaIn this paper, a building extraction method is proposed based on a stacked sparse autoencoder with an optimized structure and training samples. Building extraction plays an important role in urban construction and planning. However, some negative effects will reduce the accuracy of extraction, such as exceeding resolution, bad correction and terrain influence. Data collected by multiple sensors, as light detection and ranging (LIDAR), optical sensor etc., are used to improve the extraction. Using digital surface model (DSM) obtained from LIDAR data and optical images, traditional method can improve the extraction effect to a certain extent, but there are some defects in feature extraction. Since stacked sparse autoencoder (SSAE) neural network can learn the essential characteristics of the data in depth, SSAE was employed to extract buildings from the combined DSM data and optical image. A better setting strategy of SSAE network structure is given, and an idea of setting the number and proportion of training samples for better training of SSAE was presented. The optical data and DSM were combined as input of the optimized SSAE, and after training by an optimized samples, the appropriate network structure can extract buildings with great accuracy and has good robustness.https://www.mdpi.com/1424-8220/17/9/1957stacked sparse autoencoderLIDAR DSMremote sensing imagebuilding extraction
collection DOAJ
language English
format Article
sources DOAJ
author Yiming Yan
Zhichao Tan
Nan Su
Chunhui Zhao
spellingShingle Yiming Yan
Zhichao Tan
Nan Su
Chunhui Zhao
Building Extraction Based on an Optimized Stacked Sparse Autoencoder of Structure and Training Samples Using LIDAR DSM and Optical Images
Sensors
stacked sparse autoencoder
LIDAR DSM
remote sensing image
building extraction
author_facet Yiming Yan
Zhichao Tan
Nan Su
Chunhui Zhao
author_sort Yiming Yan
title Building Extraction Based on an Optimized Stacked Sparse Autoencoder of Structure and Training Samples Using LIDAR DSM and Optical Images
title_short Building Extraction Based on an Optimized Stacked Sparse Autoencoder of Structure and Training Samples Using LIDAR DSM and Optical Images
title_full Building Extraction Based on an Optimized Stacked Sparse Autoencoder of Structure and Training Samples Using LIDAR DSM and Optical Images
title_fullStr Building Extraction Based on an Optimized Stacked Sparse Autoencoder of Structure and Training Samples Using LIDAR DSM and Optical Images
title_full_unstemmed Building Extraction Based on an Optimized Stacked Sparse Autoencoder of Structure and Training Samples Using LIDAR DSM and Optical Images
title_sort building extraction based on an optimized stacked sparse autoencoder of structure and training samples using lidar dsm and optical images
publisher MDPI AG
series Sensors
issn 1424-8220
publishDate 2017-08-01
description In this paper, a building extraction method is proposed based on a stacked sparse autoencoder with an optimized structure and training samples. Building extraction plays an important role in urban construction and planning. However, some negative effects will reduce the accuracy of extraction, such as exceeding resolution, bad correction and terrain influence. Data collected by multiple sensors, as light detection and ranging (LIDAR), optical sensor etc., are used to improve the extraction. Using digital surface model (DSM) obtained from LIDAR data and optical images, traditional method can improve the extraction effect to a certain extent, but there are some defects in feature extraction. Since stacked sparse autoencoder (SSAE) neural network can learn the essential characteristics of the data in depth, SSAE was employed to extract buildings from the combined DSM data and optical image. A better setting strategy of SSAE network structure is given, and an idea of setting the number and proportion of training samples for better training of SSAE was presented. The optical data and DSM were combined as input of the optimized SSAE, and after training by an optimized samples, the appropriate network structure can extract buildings with great accuracy and has good robustness.
topic stacked sparse autoencoder
LIDAR DSM
remote sensing image
building extraction
url https://www.mdpi.com/1424-8220/17/9/1957
work_keys_str_mv AT yimingyan buildingextractionbasedonanoptimizedstackedsparseautoencoderofstructureandtrainingsamplesusinglidardsmandopticalimages
AT zhichaotan buildingextractionbasedonanoptimizedstackedsparseautoencoderofstructureandtrainingsamplesusinglidardsmandopticalimages
AT nansu buildingextractionbasedonanoptimizedstackedsparseautoencoderofstructureandtrainingsamplesusinglidardsmandopticalimages
AT chunhuizhao buildingextractionbasedonanoptimizedstackedsparseautoencoderofstructureandtrainingsamplesusinglidardsmandopticalimages
_version_ 1725613578631774208