Semantic 3D Reconstruction with Learning MVS and 2D Segmentation of Aerial Images

Semantic modeling is a challenging task that has received widespread attention in recent years. With the help of mini Unmanned Aerial Vehicles (UAVs), multi-view high-resolution aerial images of large-scale scenes can be conveniently collected. In this paper, we propose a semantic Multi-View Stereo...

Full description

Bibliographic Details
Main Authors:	Zizhuang Wei, Yao Wang, Hongwei Yi, Yisong Chen, Guoping Wang
Format:	Article
Language:	English
Published:	MDPI AG 2020-02-01
Series:	Applied Sciences
Subjects:	semantic 3d reconstruction deep learning multi-view stereo probabilistic fusion graph-based refinement
Online Access:	https://www.mdpi.com/2076-3417/10/4/1275

id	doaj-c7502f319f1c4525a1040f6d8d90cc75
record_format	Article
spelling	doaj-c7502f319f1c4525a1040f6d8d90cc752020-11-25T02:11:40ZengMDPI AGApplied Sciences2076-34172020-02-01104127510.3390/app10041275app10041275Semantic 3D Reconstruction with Learning MVS and 2D Segmentation of Aerial ImagesZizhuang Wei0Yao Wang1Hongwei Yi2Yisong Chen3Guoping Wang4Graphics & Interaction Lab, School of Electronics Engineering and Computer Sciences, Peking University, Beijing 100871, ChinaGraphics & Interaction Lab, School of Electronics Engineering and Computer Sciences, Peking University, Beijing 100871, ChinaGraphics & Interaction Lab, School of Electronics Engineering and Computer Sciences, Peking University, Beijing 100871, ChinaGraphics & Interaction Lab, School of Electronics Engineering and Computer Sciences, Peking University, Beijing 100871, ChinaGraphics & Interaction Lab, School of Electronics Engineering and Computer Sciences, Peking University, Beijing 100871, ChinaSemantic modeling is a challenging task that has received widespread attention in recent years. With the help of mini Unmanned Aerial Vehicles (UAVs), multi-view high-resolution aerial images of large-scale scenes can be conveniently collected. In this paper, we propose a semantic Multi-View Stereo (MVS) method to reconstruct 3D semantic models from 2D images. Firstly, 2D semantic probability distribution is obtained by Convolutional Neural Network (CNN). Secondly, the calibrated cameras poses are determined by Structure from Motion (SfM), while the depth maps are estimated by learning MVS. Combining 2D segmentation and 3D geometry information, dense point clouds with semantic labels are generated by a probability-based semantic fusion method. In the final stage, the coarse 3D semantic point cloud is optimized by both local and global refinements. By making full use of the multi-view consistency, the proposed method efficiently produces a fine-level 3D semantic point cloud. The experimental result evaluated by re-projection maps achieves 88.4% Pixel Accuracy on the Urban Drone Dataset (UDD). In conclusion, our graph-based semantic fusion procedure and refinement based on local and global information can suppress and reduce the re-projection error.https://www.mdpi.com/2076-3417/10/4/1275semantic 3d reconstructiondeep learningmulti-view stereoprobabilistic fusiongraph-based refinement
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Zizhuang Wei Yao Wang Hongwei Yi Yisong Chen Guoping Wang
spellingShingle	Zizhuang Wei Yao Wang Hongwei Yi Yisong Chen Guoping Wang Semantic 3D Reconstruction with Learning MVS and 2D Segmentation of Aerial Images Applied Sciences semantic 3d reconstruction deep learning multi-view stereo probabilistic fusion graph-based refinement
author_facet	Zizhuang Wei Yao Wang Hongwei Yi Yisong Chen Guoping Wang
author_sort	Zizhuang Wei
title	Semantic 3D Reconstruction with Learning MVS and 2D Segmentation of Aerial Images
title_short	Semantic 3D Reconstruction with Learning MVS and 2D Segmentation of Aerial Images
title_full	Semantic 3D Reconstruction with Learning MVS and 2D Segmentation of Aerial Images
title_fullStr	Semantic 3D Reconstruction with Learning MVS and 2D Segmentation of Aerial Images
title_full_unstemmed	Semantic 3D Reconstruction with Learning MVS and 2D Segmentation of Aerial Images
title_sort	semantic 3d reconstruction with learning mvs and 2d segmentation of aerial images
publisher	MDPI AG
series	Applied Sciences
issn	2076-3417
publishDate	2020-02-01
description	Semantic modeling is a challenging task that has received widespread attention in recent years. With the help of mini Unmanned Aerial Vehicles (UAVs), multi-view high-resolution aerial images of large-scale scenes can be conveniently collected. In this paper, we propose a semantic Multi-View Stereo (MVS) method to reconstruct 3D semantic models from 2D images. Firstly, 2D semantic probability distribution is obtained by Convolutional Neural Network (CNN). Secondly, the calibrated cameras poses are determined by Structure from Motion (SfM), while the depth maps are estimated by learning MVS. Combining 2D segmentation and 3D geometry information, dense point clouds with semantic labels are generated by a probability-based semantic fusion method. In the final stage, the coarse 3D semantic point cloud is optimized by both local and global refinements. By making full use of the multi-view consistency, the proposed method efficiently produces a fine-level 3D semantic point cloud. The experimental result evaluated by re-projection maps achieves 88.4% Pixel Accuracy on the Urban Drone Dataset (UDD). In conclusion, our graph-based semantic fusion procedure and refinement based on local and global information can suppress and reduce the re-projection error.
topic	semantic 3d reconstruction deep learning multi-view stereo probabilistic fusion graph-based refinement
url	https://www.mdpi.com/2076-3417/10/4/1275
work_keys_str_mv	AT zizhuangwei semantic3dreconstructionwithlearningmvsand2dsegmentationofaerialimages AT yaowang semantic3dreconstructionwithlearningmvsand2dsegmentationofaerialimages AT hongweiyi semantic3dreconstructionwithlearningmvsand2dsegmentationofaerialimages AT yisongchen semantic3dreconstructionwithlearningmvsand2dsegmentationofaerialimages AT guopingwang semantic3dreconstructionwithlearningmvsand2dsegmentationofaerialimages
_version_	1724913411336175616

Semantic 3D Reconstruction with Learning MVS and 2D Segmentation of Aerial Images

Similar Items