Large-Scale Place Recognition Based on Camera-LiDAR Fused Descriptor

In the field of autonomous driving, carriers are equipped with a variety of sensors, including cameras and LiDARs. However, the camera suffers from problems of illumination and occlusion, and the LiDAR encounters motion distortion, degenerate environment and limited ranging distance. Therefore, fusi...

Full description

Bibliographic Details
Main Authors: Shaorong Xie, Chao Pan, Yaxin Peng, Ke Liu, Shihui Ying
Format: Article
Language:English
Published: MDPI AG 2020-05-01
Series:Sensors
Subjects:
Online Access:https://www.mdpi.com/1424-8220/20/10/2870
id doaj-21f68cb2d9a4424c829d605694b319d9
record_format Article
spelling doaj-21f68cb2d9a4424c829d605694b319d92020-11-25T03:53:29ZengMDPI AGSensors1424-82202020-05-01202870287010.3390/s20102870Large-Scale Place Recognition Based on Camera-LiDAR Fused DescriptorShaorong Xie0Chao Pan1Yaxin Peng2Ke Liu3Shihui Ying4School of Computer Engineering and Science, Shanghai University, Shanghai 200444, ChinaSchool of Mechatronic Engineering and Automation, Shanghai University, Shanghai 200444, ChinaDepartment of Mathematics, School of Science, Shanghai University, Shanghai 200444, ChinaDepartment of Mathematics, School of Science, Shanghai University, Shanghai 200444, ChinaDepartment of Mathematics, School of Science, Shanghai University, Shanghai 200444, ChinaIn the field of autonomous driving, carriers are equipped with a variety of sensors, including cameras and LiDARs. However, the camera suffers from problems of illumination and occlusion, and the LiDAR encounters motion distortion, degenerate environment and limited ranging distance. Therefore, fusing the information from these two sensors deserves to be explored. In this paper, we propose a fusion network which robustly captures both the image and point cloud descriptors to solve the place recognition problem. Our contribution can be summarized as: (1) applying the trimmed strategy in the point cloud global feature aggregation to improve the recognition performance, (2) building a compact fusion framework which captures both the robust representation of the image and 3D point cloud, and (3) learning a proper metric to describe the similarity of our fused global feature. The experiments on KITTI and KAIST datasets show that the proposed fused descriptor is more robust and discriminative than the single sensor descriptor.https://www.mdpi.com/1424-8220/20/10/2870place recognitionretrievalsensor fusiondeep learning
collection DOAJ
language English
format Article
sources DOAJ
author Shaorong Xie
Chao Pan
Yaxin Peng
Ke Liu
Shihui Ying
spellingShingle Shaorong Xie
Chao Pan
Yaxin Peng
Ke Liu
Shihui Ying
Large-Scale Place Recognition Based on Camera-LiDAR Fused Descriptor
Sensors
place recognition
retrieval
sensor fusion
deep learning
author_facet Shaorong Xie
Chao Pan
Yaxin Peng
Ke Liu
Shihui Ying
author_sort Shaorong Xie
title Large-Scale Place Recognition Based on Camera-LiDAR Fused Descriptor
title_short Large-Scale Place Recognition Based on Camera-LiDAR Fused Descriptor
title_full Large-Scale Place Recognition Based on Camera-LiDAR Fused Descriptor
title_fullStr Large-Scale Place Recognition Based on Camera-LiDAR Fused Descriptor
title_full_unstemmed Large-Scale Place Recognition Based on Camera-LiDAR Fused Descriptor
title_sort large-scale place recognition based on camera-lidar fused descriptor
publisher MDPI AG
series Sensors
issn 1424-8220
publishDate 2020-05-01
description In the field of autonomous driving, carriers are equipped with a variety of sensors, including cameras and LiDARs. However, the camera suffers from problems of illumination and occlusion, and the LiDAR encounters motion distortion, degenerate environment and limited ranging distance. Therefore, fusing the information from these two sensors deserves to be explored. In this paper, we propose a fusion network which robustly captures both the image and point cloud descriptors to solve the place recognition problem. Our contribution can be summarized as: (1) applying the trimmed strategy in the point cloud global feature aggregation to improve the recognition performance, (2) building a compact fusion framework which captures both the robust representation of the image and 3D point cloud, and (3) learning a proper metric to describe the similarity of our fused global feature. The experiments on KITTI and KAIST datasets show that the proposed fused descriptor is more robust and discriminative than the single sensor descriptor.
topic place recognition
retrieval
sensor fusion
deep learning
url https://www.mdpi.com/1424-8220/20/10/2870
work_keys_str_mv AT shaorongxie largescaleplacerecognitionbasedoncameralidarfuseddescriptor
AT chaopan largescaleplacerecognitionbasedoncameralidarfuseddescriptor
AT yaxinpeng largescaleplacerecognitionbasedoncameralidarfuseddescriptor
AT keliu largescaleplacerecognitionbasedoncameralidarfuseddescriptor
AT shihuiying largescaleplacerecognitionbasedoncameralidarfuseddescriptor
_version_ 1724477722302873600