Combined Auxiliary Networks and Bird’s Eye View Method for Real-Time Multicategory Object Recognition

Object recognition based on LIDAR data is crucial in automotive driving and is the subject of extensive research. However, the lack of accuracy and stability in complex environments obstructs the practical application of real-time recognition algorithms. In this study, we proposed a new real-time ne...

Full description

Bibliographic Details
Main Authors: Zhangpeng Gong, Luansu Wei, Guoye Wang, Dongxin Xu, Chang Ge
Format: Article
Language:English
Published: Hindawi Limited 2021-01-01
Series:Mathematical Problems in Engineering
Online Access:http://dx.doi.org/10.1155/2021/5585212
id doaj-eace0a9b941741f4ae0fcf08d6535450
record_format Article
spelling doaj-eace0a9b941741f4ae0fcf08d65354502021-03-22T00:03:30ZengHindawi LimitedMathematical Problems in Engineering1563-51472021-01-01202110.1155/2021/5585212Combined Auxiliary Networks and Bird’s Eye View Method for Real-Time Multicategory Object RecognitionZhangpeng Gong0Luansu Wei1Guoye Wang2Dongxin Xu3Chang Ge4College of EngineeringJiangsu Key Laboratory of Construction MaterialsCollege of EngineeringCollege of EngineeringCollege of EngineeringObject recognition based on LIDAR data is crucial in automotive driving and is the subject of extensive research. However, the lack of accuracy and stability in complex environments obstructs the practical application of real-time recognition algorithms. In this study, we proposed a new real-time network for multicategory object recognition. The manually extracted bird’s eye view (BEV) features were adopted to replace the resource-consuming 3D convolutional operation. Besides the subject network, we designed two auxiliary networks to help the network learn the pointwise features and boxwise features, aiming to improve the category and bounding boxes’ accuracy. The KITTI dataset was adopted to train and validate the proposed network. Experimental results showed that, for hard mode, the total average precision (AP) of the category reached 97.4%. For an intersection over a union threshold of 0.5 and 0.7, the total AP of regression reached 93.2% and 85.5%; especially, the AP of car’s regression reached 95.7% and 92.2%. The proposed network also showed consistent performance in the Apollo dataset with a processing duration of 37 ms. The proposed network exhibits stable and robust object recognition performance in complex environments (multiobject, unordered objects, and multicategory). And it shows sensitivity to occlusion of the LIDAR system and insensitivity to close large objects. The proposed multifunction method simultaneously achieves real-time operation, high accuracy, and stable performance, indicating its great potential value in practical application.http://dx.doi.org/10.1155/2021/5585212
collection DOAJ
language English
format Article
sources DOAJ
author Zhangpeng Gong
Luansu Wei
Guoye Wang
Dongxin Xu
Chang Ge
spellingShingle Zhangpeng Gong
Luansu Wei
Guoye Wang
Dongxin Xu
Chang Ge
Combined Auxiliary Networks and Bird’s Eye View Method for Real-Time Multicategory Object Recognition
Mathematical Problems in Engineering
author_facet Zhangpeng Gong
Luansu Wei
Guoye Wang
Dongxin Xu
Chang Ge
author_sort Zhangpeng Gong
title Combined Auxiliary Networks and Bird’s Eye View Method for Real-Time Multicategory Object Recognition
title_short Combined Auxiliary Networks and Bird’s Eye View Method for Real-Time Multicategory Object Recognition
title_full Combined Auxiliary Networks and Bird’s Eye View Method for Real-Time Multicategory Object Recognition
title_fullStr Combined Auxiliary Networks and Bird’s Eye View Method for Real-Time Multicategory Object Recognition
title_full_unstemmed Combined Auxiliary Networks and Bird’s Eye View Method for Real-Time Multicategory Object Recognition
title_sort combined auxiliary networks and bird’s eye view method for real-time multicategory object recognition
publisher Hindawi Limited
series Mathematical Problems in Engineering
issn 1563-5147
publishDate 2021-01-01
description Object recognition based on LIDAR data is crucial in automotive driving and is the subject of extensive research. However, the lack of accuracy and stability in complex environments obstructs the practical application of real-time recognition algorithms. In this study, we proposed a new real-time network for multicategory object recognition. The manually extracted bird’s eye view (BEV) features were adopted to replace the resource-consuming 3D convolutional operation. Besides the subject network, we designed two auxiliary networks to help the network learn the pointwise features and boxwise features, aiming to improve the category and bounding boxes’ accuracy. The KITTI dataset was adopted to train and validate the proposed network. Experimental results showed that, for hard mode, the total average precision (AP) of the category reached 97.4%. For an intersection over a union threshold of 0.5 and 0.7, the total AP of regression reached 93.2% and 85.5%; especially, the AP of car’s regression reached 95.7% and 92.2%. The proposed network also showed consistent performance in the Apollo dataset with a processing duration of 37 ms. The proposed network exhibits stable and robust object recognition performance in complex environments (multiobject, unordered objects, and multicategory). And it shows sensitivity to occlusion of the LIDAR system and insensitivity to close large objects. The proposed multifunction method simultaneously achieves real-time operation, high accuracy, and stable performance, indicating its great potential value in practical application.
url http://dx.doi.org/10.1155/2021/5585212
work_keys_str_mv AT zhangpenggong combinedauxiliarynetworksandbirdseyeviewmethodforrealtimemulticategoryobjectrecognition
AT luansuwei combinedauxiliarynetworksandbirdseyeviewmethodforrealtimemulticategoryobjectrecognition
AT guoyewang combinedauxiliarynetworksandbirdseyeviewmethodforrealtimemulticategoryobjectrecognition
AT dongxinxu combinedauxiliarynetworksandbirdseyeviewmethodforrealtimemulticategoryobjectrecognition
AT changge combinedauxiliarynetworksandbirdseyeviewmethodforrealtimemulticategoryobjectrecognition
_version_ 1714772826611777536