Combined Auxiliary Networks and Bird’s Eye View Method for Real-Time Multicategory Object Recognition
Object recognition based on LIDAR data is crucial in automotive driving and is the subject of extensive research. However, the lack of accuracy and stability in complex environments obstructs the practical application of real-time recognition algorithms. In this study, we proposed a new real-time ne...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Hindawi Limited
2021-01-01
|
Series: | Mathematical Problems in Engineering |
Online Access: | http://dx.doi.org/10.1155/2021/5585212 |
id |
doaj-eace0a9b941741f4ae0fcf08d6535450 |
---|---|
record_format |
Article |
spelling |
doaj-eace0a9b941741f4ae0fcf08d65354502021-03-22T00:03:30ZengHindawi LimitedMathematical Problems in Engineering1563-51472021-01-01202110.1155/2021/5585212Combined Auxiliary Networks and Bird’s Eye View Method for Real-Time Multicategory Object RecognitionZhangpeng Gong0Luansu Wei1Guoye Wang2Dongxin Xu3Chang Ge4College of EngineeringJiangsu Key Laboratory of Construction MaterialsCollege of EngineeringCollege of EngineeringCollege of EngineeringObject recognition based on LIDAR data is crucial in automotive driving and is the subject of extensive research. However, the lack of accuracy and stability in complex environments obstructs the practical application of real-time recognition algorithms. In this study, we proposed a new real-time network for multicategory object recognition. The manually extracted bird’s eye view (BEV) features were adopted to replace the resource-consuming 3D convolutional operation. Besides the subject network, we designed two auxiliary networks to help the network learn the pointwise features and boxwise features, aiming to improve the category and bounding boxes’ accuracy. The KITTI dataset was adopted to train and validate the proposed network. Experimental results showed that, for hard mode, the total average precision (AP) of the category reached 97.4%. For an intersection over a union threshold of 0.5 and 0.7, the total AP of regression reached 93.2% and 85.5%; especially, the AP of car’s regression reached 95.7% and 92.2%. The proposed network also showed consistent performance in the Apollo dataset with a processing duration of 37 ms. The proposed network exhibits stable and robust object recognition performance in complex environments (multiobject, unordered objects, and multicategory). And it shows sensitivity to occlusion of the LIDAR system and insensitivity to close large objects. The proposed multifunction method simultaneously achieves real-time operation, high accuracy, and stable performance, indicating its great potential value in practical application.http://dx.doi.org/10.1155/2021/5585212 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Zhangpeng Gong Luansu Wei Guoye Wang Dongxin Xu Chang Ge |
spellingShingle |
Zhangpeng Gong Luansu Wei Guoye Wang Dongxin Xu Chang Ge Combined Auxiliary Networks and Bird’s Eye View Method for Real-Time Multicategory Object Recognition Mathematical Problems in Engineering |
author_facet |
Zhangpeng Gong Luansu Wei Guoye Wang Dongxin Xu Chang Ge |
author_sort |
Zhangpeng Gong |
title |
Combined Auxiliary Networks and Bird’s Eye View Method for Real-Time Multicategory Object Recognition |
title_short |
Combined Auxiliary Networks and Bird’s Eye View Method for Real-Time Multicategory Object Recognition |
title_full |
Combined Auxiliary Networks and Bird’s Eye View Method for Real-Time Multicategory Object Recognition |
title_fullStr |
Combined Auxiliary Networks and Bird’s Eye View Method for Real-Time Multicategory Object Recognition |
title_full_unstemmed |
Combined Auxiliary Networks and Bird’s Eye View Method for Real-Time Multicategory Object Recognition |
title_sort |
combined auxiliary networks and bird’s eye view method for real-time multicategory object recognition |
publisher |
Hindawi Limited |
series |
Mathematical Problems in Engineering |
issn |
1563-5147 |
publishDate |
2021-01-01 |
description |
Object recognition based on LIDAR data is crucial in automotive driving and is the subject of extensive research. However, the lack of accuracy and stability in complex environments obstructs the practical application of real-time recognition algorithms. In this study, we proposed a new real-time network for multicategory object recognition. The manually extracted bird’s eye view (BEV) features were adopted to replace the resource-consuming 3D convolutional operation. Besides the subject network, we designed two auxiliary networks to help the network learn the pointwise features and boxwise features, aiming to improve the category and bounding boxes’ accuracy. The KITTI dataset was adopted to train and validate the proposed network. Experimental results showed that, for hard mode, the total average precision (AP) of the category reached 97.4%. For an intersection over a union threshold of 0.5 and 0.7, the total AP of regression reached 93.2% and 85.5%; especially, the AP of car’s regression reached 95.7% and 92.2%. The proposed network also showed consistent performance in the Apollo dataset with a processing duration of 37 ms. The proposed network exhibits stable and robust object recognition performance in complex environments (multiobject, unordered objects, and multicategory). And it shows sensitivity to occlusion of the LIDAR system and insensitivity to close large objects. The proposed multifunction method simultaneously achieves real-time operation, high accuracy, and stable performance, indicating its great potential value in practical application. |
url |
http://dx.doi.org/10.1155/2021/5585212 |
work_keys_str_mv |
AT zhangpenggong combinedauxiliarynetworksandbirdseyeviewmethodforrealtimemulticategoryobjectrecognition AT luansuwei combinedauxiliarynetworksandbirdseyeviewmethodforrealtimemulticategoryobjectrecognition AT guoyewang combinedauxiliarynetworksandbirdseyeviewmethodforrealtimemulticategoryobjectrecognition AT dongxinxu combinedauxiliarynetworksandbirdseyeviewmethodforrealtimemulticategoryobjectrecognition AT changge combinedauxiliarynetworksandbirdseyeviewmethodforrealtimemulticategoryobjectrecognition |
_version_ |
1714772826611777536 |