Double Mask R-CNN for Pedestrian Detection in a Crowd

Aiming at the difficulty of feature extraction and the limitation of NMS (nonmaximum suppression) in crowded pedestrian detection, a new detection network named Double Mask R-CNN based on Mask R-CNN with FPN (Feature Pyramid Network) is proposed in this article. The algorithm has two improvements: F...

Full description

Bibliographic Details
Main Authors: Liu, C. (Author), Wang, H. (Author)
Format: Article
Language:English
Published: Hindawi Limited 2022
Subjects:
Online Access:View Fulltext in Publisher
LEADER 02040nam a2200349Ia 4500
001 10.1155-2022-4012252
008 220425s2022 CNT 000 0 und d
020 |a 1574017X (ISSN) 
245 1 0 |a Double Mask R-CNN for Pedestrian Detection in a Crowd 
260 0 |b Hindawi Limited  |c 2022 
856 |z View Fulltext in Publisher  |u https://doi.org/10.1155/2022/4012252 
520 3 |a Aiming at the difficulty of feature extraction and the limitation of NMS (nonmaximum suppression) in crowded pedestrian detection, a new detection network named Double Mask R-CNN based on Mask R-CNN with FPN (Feature Pyramid Network) is proposed in this article. The algorithm has two improvements: Firstly, we add a semantic segmentation branch on the FPN to strengthen the feature extraction of crowded pedestrians; secondly, we design a rule to estimate the pedestrian visibility of detected image according to the human keypoints information, and this rule can cover binary mask on the image whose pedestrian visibility is less than a certain threshold. Then we input the masked image into the network to locate occluded pedestrians. Experimental results on the CrowdHuman dataset show that the log-average miss rate (MR) of Double Mask R-CNN is 13, 12% lower than the best results of other mainstream networks. Similar improvements on WiderPerson dataset are also achieved by the Double Mask R-CNN. © 2022 Congqiang Liu et al. 
650 0 4 |a Binary masks 
650 0 4 |a Detection networks 
650 0 4 |a Extraction 
650 0 4 |a Feature extraction 
650 0 4 |a Feature pyramid 
650 0 4 |a Features extraction 
650 0 4 |a Image enhancement 
650 0 4 |a Keypoints 
650 0 4 |a Miss-rate 
650 0 4 |a Non-maximum suppression 
650 0 4 |a Pedestrian detection 
650 0 4 |a Pyramid network 
650 0 4 |a Semantic segmentation 
650 0 4 |a Semantic Segmentation 
650 0 4 |a Semantics 
650 0 4 |a Visibility 
700 1 |a Liu, C.  |e author 
700 1 |a Liu, C.  |e author 
700 1 |a Wang, H.  |e author 
773 |t Mobile Information Systems