Multi-Scale and Occlusion Aware Network for Vehicle Detection and Segmentation on UAV Aerial Images

With the advantage of high maneuverability, Unmanned Aerial Vehicles (UAVs) have been widely deployed in vehicle monitoring and controlling. However, processing the images captured by UAV for the extracting vehicle information is hindered by some challenges including arbitrary orientations, huge sca...

Full description

Bibliographic Details
Main Authors: Wang Zhang, Chunsheng Liu, Faliang Chang, Ye Song
Format: Article
Language:English
Published: MDPI AG 2020-05-01
Series:Remote Sensing
Subjects:
Online Access:https://www.mdpi.com/2072-4292/12/11/1760
id doaj-47d4a902d7804e199ea393fb7149b11f
record_format Article
spelling doaj-47d4a902d7804e199ea393fb7149b11f2020-11-25T03:18:18ZengMDPI AGRemote Sensing2072-42922020-05-01121760176010.3390/rs12111760Multi-Scale and Occlusion Aware Network for Vehicle Detection and Segmentation on UAV Aerial ImagesWang Zhang0Chunsheng Liu1Faliang Chang2Ye Song3School of Control Science and Engineering, Shandong University, Jinan 250061, ChinaSchool of Control Science and Engineering, Shandong University, Jinan 250061, ChinaSchool of Control Science and Engineering, Shandong University, Jinan 250061, ChinaSchool of Control Science and Engineering, Shandong University, Jinan 250061, ChinaWith the advantage of high maneuverability, Unmanned Aerial Vehicles (UAVs) have been widely deployed in vehicle monitoring and controlling. However, processing the images captured by UAV for the extracting vehicle information is hindered by some challenges including arbitrary orientations, huge scale variations and partial occlusion. In seeking to address these challenges, we propose a novel Multi-Scale and Occlusion Aware Network (MSOA-Net) for UAV based vehicle segmentation, which consists of two parts including a Multi-Scale Feature Adaptive Fusion Network (MSFAF-Net) and a Regional Attention based Triple Head Network (RATH-Net). In MSFAF-Net, a self-adaptive feature fusion module is proposed, which can adaptively aggregate hierarchical feature maps from multiple levels to help Feature Pyramid Network (FPN) deal with the scale change of vehicles. The RATH-Net with a self-attention mechanism is proposed to guide the location-sensitive sub-networks to enhance the vehicle of interest and suppress background noise caused by occlusions. In this study, we release a large comprehensive UAV based vehicle segmentation dataset (UVSD), which is the first public dataset for UAV based vehicle detection and segmentation. Experiments are conducted on the challenging UVSD dataset. Experimental results show that the proposed method is efficient in detecting and segmenting vehicles, and outperforms the compared state-of-the-art works.https://www.mdpi.com/2072-4292/12/11/1760vehicle segmentationvehicle detectionfeature pyramid networkself-attention mechanismaerial images
collection DOAJ
language English
format Article
sources DOAJ
author Wang Zhang
Chunsheng Liu
Faliang Chang
Ye Song
spellingShingle Wang Zhang
Chunsheng Liu
Faliang Chang
Ye Song
Multi-Scale and Occlusion Aware Network for Vehicle Detection and Segmentation on UAV Aerial Images
Remote Sensing
vehicle segmentation
vehicle detection
feature pyramid network
self-attention mechanism
aerial images
author_facet Wang Zhang
Chunsheng Liu
Faliang Chang
Ye Song
author_sort Wang Zhang
title Multi-Scale and Occlusion Aware Network for Vehicle Detection and Segmentation on UAV Aerial Images
title_short Multi-Scale and Occlusion Aware Network for Vehicle Detection and Segmentation on UAV Aerial Images
title_full Multi-Scale and Occlusion Aware Network for Vehicle Detection and Segmentation on UAV Aerial Images
title_fullStr Multi-Scale and Occlusion Aware Network for Vehicle Detection and Segmentation on UAV Aerial Images
title_full_unstemmed Multi-Scale and Occlusion Aware Network for Vehicle Detection and Segmentation on UAV Aerial Images
title_sort multi-scale and occlusion aware network for vehicle detection and segmentation on uav aerial images
publisher MDPI AG
series Remote Sensing
issn 2072-4292
publishDate 2020-05-01
description With the advantage of high maneuverability, Unmanned Aerial Vehicles (UAVs) have been widely deployed in vehicle monitoring and controlling. However, processing the images captured by UAV for the extracting vehicle information is hindered by some challenges including arbitrary orientations, huge scale variations and partial occlusion. In seeking to address these challenges, we propose a novel Multi-Scale and Occlusion Aware Network (MSOA-Net) for UAV based vehicle segmentation, which consists of two parts including a Multi-Scale Feature Adaptive Fusion Network (MSFAF-Net) and a Regional Attention based Triple Head Network (RATH-Net). In MSFAF-Net, a self-adaptive feature fusion module is proposed, which can adaptively aggregate hierarchical feature maps from multiple levels to help Feature Pyramid Network (FPN) deal with the scale change of vehicles. The RATH-Net with a self-attention mechanism is proposed to guide the location-sensitive sub-networks to enhance the vehicle of interest and suppress background noise caused by occlusions. In this study, we release a large comprehensive UAV based vehicle segmentation dataset (UVSD), which is the first public dataset for UAV based vehicle detection and segmentation. Experiments are conducted on the challenging UVSD dataset. Experimental results show that the proposed method is efficient in detecting and segmenting vehicles, and outperforms the compared state-of-the-art works.
topic vehicle segmentation
vehicle detection
feature pyramid network
self-attention mechanism
aerial images
url https://www.mdpi.com/2072-4292/12/11/1760
work_keys_str_mv AT wangzhang multiscaleandocclusionawarenetworkforvehicledetectionandsegmentationonuavaerialimages
AT chunshengliu multiscaleandocclusionawarenetworkforvehicledetectionandsegmentationonuavaerialimages
AT faliangchang multiscaleandocclusionawarenetworkforvehicledetectionandsegmentationonuavaerialimages
AT yesong multiscaleandocclusionawarenetworkforvehicledetectionandsegmentationonuavaerialimages
_version_ 1724627661178798080