Multi-Scale and Occlusion Aware Network for Vehicle Detection and Segmentation on UAV Aerial Images
With the advantage of high maneuverability, Unmanned Aerial Vehicles (UAVs) have been widely deployed in vehicle monitoring and controlling. However, processing the images captured by UAV for the extracting vehicle information is hindered by some challenges including arbitrary orientations, huge sca...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2020-05-01
|
Series: | Remote Sensing |
Subjects: | |
Online Access: | https://www.mdpi.com/2072-4292/12/11/1760 |
id |
doaj-47d4a902d7804e199ea393fb7149b11f |
---|---|
record_format |
Article |
spelling |
doaj-47d4a902d7804e199ea393fb7149b11f2020-11-25T03:18:18ZengMDPI AGRemote Sensing2072-42922020-05-01121760176010.3390/rs12111760Multi-Scale and Occlusion Aware Network for Vehicle Detection and Segmentation on UAV Aerial ImagesWang Zhang0Chunsheng Liu1Faliang Chang2Ye Song3School of Control Science and Engineering, Shandong University, Jinan 250061, ChinaSchool of Control Science and Engineering, Shandong University, Jinan 250061, ChinaSchool of Control Science and Engineering, Shandong University, Jinan 250061, ChinaSchool of Control Science and Engineering, Shandong University, Jinan 250061, ChinaWith the advantage of high maneuverability, Unmanned Aerial Vehicles (UAVs) have been widely deployed in vehicle monitoring and controlling. However, processing the images captured by UAV for the extracting vehicle information is hindered by some challenges including arbitrary orientations, huge scale variations and partial occlusion. In seeking to address these challenges, we propose a novel Multi-Scale and Occlusion Aware Network (MSOA-Net) for UAV based vehicle segmentation, which consists of two parts including a Multi-Scale Feature Adaptive Fusion Network (MSFAF-Net) and a Regional Attention based Triple Head Network (RATH-Net). In MSFAF-Net, a self-adaptive feature fusion module is proposed, which can adaptively aggregate hierarchical feature maps from multiple levels to help Feature Pyramid Network (FPN) deal with the scale change of vehicles. The RATH-Net with a self-attention mechanism is proposed to guide the location-sensitive sub-networks to enhance the vehicle of interest and suppress background noise caused by occlusions. In this study, we release a large comprehensive UAV based vehicle segmentation dataset (UVSD), which is the first public dataset for UAV based vehicle detection and segmentation. Experiments are conducted on the challenging UVSD dataset. Experimental results show that the proposed method is efficient in detecting and segmenting vehicles, and outperforms the compared state-of-the-art works.https://www.mdpi.com/2072-4292/12/11/1760vehicle segmentationvehicle detectionfeature pyramid networkself-attention mechanismaerial images |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Wang Zhang Chunsheng Liu Faliang Chang Ye Song |
spellingShingle |
Wang Zhang Chunsheng Liu Faliang Chang Ye Song Multi-Scale and Occlusion Aware Network for Vehicle Detection and Segmentation on UAV Aerial Images Remote Sensing vehicle segmentation vehicle detection feature pyramid network self-attention mechanism aerial images |
author_facet |
Wang Zhang Chunsheng Liu Faliang Chang Ye Song |
author_sort |
Wang Zhang |
title |
Multi-Scale and Occlusion Aware Network for Vehicle Detection and Segmentation on UAV Aerial Images |
title_short |
Multi-Scale and Occlusion Aware Network for Vehicle Detection and Segmentation on UAV Aerial Images |
title_full |
Multi-Scale and Occlusion Aware Network for Vehicle Detection and Segmentation on UAV Aerial Images |
title_fullStr |
Multi-Scale and Occlusion Aware Network for Vehicle Detection and Segmentation on UAV Aerial Images |
title_full_unstemmed |
Multi-Scale and Occlusion Aware Network for Vehicle Detection and Segmentation on UAV Aerial Images |
title_sort |
multi-scale and occlusion aware network for vehicle detection and segmentation on uav aerial images |
publisher |
MDPI AG |
series |
Remote Sensing |
issn |
2072-4292 |
publishDate |
2020-05-01 |
description |
With the advantage of high maneuverability, Unmanned Aerial Vehicles (UAVs) have been widely deployed in vehicle monitoring and controlling. However, processing the images captured by UAV for the extracting vehicle information is hindered by some challenges including arbitrary orientations, huge scale variations and partial occlusion. In seeking to address these challenges, we propose a novel Multi-Scale and Occlusion Aware Network (MSOA-Net) for UAV based vehicle segmentation, which consists of two parts including a Multi-Scale Feature Adaptive Fusion Network (MSFAF-Net) and a Regional Attention based Triple Head Network (RATH-Net). In MSFAF-Net, a self-adaptive feature fusion module is proposed, which can adaptively aggregate hierarchical feature maps from multiple levels to help Feature Pyramid Network (FPN) deal with the scale change of vehicles. The RATH-Net with a self-attention mechanism is proposed to guide the location-sensitive sub-networks to enhance the vehicle of interest and suppress background noise caused by occlusions. In this study, we release a large comprehensive UAV based vehicle segmentation dataset (UVSD), which is the first public dataset for UAV based vehicle detection and segmentation. Experiments are conducted on the challenging UVSD dataset. Experimental results show that the proposed method is efficient in detecting and segmenting vehicles, and outperforms the compared state-of-the-art works. |
topic |
vehicle segmentation vehicle detection feature pyramid network self-attention mechanism aerial images |
url |
https://www.mdpi.com/2072-4292/12/11/1760 |
work_keys_str_mv |
AT wangzhang multiscaleandocclusionawarenetworkforvehicledetectionandsegmentationonuavaerialimages AT chunshengliu multiscaleandocclusionawarenetworkforvehicledetectionandsegmentationonuavaerialimages AT faliangchang multiscaleandocclusionawarenetworkforvehicledetectionandsegmentationonuavaerialimages AT yesong multiscaleandocclusionawarenetworkforvehicledetectionandsegmentationonuavaerialimages |
_version_ |
1724627661178798080 |