Searching and Tracking an Unknown Number of Targets: A Learning-Based Method Enhanced with Maps Merging

Unmanned aerial vehicles (UAVs) have been widely used in search and rescue (SAR) missions due to their high flexibility. A key problem in SAR missions is to search and track moving targets in an area of interest. In this paper, we focus on the problem of Cooperative Multi-UAV Observation of Multiple...

Full description

Bibliographic Details
Main Authors:	Peng Yan, Tao Jia, Chengchao Bai
Format:	Article
Language:	English
Published:	MDPI AG 2021-02-01
Series:	Sensors
Subjects:	unmanned aerial vehicle (UAV) search and track deep reinforcement learning (DRL) maps merging convolutional neural network (CNN)
Online Access:	https://www.mdpi.com/1424-8220/21/4/1076

id	doaj-e5c7af224ba54220adeabee26a72e9fa
record_format	Article
spelling	doaj-e5c7af224ba54220adeabee26a72e9fa2021-02-05T00:05:35ZengMDPI AGSensors1424-82202021-02-01211076107610.3390/s21041076Searching and Tracking an Unknown Number of Targets: A Learning-Based Method Enhanced with Maps MergingPeng Yan0Tao Jia1Chengchao Bai2School of Astronautics, Harbin Institute of Technology, Harbin 150001, ChinaAerospace Technology Research Institute, China Aerodynamics Research and Development Center, Mianyang 621000, ChinaSchool of Astronautics, Harbin Institute of Technology, Harbin 150001, ChinaUnmanned aerial vehicles (UAVs) have been widely used in search and rescue (SAR) missions due to their high flexibility. A key problem in SAR missions is to search and track moving targets in an area of interest. In this paper, we focus on the problem of Cooperative Multi-UAV Observation of Multiple Moving Targets (CMUOMMT). In contrast to the existing literature, we not only optimize the average observation rate of the discovered targets, but we also emphasize the fairness of the observation of the discovered targets and the continuous exploration of the undiscovered targets, under the assumption that the total number of targets is unknown. To achieve this objective, a deep reinforcement learning (DRL)-based method is proposed under the Partially Observable Markov Decision Process (POMDP) framework, where each UAV maintains four observation history maps, and maps from different UAVs within a communication range can be merged to enhance UAVs’ awareness of the environment. A deep convolutional neural network (CNN) is used to process the merged maps and generate the control commands to UAVs. The simulation results show that our policy can enable UAVs to balance between giving the discovered targets a fair observation and exploring the search region compared with other methods.https://www.mdpi.com/1424-8220/21/4/1076unmanned aerial vehicle (UAV)search and trackdeep reinforcement learning (DRL)maps mergingconvolutional neural network (CNN)
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Peng Yan Tao Jia Chengchao Bai
spellingShingle	Peng Yan Tao Jia Chengchao Bai Searching and Tracking an Unknown Number of Targets: A Learning-Based Method Enhanced with Maps Merging Sensors unmanned aerial vehicle (UAV) search and track deep reinforcement learning (DRL) maps merging convolutional neural network (CNN)
author_facet	Peng Yan Tao Jia Chengchao Bai
author_sort	Peng Yan
title	Searching and Tracking an Unknown Number of Targets: A Learning-Based Method Enhanced with Maps Merging
title_short	Searching and Tracking an Unknown Number of Targets: A Learning-Based Method Enhanced with Maps Merging
title_full	Searching and Tracking an Unknown Number of Targets: A Learning-Based Method Enhanced with Maps Merging
title_fullStr	Searching and Tracking an Unknown Number of Targets: A Learning-Based Method Enhanced with Maps Merging
title_full_unstemmed	Searching and Tracking an Unknown Number of Targets: A Learning-Based Method Enhanced with Maps Merging
title_sort	searching and tracking an unknown number of targets: a learning-based method enhanced with maps merging
publisher	MDPI AG
series	Sensors
issn	1424-8220
publishDate	2021-02-01
description	Unmanned aerial vehicles (UAVs) have been widely used in search and rescue (SAR) missions due to their high flexibility. A key problem in SAR missions is to search and track moving targets in an area of interest. In this paper, we focus on the problem of Cooperative Multi-UAV Observation of Multiple Moving Targets (CMUOMMT). In contrast to the existing literature, we not only optimize the average observation rate of the discovered targets, but we also emphasize the fairness of the observation of the discovered targets and the continuous exploration of the undiscovered targets, under the assumption that the total number of targets is unknown. To achieve this objective, a deep reinforcement learning (DRL)-based method is proposed under the Partially Observable Markov Decision Process (POMDP) framework, where each UAV maintains four observation history maps, and maps from different UAVs within a communication range can be merged to enhance UAVs’ awareness of the environment. A deep convolutional neural network (CNN) is used to process the merged maps and generate the control commands to UAVs. The simulation results show that our policy can enable UAVs to balance between giving the discovered targets a fair observation and exploring the search region compared with other methods.
topic	unmanned aerial vehicle (UAV) search and track deep reinforcement learning (DRL) maps merging convolutional neural network (CNN)
url	https://www.mdpi.com/1424-8220/21/4/1076
work_keys_str_mv	AT pengyan searchingandtrackinganunknownnumberoftargetsalearningbasedmethodenhancedwithmapsmerging AT taojia searchingandtrackinganunknownnumberoftargetsalearningbasedmethodenhancedwithmapsmerging AT chengchaobai searchingandtrackinganunknownnumberoftargetsalearningbasedmethodenhancedwithmapsmerging
_version_	1724284453771018240

Searching and Tracking an Unknown Number of Targets: A Learning-Based Method Enhanced with Maps Merging

Similar Items