A Framework for Multi-Agent UAV Exploration and Target-Finding in GPS-Denied and Partially Observable Environments

The problem of multi-agent remote sensing for the purposes of finding survivors or surveying points of interest in GPS-denied and partially observable environments remains a challenge. This paper presents a framework for multi-agent target-finding using a combination of online POMDP-based planning a...

Full description

Bibliographic Details
Main Authors: Ory Walker, Fernando Vanegas, Felipe Gonzalez
Format: Article
Language:English
Published: MDPI AG 2020-08-01
Series:Sensors
Subjects:
UAV
Online Access:https://www.mdpi.com/1424-8220/20/17/4739
Description
Summary:The problem of multi-agent remote sensing for the purposes of finding survivors or surveying points of interest in GPS-denied and partially observable environments remains a challenge. This paper presents a framework for multi-agent target-finding using a combination of online POMDP-based planning and Deep Reinforcement Learning-based control. The implementation considers the problems of planning and control as separate problem components of a single problem. The planning problem is defined as a decentralised multi-agent graph search problem and is solved using a modern online POMDP solver. The control problem is defined as a local continuous-environment exploration problem and is solved using modern Deep Reinforcement Learning techniques. The proposed framework combines the solution to both of these problems and testing shows that it enables multiple agents to find a target within large, simulated test environments in the presence of unknown obstacles and obstructions. The proposed approach could also be extended or adapted to a number of time sensitive remote-sensing problems, from searching for multiple survivors during a disaster to surveying points of interest in a hazardous environment by adjusting the individual model definitions.
ISSN:1424-8220