Deep reinforcement learning for time-critical wilderness search and rescue using drones

Jan-Hendrik Ewers; David Anderson; Douglas Thomson

doi:10.3389/frobt.2024.1527095

Frontiers in Robotics and AI (Feb 2025)

Deep reinforcement learning for time-critical wilderness search and rescue using drones

Jan-Hendrik Ewers,
David Anderson,
Douglas Thomson

Affiliations

Jan-Hendrik Ewers
David Anderson
Douglas Thomson

DOI: https://doi.org/10.3389/frobt.2024.1527095
Journal volume & issue: Vol. 11

Abstract

Read online

Traditional search and rescue methods in wilderness areas can be time-consuming and have limited coverage. Drones offer a faster and more flexible solution, but optimizing their search paths is crucial for effective operations. This paper proposes a novel algorithm using deep reinforcement learning to create efficient search paths for drones in wilderness environments. Our approach leverages a priori data about the search area and the missing person in the form of a probability distribution map. This allows the policy to learn optimal flight paths that maximize the probability of finding the missing person quickly. Experimental results show that our method achieves a significant improvement in search times compared to traditional coverage planning and search planning algorithms by over 160%, a difference that can mean life or death in real-world search operations Additionally, unlike previous work, our approach incorporates a continuous action space enabled by cubature, allowing for more nuanced flight patterns.

Published in Frontiers in Robotics and AI

ISSN: 2296-9144 (Online)
Publisher: Frontiers Media S.A.
Country of publisher: Switzerland
LCC subjects: Technology: Mechanical engineering and machinery; Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://www.frontiersin.org/journals/robotics-and-ai

About the journal

Abstract

Keywords