A Dimensional Comparison between Evolutionary Algorithm and Deep Reinforcement Learning Methodologies for Autonomous Surface Vehicles with Water Quality Sensors

Samuel Yanes Luis; Daniel Gutiérrez-Reina; Sergio Toral Marín

doi:10.3390/s21082862

Sensors (Apr 2021)

A Dimensional Comparison between Evolutionary Algorithm and Deep Reinforcement Learning Methodologies for Autonomous Surface Vehicles with Water Quality Sensors

Samuel Yanes Luis,
Daniel Gutiérrez-Reina,
Sergio Toral Marín

Affiliations

Samuel Yanes Luis: Department of Electronic Engineering, University of Seville, 41009 Seville, Spain
Daniel Gutiérrez-Reina: Department of Electronic Engineering, University of Seville, 41009 Seville, Spain
Sergio Toral Marín: Department of Electronic Engineering, University of Seville, 41009 Seville, Spain

DOI: https://doi.org/10.3390/s21082862
Journal volume & issue: Vol. 21, no. 8
p. 2862

Abstract

Read online

The monitoring of water resources using Autonomous Surface Vehicles with water-quality sensors has been a recent approach due to the advances in unmanned transportation technology. The Ypacaraí Lake, the biggest water resource in Paraguay, suffers from a major contamination problem because of cyanobacteria blooms. In order to supervise the blooms using these on-board sensor modules, a Non-Homogeneous Patrolling Problem (a NP-hard problem) must be solved in a feasible amount of time. A dimensionality study is addressed to compare the most common methodologies, Evolutionary Algorithm and Deep Reinforcement Learning, in different map scales and fleet sizes with changes in the environmental conditions. The results determined that Deep Q-Learning overcomes the evolutionary method in terms of sample-efficiency by 50–70% in higher resolutions. Furthermore, it reacts better than the Evolutionary Algorithm in high space-state actions. In contrast, the evolutionary approach shows a better efficiency in lower resolutions and needs fewer parameters to synthesize robust solutions. This study reveals that Deep Q-learning approaches exceed in efficiency for the Non-Homogeneous Patrolling Problem but with many hyper-parameters involved in the stability and convergence.

Published in Sensors

ISSN: 1424-8220 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Chemical technology
Website: http://www.mdpi.com/journal/sensors

About the journal

Abstract

Keywords