Adaptive Control for Underwater Simultaneous Lightwave Information and Power Transfer: A Hierarchical Deep-Reinforcement Approach

Huicheol Shin; Sangki Jeong; Seungjae Baek; Yujae Song

doi:10.3390/jmse12091647

Journal of Marine Science and Engineering (Sep 2024)

Adaptive Control for Underwater Simultaneous Lightwave Information and Power Transfer: A Hierarchical Deep-Reinforcement Approach

Huicheol Shin,
Sangki Jeong,
Seungjae Baek,
Yujae Song

Affiliations

Huicheol Shin: Maritime ICT & Mobility Research Department, Korea Institute of Ocean Science and Technology, Busan 49111, Republic of Korea
Sangki Jeong: Maritime ICT & Mobility Research Department, Korea Institute of Ocean Science and Technology, Busan 49111, Republic of Korea
Seungjae Baek: Maritime ICT & Mobility Research Department, Korea Institute of Ocean Science and Technology, Busan 49111, Republic of Korea
Yujae Song: Department of Robotics Engineering, Yeungnam University, Gyeongsan 38541, Republic of Korea

DOI: https://doi.org/10.3390/jmse12091647
Journal volume & issue: Vol. 12, no. 9
p. 1647

Abstract

Read online

In this work, we consider a point-to-point underwater optical wireless communication scenario where an underwater sensor (US) transmits its sensing data to a remotely operated vehicle (ROV). Before the US transmits its data to the ROV, the ROV performs simultaneous lightwave information and power transfer (SLIPT), delivering both control data and lightwave power to the US. Under the considered scenario, our objective is to maximize energy harvesting at the US while supporting predetermined communication performance between the two nodes. To achieve this objective, we develop a hierarchical deep Q-network (DQN)–deep deterministic policy gradient (DDPG)-based online algorithm. This algorithm involves two reinforcement learning agents: the ROV and US. The role of the ROV agent is to determine an optimal beam-divergence angle that maximizes the received optical signal power at the US while ensuring a seamless optical link. Meanwhile, the US agent, which is influenced by the decision of the ROV agent, is responsible for determining the time-switching and power-splitting ratios to maximize energy harvesting without compromising the required communication performance. Unlike existing studies that do not account for adaptive parameter control in underwater SLIPT, the proposed algorithm’s adaptive nature allows for the dynamic fine-tuning of optimization parameters in response to varying underwater environmental conditions and diverse user requirements.

Published in Journal of Marine Science and Engineering

ISSN: 2077-1312 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Naval Science: Naval architecture. Shipbuilding. Marine engineering; Geography. Anthropology. Recreation: Oceanography
Website: http://www.mdpi.com/journal/jmse

About the journal

Abstract

Keywords