Actor–Critic Reinforcement Learning and Application in Developing Computer-Vision-Based Interface Tracking

Oguzhan Dogru; Kirubakaran Velswamy; Biao Huang

doi:10.1016/j.eng.2021.04.027

Engineering (Sep 2021)

Affiliations

Oguzhan Dogru: Department of Chemical and Materials Engineering, University of Alberta, Edmonton, AB T6G 1H9, Canada
Kirubakaran Velswamy: Department of Chemical and Materials Engineering, University of Alberta, Edmonton, AB T6G 1H9, Canada
Biao Huang (corresponding author): Department of Chemical and Materials Engineering, University of Alberta, Edmonton, AB T6G 1H9, Canada

DOI: https://doi.org/10.1016/j.eng.2021.04.027
Journal volume & issue: Vol. 7, no. 9, pp. 1248–1261

Abstract

This paper synchronizes control theory with computer vision by formalizing object tracking as a sequential decision-making process. A reinforcement learning (RL) agent successfully tracks an interface between two liquids, which is often a critical variable to track in the chemical, petrochemical, metallurgical, and oil industries. The method uses fewer than 100 images to create an environment, from which the agent generates its own data without the need for expert knowledge. Unlike supervised learning (SL) methods that rely on a huge number of parameters, this approach requires far fewer parameters, which naturally reduces its maintenance cost. Besides being frugal, the agent is robust to environmental uncertainties such as occlusion, intensity changes, and excessive noise. In a closed-loop control context, an interface location-based deviation is chosen as the optimization goal during training. The methodology showcases RL for real-time object-tracking applications in the oil sands industry. Along with a presentation of the interface tracking problem, this paper provides a detailed review of one of the most effective RL methodologies: actor–critic policy.
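To make the formulation in the abstract concrete, the following is a minimal sketch of tracking posed as a sequential decision problem, with a deviation-based reward and a single tabular one-step actor-critic update. It is not the authors' implementation: the synthetic 1-D intensity profile, the three-action set, the binarised observation window, and every name and hyperparameter (`HEIGHT`, `observe`, `step`, `actor_critic_update`, `gamma`, the learning rates) are assumptions introduced only for illustration.

```python
import math

HEIGHT = 20            # rows in a synthetic 1-D intensity profile (assumed)
ACTIONS = (-1, 0, 1)   # move the estimate up one row, stay, or move down one row

def clamp(row):
    """Keep a row index inside the profile."""
    return min(max(row, 0), HEIGHT - 1)

def observe(estimate, interface):
    """Binarised 3-pixel window around the estimate: 0 above the interface, 1 at or below it."""
    return tuple(int(clamp(estimate + d) >= interface) for d in (-1, 0, 1))

def step(estimate, action, interface):
    """Move the estimate; the reward is the negative deviation from the true interface row."""
    estimate = clamp(estimate + ACTIONS[action])
    return estimate, -abs(estimate - interface)

def softmax(prefs):
    """Numerically stable softmax over a list of action preferences."""
    peak = max(prefs)
    exps = [math.exp(p - peak) for p in prefs]
    total = sum(exps)
    return [e / total for e in exps]

def actor_critic_update(prefs, values, obs, action, reward, next_obs,
                        gamma=0.9, lr_actor=0.1, lr_critic=0.1):
    """One-step actor-critic update on a tabular actor (prefs) and critic (values)."""
    p = prefs.setdefault(obs, [0.0] * len(ACTIONS))
    probs = softmax(p)
    # Critic: temporal-difference error of the state-value estimate.
    td_error = reward + gamma * values.get(next_obs, 0.0) - values.get(obs, 0.0)
    values[obs] = values.get(obs, 0.0) + lr_critic * td_error
    # Actor: softmax policy-gradient step, scaled by the TD error.
    for a in range(len(ACTIONS)):
        p[a] += lr_actor * td_error * ((1.0 if a == action else 0.0) - probs[a])
    return td_error

# Usage: a single transition with the estimate at row 5 and the interface at row 12.
prefs, values = {}, {}
estimate, interface = 5, 12
obs = observe(estimate, interface)
new_estimate, reward = step(estimate, 2, interface)   # action index 2 = move down
td = actor_critic_update(prefs, values, obs, 2, reward, observe(new_estimate, interface))
```

In this sketch the critic's TD error acts as the feedback signal for the actor, which mirrors the closed-loop framing of the abstract: the deviation from the interface plays the role of the control error. A real agent would observe image crops rather than a three-pixel binary window and would use function approximators instead of tables.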

Published in Engineering

ISSN: 2095-8099 (Print); 2096-0026 (Online)
Publisher: Elsevier
Country of publisher: Netherlands
LCC subjects: Technology: Engineering (General). Civil engineering (General)
Website: https://www.sciencedirect.com/journal/engineering
