Joint learning of agents and graph embeddings in a conveyor belt control problem

Konstantin E. Rybkin; Andrey A. Filchenkov; Artur A. Azarov; Alexey S. Zabashta; Anatoly A. Shalyto

doi:10.17586/2226-1494-2022-22-6-1187-1196

Naučno-tehničeskij Vestnik Informacionnyh Tehnologij, Mehaniki i Optiki (Dec 2022)

Joint learning of agents and graph embeddings in a conveyor belt control problem

Konstantin E. Rybkin,
Andrey A. Filchenkov,
Artur A. Azarov,
Alexey S. Zabashta,
Anatoly A. Shalyto

Affiliations

Konstantin E. Rybkin: ORCiD; Engineer, ITMO University, Saint Petersburg, 197101, Russian Federation
Andrey A. Filchenkov: ORCiD; PhD (Physics & Mathematics), Engineer, ITMO University, Saint Petersburg, 197101, Russian Federation, sc 55507568200
Artur A. Azarov: ORCiD; PhD, Scientific Researcher, ITMO University, Saint Petersburg, 197101, Russian Federation; Deputy Director, North-West Institute of Management - branch of the Russian Presidential Academy of National Economy and Public Administration, Saint Petersburg, 199178, Russian Federation, sc 56938354700
Alexey S. Zabashta: ORCiD; PhD, Associate Professor, ITMO University, Saint Petersburg, 197101, Russian Federation, sc 56902663900
Anatoly A. Shalyto: ORCiD; D. Sc., Professor, Chief Reseacher, ITMO University, Saint Petersburg, 197101, Russian Federation, sc 56131789500

DOI: https://doi.org/10.17586/2226-1494-2022-22-6-1187-1196
Journal volume & issue: Vol. 22, no. 6
pp. 1187 – 1196

Abstract

Read online

We focus on the problem of routing a conveyor belts system based on a multi-agent approach. Most of these airport baggage belt conveyor systems use routing algorithms based on manual simulation of conveyor behavior. This approach does not scale well, and new research in machine learning proposes to solve the routing problem using reinforcement learning. To solve this problem, we propose an approach to joint learning of agents and vector representations of a graph. Within this approach, we develop a QSDNE algorithm, which uses DQN agents and SDNE embeddings. A comparative analysis was carried out with multi-agent routing algorithms without joint learning. The results of the QSDNE algorithm showed its effectiveness in optimizing the delivery time and energy consumption in conveyor systems as it helped to reduce mean delivery time by 6 %. The proposed approach can be used to solve routing problems with complex path estimation functions and dynamically changing graph topologies, and the proposed algorithm can be used to control conveyor belts at airports and in manufacturing workshops.

Published in Naučno-tehničeskij Vestnik Informacionnyh Tehnologij, Mehaniki i Optiki

ISSN: 2226-1494 (Print); 2500-0373 (Online)
Publisher: Saint Petersburg National Research University of Information Technologies, Mechanics and Optics (ITMO University)
Country of publisher: Russian Federation
LCC subjects: Science: Physics: Optics. Light; Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: http://ntv.ifmo.ru/en/english.htm

About the journal

Abstract

Keywords