Constrained distributed online convex optimization with bandit feedback for unbalanced digraphs

Keishin Tada; Naoki Hayashi; Shigemasa Takai

doi:10.1049/cth2.12548

IET Control Theory & Applications (Jan 2024)

Constrained distributed online convex optimization with bandit feedback for unbalanced digraphs

Keishin Tada,
Naoki Hayashi,
Shigemasa Takai

Affiliations

Keishin Tada: Graduate School of Engineering Osaka University Suita Osaka Japan
Naoki Hayashi: Graduate School of Engineering Science Osaka University Toyonaka Osaka Japan
Shigemasa Takai: Graduate School of Engineering Osaka University Suita Osaka Japan

DOI: https://doi.org/10.1049/cth2.12548
Journal volume & issue: Vol. 18, no. 2
pp. 184 – 200

Abstract

Read online

Abstract In this study, a distributed primal‐dual bandit feedback method for online convex optimization with time‐varying coupled inequality constraints on unbalanced directed graphs is proposed. A multiagent network is considered in which agents exchange the estimations of the dual optimizer and the scaling variable with their neighbors. The scaling variable is used to resolve the bias of the estimations caused by a directed communication network. Each agent does not have prior knowledge of the loss function, and its value at a queried point is sequentially disclosed to each agent. Each agent performs a projected subgradient‐based primal‐dual algorithm to estimate the optimal solution. It is confirmed that both the expected dynamic regret of the loss function and the cumulative error of the constraint violation achieve sublinearity using the proposed online algorithm with the two‐point bandit feedback.

Published in IET Control Theory & Applications

ISSN: 1751-8644 (Print); 1751-8652 (Online)
Publisher: Wiley
Country of publisher: United Kingdom
LCC subjects: Technology: Mechanical engineering and machinery: Control engineering systems. Automatic machinery (General)
Website: https://ietresearch.onlinelibrary.wiley.com/journal/17518652

About the journal

Abstract

Keywords