Causal contextual bandits with one-shot data integration

Chandrasekar Subramanian; Balaraman Ravindran

doi:10.3389/frai.2024.1346700

Frontiers in Artificial Intelligence (Dec 2024)

Affiliations

Chandrasekar Subramanian: Robert Bosch Center for Data Science and Artificial Intelligence, Indian Institute of Technology Madras, Chennai, India
Chandrasekar Subramanian: Department of Computer Science and Engineering, Indian Institute of Technology Madras, Chennai, India
Balaraman Ravindran: Robert Bosch Center for Data Science and Artificial Intelligence, Indian Institute of Technology Madras, Chennai, India
Balaraman Ravindran: Department of Computer Science and Engineering, Indian Institute of Technology Madras, Chennai, India

Journal volume & issue: Vol. 7

Abstract

We study a contextual bandit setting where the agent has access to causal side information, in addition to the ability to perform multiple targeted experiments, corresponding to potentially different context-action pairs, simultaneously in one shot within a budget. This new formalism provides a natural model for several real-world scenarios where parallel targeted experiments can be conducted and where some domain knowledge of causal relationships is available. We propose a new algorithm that utilizes a novel entropy-like measure that we introduce. We perform several experiments, both using purely synthetic data and using a real-world dataset. In addition, we study the sensitivity of our algorithm's performance to various aspects of the problem setting. The results show that our algorithm performs better than baselines in all of the experiments. We also show that the algorithm is sound; that is, as budget increases, the learned policy eventually converges to an optimal policy. Further, we theoretically bound our algorithm's regret under additional assumptions. Finally, we provide ways to achieve two popular notions of fairness, namely counterfactual fairness and demographic parity, with our algorithm.

Published in Frontiers in Artificial Intelligence

ISSN: 2624-8212 (Online)
Publisher: Frontiers Media S.A.
Country of publisher: Switzerland
LCC subjects: Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: https://www.frontiersin.org/journals/artificial-intelligence#
