Counterfactual Mix-Up for Visual Question Answering

Jae Won Cho; Dong-Jin Kim; Yunjae Jung; In So Kweon

doi:10.1109/ACCESS.2023.3303891

IEEE Access (Jan 2023)

Counterfactual Mix-Up for Visual Question Answering

Jae Won Cho,
Dong-Jin Kim,
Yunjae Jung,
In So Kweon

Affiliations

Jae Won Cho: ORCiD; Department of Electrical Engineering, Korea Advanced Institute of Science and Technology (KAIST), Daejeon, South Korea
Dong-Jin Kim: ORCiD; Department of Data Science, Hanyang University, Seoul, South Korea
Yunjae Jung: Department of Electrical Engineering, Korea Advanced Institute of Science and Technology (KAIST), Daejeon, South Korea
In So Kweon: ORCiD; Department of Electrical Engineering, Korea Advanced Institute of Science and Technology (KAIST), Daejeon, South Korea

DOI: https://doi.org/10.1109/ACCESS.2023.3303891
Journal volume & issue: Vol. 11
pp. 95201 – 95212

Abstract

Read online

Counterfactuals have been shown to be a powerful method in Visual Question Answering in the alleviation of Visual Question Answering’s unimodal bias. However, existing counterfactual methods tend to generate samples that are not diverse or require auxiliary models to synthesize additional data. In this regard, we propose a more diverse and simple counterfactual sample synthesis method called Counterfactual Mix-Up (CoMiU), which generates counterfactual image features and questions through batch-wise swapping in local object- and word-level. This method efficiently facilitates the generation of more abundant and diverse counterfactual samples, which help improve the robustness of Visual Question Answering models. Moreover, with the creation of diverse counterfactual samples, we introduce two more robust and stable contrastive loss functions, namely Batch-Contrastive loss and Answer-Contrastive loss. We test our method on various challenging Visual Question Answering robustness testing setups to show the advantages of the proposed method compared with the current state-of-the-art methods.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords