Bias Detection for Customer Interaction Data: A Survey on Datasets, Methods, and Tools

Andy Donald; Apostolos Galanopoulos; Edward Curry; Emir Munoz; Ihsan Ullah; M. A. Waskow; Maciej Dabrowski; Manan Kalra

doi:10.1109/access.2023.3276757

IEEE Access (Jan 2023)

Bias Detection for Customer Interaction Data: A Survey on Datasets, Methods, and Tools

Andy Donald,
Apostolos Galanopoulos,
Edward Curry,
Emir Munoz,
Ihsan Ullah,
M. A. Waskow,
Maciej Dabrowski,
Manan Kalra

Affiliations

Andy Donald: ORCiD; Insight SFI Research Center for Data Analytics, University of Galway, Lower Dangan, Galway, Ireland
Apostolos Galanopoulos: Genesys Cloud Services Inc., Galway, Ireland
Edward Curry: ORCiD; Insight SFI Research Center for Data Analytics, University of Galway, Lower Dangan, Galway, Ireland
Emir Munoz: ORCiD; Genesys Cloud Services Inc., Galway, Ireland
Ihsan Ullah: ORCiD; Insight SFI Research Center for Data Analytics, University of Galway, Lower Dangan, Galway, Ireland
M. A. Waskow: ORCiD; Insight SFI Research Center for Data Analytics, University of Galway, Lower Dangan, Galway, Ireland
Maciej Dabrowski: Genesys Cloud Services Inc., Galway, Ireland
Manan Kalra: Genesys Cloud Services Inc., Galway, Ireland

DOI: https://doi.org/10.1109/access.2023.3276757
Journal volume & issue: Vol. 11
pp. 53703 – 53715

Abstract

Read online

With the increase in usage of machine learning models within many different aspects of customer interactions, it has become very clear that bias detection within associated customer interaction datasets has led to a critical focus on issues such as the identification of bias prior to model building, lack of understanding and transparency within models, and ultimately the prevention of biased predictions or classifications. This has never been more important since the introduction of the EU General Data Protection Regulation (GDPR) and the associated rule of “right of explanation”. In this paper, we survey the state of the art for bias detection, avoidance and mitigation within datasets, and the associated methods and tools available. Our purpose is to establish an understanding of how established customer interaction-based use cases can utilise these techniques. The focus is primarily on tackling the bias in unstructured text data as a pre-process prior to the machine learning model training phase. We hope that this research encourages the further establishment of responsible usage of customer interaction datasets to allow the prevention of bias being introduced into machine learning pipelines and to also allow greater awareness of the potential for further research in this area.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords