IEEE Access (Jan 2023)

Bias Detection for Customer Interaction Data: A Survey on Datasets, Methods, and Tools

  • Andy Donald,
  • Apostolos Galanopoulos,
  • Edward Curry,
  • Emir Munoz,
  • Ihsan Ullah,
  • M. A. Waskow,
  • Maciej Dabrowski,
  • Manan Kalra

DOI
https://doi.org/10.1109/ACCESS.2023.3276757
Journal volume & issue
Vol. 11
pp. 53703 – 53715

Abstract

Read online

With the increase in usage of machine learning models within many different aspects of customer interactions, it has become very clear that bias detection within associated customer interaction datasets has led to a critical focus on issues such as the identification of bias prior to model building, lack of understanding and transparency within models, and ultimately the prevention of biased predictions or classifications. This has never been more important since the introduction of the EU General Data Protection Regulation (GDPR) and the associated rule of “right of explanation”. In this paper, we survey the state of the art for bias detection, avoidance and mitigation within datasets, and the associated methods and tools available. Our purpose is to establish an understanding of how established customer interaction-based use cases can utilise these techniques. The focus is primarily on tackling the bias in unstructured text data as a pre-process prior to the machine learning model training phase. We hope that this research encourages the further establishment of responsible usage of customer interaction datasets to allow the prevention of bias being introduced into machine learning pipelines and to also allow greater awareness of the potential for further research in this area.

Keywords