Employing Hybrid AI Systems to Trace and Document Bias in ML Pipelines

Mayra Russo; Yasharajsinh Chudasama; Disha Purohit; Sammy Sawischa; Maria-Esther Vidal

doi:10.1109/ACCESS.2024.3427388

IEEE Access (Jan 2024)

Affiliations

Mayra Russo: L3S Research Center, Hannover, Germany
Yasharajsinh Chudasama: Institute of Data Science-Scientific Data Management, Leibniz University Hannover, Hannover, Germany
Disha Purohit: Institute of Data Science-Scientific Data Management, Leibniz University Hannover, Hannover, Germany
Sammy Sawischa: Institute of Data Science-Scientific Data Management, Leibniz University Hannover, Hannover, Germany
Maria-Esther Vidal: L3S Research Center, Hannover, Germany

DOI: https://doi.org/10.1109/ACCESS.2024.3427388
Journal volume & issue: Vol. 12
pp. 96821 – 96847

Abstract

Artificial Intelligence (AI) systems can introduce biases that lead to unreliable outcomes and, in the worst-case scenarios, perpetuate systemic and discriminatory results when deployed in the real world. While significant efforts have been made to create bias detection methods, developing reliable and comprehensive documentation artifacts also makes for valuable resources that address bias and aid in minimizing the harms associated with AI systems. Based on compositional design patterns, this paper introduces a documentation approach using a hybrid AI system to prompt the identification and traceability of bias in datasets and predictive AI models. To demonstrate the effectiveness of our approach, we instantiate our pattern in two implementations of a hybrid AI system. One follows an integrated approach and performs fine-grained tracing and documentation of the AI model. In contrast, the other hybrid system follows a principled approach and enables the documentation and comparison of bias in the input data and the predictions generated by the model. Through a use case based on Fake News detection and an empirical evaluation, we show how biases detected during data ingestion steps (e.g., label, over-representation, activity bias) affect the training and predictions of the classification models. Concretely, we report a stark skewness in the distribution of input variables towards the Fake News label, we uncover how a predictive variable leads to more constraints in the learning process, and we highlight open challenges of training models with unbalanced datasets. A video summarizing this work is available online (https://youtu.be/v2GfIQPAy_4?si=BXtWOf97cLiZavyu), and the implementation is publicly available on GitHub (https://github.com/SDM-TIB/DocBiasKG).

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639
