Journal of Big Data (Sep 2024)

A multi-dimensional hierarchical evaluation system for data quality in trustworthy AI

  • Hui-Juan Zhang,
  • Can-Can Chen,
  • Peng Ran,
  • Kai Yang,
  • Quan-Chao Liu,
  • Zhe-Yuan Sun,
  • Jia Chen,
  • Jia-Ke Chen

DOI
https://doi.org/10.1186/s40537-024-00999-2
Journal volume & issue
Vol. 11, no. 1
pp. 1 – 26

Abstract

Read online

Abstract Recently, the widespread adoption of artificial intelligence (AI) has given rise to a significant trust crisis, stemming from the persistent emergence of issues in practical applications. As a crucial component of AI, data has a profound impact on the trustworthiness of AI. Nevertheless, researchers have struggled with the challenge of rationally assessing data quality, primarily due to the scarcity of versatile and effective evaluation methods. To address this trouble, a multi-dimensional hierarchical evaluation system (MDHES) is proposed to estimate the data quality. Initially, multiple key dimensions are devised to evaluate specific data conditions separately by the calculation of individual scores. Then, the strengths and weaknesses among various dimensions can be provided a clearer understanding. Furthermore, a comprehensive evaluation method, incorporating a fuzzy evaluation model, is developed to synthetically evaluate the data quality. Then, this evaluation method can achieve a dynamic balance, and meanwhile achieve a harmonious integration of subjectivity and objectivity criteria to ensure a more precise assessment result. Finally, rigorous experiment verification and comparison in both benchmark problems and real-world applications demonstrate the effectiveness of the proposed MDHES, which can accurately assess data quality to provide a strong data support for the development of trustworthy AI.

Keywords