A Systematic Literature Review on AI Safety: Identifying Trends, Challenges, and Future Directions

Wissam Salhab; Darine Ameyed; Fehmi Jaafar; Hamid Mcheick

doi:10.1109/ACCESS.2024.3440647

IEEE Access (Jan 2024)

Affiliations

Wissam Salhab: Department of Computer Science and Mathematics, University of Québec at Chicoutimi, Chicoutimi, QC, Canada
Darine Ameyed: Department of Computer Science and Mathematics, University of Québec at Chicoutimi, Chicoutimi, QC, Canada
Fehmi Jaafar: Department of Computer Science and Mathematics, University of Québec at Chicoutimi, Chicoutimi, QC, Canada
Hamid Mcheick: Department of Computer Science and Mathematics, University of Québec at Chicoutimi, Chicoutimi, QC, Canada

Journal volume & issue: Vol. 12
pp. 131762 – 131784

Abstract

Artificial intelligence (AI) is revolutionizing many aspects of our lives, but it also raises fundamental safety and ethical issues. In this survey paper, we review the current state of research on safe and trustworthy AI and provide a structured, systematic overview of AI safety. We emphasize the importance of designing AI systems with a focus on safety across data management, model development, and deployment. We underscore the need for AI systems to align with human values and operate within established ethical frameworks. In addition, we note the need for a comprehensive safety framework that guides the development and implementation of AI systems, ensuring they do not inadvertently cause harm to humans. Our results show that AI safety is associated with model learning techniques, verification and validation methods, failure modes, and the management of AI autonomy. The main concerns discussed in the literature include explainability, interpretability, robustness, reliability, fairness, bias, and adversarial attacks.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639
