Survey of AIGC Large Model Evaluation: Enabling Technologies, Vulnerabilities and Mitigation

XU Zhiwei, LI Hailong, LI Bo, LI Tao, WANG Jiatai, XIE Xueshuo, DONG Zehui

doi:10.3778/j.issn.1673-9418.2402023

Jisuanji kexue yu tansuo (Sep 2024)

Survey of AIGC Large Model Evaluation: Enabling Technologies, Vulnerabilities and Mitigation

XU Zhiwei, LI Hailong, LI Bo, LI Tao, WANG Jiatai, XIE Xueshuo, DONG Zehui

Affiliations

XU Zhiwei, LI Hailong, LI Bo, LI Tao, WANG Jiatai, XIE Xueshuo, DONG Zehui: 1. Haihe Laboratory of Information Application Innovation, Tianjin 300350, China 2. Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100080, China 3. College of Data Science and Application, Inner Mongolia University of Technology, Hohhot 010080, China 4. College of Computer Science, Nankai University, Tianjin 300350, China 5. OPPO Research Institute, Beijing 100026, China

DOI: https://doi.org/10.3778/j.issn.1673-9418.2402023
Journal volume & issue: Vol. 18, no. 9
pp. 2293 – 2325

Abstract

Read online

Artificial intelligence generated content (AIGC) models have attracted widespread attention and application worldwide due to their excellent content generation capabilities. However, the rapid development of AIGC large models also brings a series of hidden dangers, such as concerns about interpretability, fairness, security, and privacy preservation of model-generated content. In order to reduce the unknowable risks and their harms, it becomes more and more important to carry out a comprehensive measurement and evaluation of AIGC large models. Academics have initiated AIGC large model evaluation studies aiming to effectively address the related challenges and avoid potential risks. This paper summarizes and analyzes the AIGC large model evaluation studies. Firstly, an overview of the model evaluation process is provided, covering model evaluation pre-preparation and corresponding measurement indicators, and existing measurement benchmarks are systematically organized. Secondly, the representative applications of the AIGC large model in finance, politics and healthcare and their problems are discussed. Then, the measurement methods are studied in depth through different perspectives, such as interpretability, fairness, robustness, security and privacy, and the new issues that need to be paid attention to AIGC large model evaluation are deconstructed, and the ways to cope with the new challenges of large model evaluation are proposed. Finally, the future challenges of AIGC large model evaluation are discussed, and its future development direction is envisioned.

aigc large model; large model evaluation; interpretability; fairness; robustness; security and privacy protection

Published in Jisuanji kexue yu tansuo

ISSN: 1673-9418 (Print)
Publisher: Journal of Computer Engineering and Applications Beijing Co., Ltd., Science Press
Country of publisher: China
LCC subjects: Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: http://fcst.ceaj.org

About the journal

Abstract

Keywords