Training, testing and benchmarking medical AI models using Clinical AIBench

Yunyou Huang; Xiuxia Miao; Ruchang Zhang; Li Ma; Wenjing Liu; Fan Zhang; Xianglong Guan; Xiaoshuang Liang; Xiangjiang Lu; Suqing Tang; Zhifei Zhang

BenchCouncil Transactions on Benchmarks, Standards and Evaluations (Mar 2022)

Training, testing and benchmarking medical AI models using Clinical AIBench

Yunyou Huang,
Xiuxia Miao,
Ruchang Zhang,
Li Ma,
Wenjing Liu,
Fan Zhang,
Xianglong Guan,
Xiaoshuang Liang,
Xiangjiang Lu,
Suqing Tang,
Zhifei Zhang

Affiliations

Yunyou Huang: Guangxi Key Lab of Multi-Source Information Mining & Security, School of Computer Science and Engineering & School of Software, Guangxi Normal University, Guilin, China; International Open Benchmark Council, Beijing, China
Xiuxia Miao: Guangxi Key Lab of Multi-Source Information Mining & Security, School of Computer Science and Engineering & School of Software, Guangxi Normal University, Guilin, China
Ruchang Zhang: Guangxi Key Lab of Multi-Source Information Mining & Security, School of Computer Science and Engineering & School of Software, Guangxi Normal University, Guilin, China
Li Ma: Guilin Medical University, Guilin, China
Wenjing Liu: Guangxi Key Lab of Multi-Source Information Mining & Security, School of Computer Science and Engineering & School of Software, Guangxi Normal University, Guilin, China
Fan Zhang: State Key Laboratory of Computer Architecture, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China
Xianglong Guan: Guangxi Key Lab of Multi-Source Information Mining & Security, School of Computer Science and Engineering & School of Software, Guangxi Normal University, Guilin, China
Xiaoshuang Liang: Guangxi Key Lab of Multi-Source Information Mining & Security, School of Computer Science and Engineering & School of Software, Guangxi Normal University, Guilin, China
Xiangjiang Lu: Guangxi Key Lab of Multi-Source Information Mining & Security, School of Computer Science and Engineering & School of Software, Guangxi Normal University, Guilin, China
Suqing Tang: Faculty of Education, Guangxi Normal University, Guilin, China
Zhifei Zhang: Department of Physiology and Pathophysiology, Capital Medical University, Beijing, China; Corresponding author.

Journal volume & issue: Vol. 2, no. 1
p. 100037

Abstract

Read online

AI technology has been used in many clinical research fields, but most AI technologies are difficult to land in real-world clinical settings. In most current clinical AI research settings, the diagnosis task is to identify different types of diseases among the given ones. However, the diagnosis in real-world settings needs dynamically developing inspection strategies based on the existing resources of medical institutions and identifying different kinds of diseases out of many possibilities. To promote the development of different clinical AI technologies and the implementation of clinical applications, we propose a benchmark named Clinical AIBench for developing, verifying, and evaluating clinical AI technologies in real-world clinical settings. Specifically, Clinical AIBench can be used for: (1) Model training and testing: Researchers can use the data to train and test their models. (2)Model evaluation: Researchers can use Clinical AIBench to objectively, fairly, and comparably evaluate various models of different researchers. (3) Clinical value evaluation: Researchers can use the clinical indicators provided by Clinical AIBench to evaluate the clinical value of models, which will be applied in real-world clinical settings. For convenience, Clinical AIBench provides three different levels of clinical settings: restricted clinical setting, which is named closed clinical setting, data island clinical setting, and real-world clinical setting, which is called open clinical setting. In addition, Clinical AIBench covers three diseases: Alzheimer’s disease, COVID-19, and dental. Clinical AIBench provides python APIs to researchers. The data and source code are publicly available from the project website https://www.benchcouncil.org/clinical_aibench/.

Published in BenchCouncil Transactions on Benchmarks, Standards and Evaluations

ISSN: 2772-4859 (Online)
Publisher: KeAi Communications Co. Ltd.
Country of publisher: China
LCC subjects: Science; Technology: Engineering (General). Civil engineering (General)
Website: https://www.keaipublishing.com/en/journals/benchcouncil-transactions-on-benchmarks-standards-and-evaluations/

About the journal

Abstract

Keywords