Methods of Plasticity Research, Department of Psychology, University of Zurich, Zurich, Switzerland; University Research Priority Program (URPP) Dynamics of Healthy Aging, Zurich, Switzerland; Neuroscience Center Zurich (ZNZ), University of Zurich and ETH Zurich, Zurich, Switzerland
Maurice Weber
Department of Computer Science, ETH Zurich, Zurich, Switzerland
Methods of Plasticity Research, Department of Psychology, University of Zurich, Zurich, Switzerland; University Research Priority Program (URPP) Dynamics of Healthy Aging, Zurich, Switzerland; Neuroscience Center Zurich (ZNZ), University of Zurich and ETH Zurich, Zurich, Switzerland
Dawid Strzelczyk
Methods of Plasticity Research, Department of Psychology, University of Zurich, Zurich, Switzerland; University Research Priority Program (URPP) Dynamics of Healthy Aging, Zurich, Switzerland; Neuroscience Center Zurich (ZNZ), University of Zurich and ETH Zurich, Zurich, Switzerland
Lukas Wolf
Methods of Plasticity Research, Department of Psychology, University of Zurich, Zurich, Switzerland
Andreas Pedroni
Methods of Plasticity Research, Department of Psychology, University of Zurich, Zurich, Switzerland; University Research Priority Program (URPP) Dynamics of Healthy Aging, Zurich, Switzerland; Neuroscience Center Zurich (ZNZ), University of Zurich and ETH Zurich, Zurich, Switzerland
Jonathan Heitz
Methods of Plasticity Research, Department of Psychology, University of Zurich, Zurich, Switzerland; Department of Computer Science, ETH Zurich, Zurich, Switzerland
Stephan Müller
Department of Computer Science, ETH Zurich, Zurich, Switzerland
Christoph Schultheiss
Department of Computer Science, ETH Zurich, Zurich, Switzerland
Marius Troendle
Methods of Plasticity Research, Department of Psychology, University of Zurich, Zurich, Switzerland; University Research Priority Program (URPP) Dynamics of Healthy Aging, Zurich, Switzerland; Neuroscience Center Zurich (ZNZ), University of Zurich and ETH Zurich, Zurich, Switzerland
Juan Carlos Arango Lasprilla
Virginia Commonwealth University, Richmond, United States
Department of Health Science, Public University of Navarre, Pamplona, Spain; Instituto de Investigación Sanitaria de Navarra (IdiSNA), Pamplona, Spain
Federica Scarpina
'Rita Levi Montalcini' Department of Neurosciences, University of Turin, Turin, Italy; IRCCS Istituto Auxologico Italiano, UO di Neurologia e Neuroriabilitazione, Ospedale San Giuseppe, Piancavallo, Italy
Qianhua Zhao
Huashan Hospital, Shanghai, China
Rico Leuthold
Smartcode, Zurich, Switzerland
Flavia Wehrle
University Children's Hospital Zurich, Child Development Center, Zurich, Switzerland
Oskar Jenni
University Children's Hospital Zurich, Child Development Center, Zurich, Switzerland
Memory deficits are a hallmark of many different neurological and psychiatric conditions. The Rey–Osterrieth complex figure (ROCF) is the state-of-the-art assessment tool for neuropsychologists across the globe to assess the degree of non-verbal visual memory deterioration. To obtain a score, a trained clinician inspects a patient’s ROCF drawing and quantifies deviations from the original figure. This manual procedure is time-consuming, slow and scores vary depending on the clinician’s experience, motivation, and tiredness. Here, we leverage novel deep learning architectures to automatize the rating of memory deficits. For this, we collected more than 20k hand-drawn ROCF drawings from patients with various neurological and psychiatric disorders as well as healthy participants. Unbiased ground truth ROCF scores were obtained from crowdsourced human intelligence. This dataset was used to train and evaluate a multihead convolutional neural network. The model performs highly unbiased as it yielded predictions very close to the ground truth and the error was similarly distributed around zero. The neural network outperforms both online raters and clinicians. The scoring system can reliably identify and accurately score individual figure elements in previously unseen ROCF drawings, which facilitates explainability of the AI-scoring system. To ensure generalizability and clinical utility, the model performance was successfully replicated in a large independent prospective validation study that was pre-registered prior to data collection. Our AI-powered scoring system provides healthcare institutions worldwide with a digital tool to assess objectively, reliably, and time-efficiently the performance in the ROCF test from hand-drawn images.