On the Role of Training Data for SVM-Based Microwave Brain Stroke Detection and Classification
Tomas Pokorny,
Jan Vrba,
Ondrej Fiser,
David Vrba,
Tomas Drizdal,
Marek Novak,
Luca Tosi,
Alessandro Polo,
Marco Salucci
Affiliations
Tomas Pokorny
Department of Biomedical Technology, Faculty of Biomedical Engineering, Czech Technical University in Prague, 166 00 Prague, Czech Republic
Jan Vrba
Department of Biomedical Technology, Faculty of Biomedical Engineering, Czech Technical University in Prague, 166 00 Prague, Czech Republic
Ondrej Fiser
Department of Biomedical Technology, Faculty of Biomedical Engineering, Czech Technical University in Prague, 166 00 Prague, Czech Republic
David Vrba
Department of Biomedical Technology, Faculty of Biomedical Engineering, Czech Technical University in Prague, 166 00 Prague, Czech Republic
Tomas Drizdal
Department of Biomedical Technology, Faculty of Biomedical Engineering, Czech Technical University in Prague, 166 00 Prague, Czech Republic
Marek Novak
Department of Biomedical Technology, Faculty of Biomedical Engineering, Czech Technical University in Prague, 166 00 Prague, Czech Republic
Luca Tosi
ELEDIA Research Center (ELEDIA@UniTN—University of Trento), DICAM—Department of Civil, Environmental, and Mechanical Engineering, Via Mesiano 77, 38123 Trento, Italy
Alessandro Polo
ELEDIA Research Center (ELEDIA@UniTN—University of Trento), DICAM—Department of Civil, Environmental, and Mechanical Engineering, Via Mesiano 77, 38123 Trento, Italy
Marco Salucci
ELEDIA Research Center (ELEDIA@UniTN—University of Trento), DICAM—Department of Civil, Environmental, and Mechanical Engineering, Via Mesiano 77, 38123 Trento, Italy
The aim of this work was to test microwave brain stroke detection and classification using support vector machines (SVMs). We tested how the nature and variability of training data and system parameters impact the achieved classification accuracy. Using experimentally verified numerical models, a large database of synthetic training and test data was created. The models consist of an antenna array surrounding reconfigurable geometrically and dielectrically realistic human head phantoms with virtually inserted strokes of arbitrary size, and different dielectric parameters in different positions. The generated synthetic data sets were used to test four different hypotheses, regarding the appropriate parameters of the training dataset, the appropriate frequency range and the number of frequency points, as well as the level of subject variability to reach the highest SVM classification accuracy. The results indicate that the SVM algorithm is able to detect the presence of the stroke and classify it (i.e., ischemic or hemorrhagic) even when trained with single-frequency data. Moreover, it is shown that data of subjects with smaller strokes appear to be the most suitable for training accurate SVM predictors with high generalization capabilities. Finally, the datasets created for this study are made available to the community for testing and developing their own algorithms.