An Experimental Comparison of Feature-Selection and Classification Methods for Microarray Datasets

Nicole  Dalia Cilia; Claudio De Stefano; Francesco Fontanella; Stefano Raimondo; Alessandra Scotto di Freca

doi:10.3390/info10030109

Information (Mar 2019)

An Experimental Comparison of Feature-Selection and Classification Methods for Microarray Datasets

Nicole Dalia Cilia,
Claudio De Stefano,
Francesco Fontanella,
Stefano Raimondo,
Alessandra Scotto di Freca

Affiliations

Nicole Dalia Cilia: Department of Electrical and Information Engineering “Maurizio Scarano”, University of Cassino and Southern Lazio, 03043 Cassino (FR), Italy
Claudio De Stefano: Department of Electrical and Information Engineering “Maurizio Scarano”, University of Cassino and Southern Lazio, 03043 Cassino (FR), Italy
Francesco Fontanella: Department of Electrical and Information Engineering “Maurizio Scarano”, University of Cassino and Southern Lazio, 03043 Cassino (FR), Italy
Stefano Raimondo: Department of Electrical and Information Engineering “Maurizio Scarano”, University of Cassino and Southern Lazio, 03043 Cassino (FR), Italy
Alessandra Scotto di Freca: Department of Electrical and Information Engineering “Maurizio Scarano”, University of Cassino and Southern Lazio, 03043 Cassino (FR), Italy

DOI: https://doi.org/10.3390/info10030109
Journal volume & issue: Vol. 10, no. 3
p. 109

Abstract

Read online

In the last decade, there has been a growing scientific interest in the analysis of DNA microarray datasets, which have been widely used in basic and translational cancer research. The application fields include both the identification of oncological subjects, separating them from the healthy ones, and the classification of different types of cancer. Since DNA microarray experiments typically generate a very large number of features for a limited number of patients, the classification task is very complex and typically requires the application of a feature-selection process to reduce the complexity of the feature space and to identify a subset of distinctive features. In this framework, there are no standard state-of-the-art results generally accepted by the scientific community and, therefore, it is difficult to decide which approach to use for obtaining satisfactory results in the general case. Based on these considerations, the aim of the present work is to provide a large experimental comparison for evaluating the effect of the feature-selection process applied to different classification schemes. For comparison purposes, we considered both ranking-based feature-selection techniques and state-of-the-art feature-selection methods. The experiments provide a broad overview of the results obtainable on standard microarray datasets with different characteristics in terms of both the number of features and the number of patients.

Published in Information

ISSN: 2078-2489 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Technology (General): Industrial engineering. Management engineering: Information technology
Website: http://www.mdpi.com/journal/information/

About the journal

Abstract

Keywords