The Empirical Comparison of Machine Learning Algorithm for the Class Imbalanced Problem in Conformational Epitope Prediction

Binti Solihah; Azhari Azhari; Aina Musdholifah

doi:10.30595/juita.v9i1.9969

Jurnal Informatika (May 2021)

The Empirical Comparison of Machine Learning Algorithm for the Class Imbalanced Problem in Conformational Epitope Prediction

Binti Solihah,
Azhari Azhari,
Aina Musdholifah

Affiliations

Binti Solihah: Jurusan Teknik Informatika, FTI, Universitas Trisakti
Azhari Azhari
Aina Musdholifah

DOI: https://doi.org/10.30595/juita.v9i1.9969
Journal volume & issue: Vol. 9, no. 1
pp. 131 – 138

Abstract

Read online

A conformational epitope is a part of a protein-based vaccine. It is challenging to identify using an experiment. A computational model is developed to support identification. However, the imbalance class is one of the constraints to achieving optimal performance on the conformational epitope B cell prediction. In this paper, we compare several conformational epitope B cell prediction models from non-ensemble and ensemble approaches. A sampling method from Random undersampling, SMOTE, and cluster-based undersampling is combined with a decision tree or SVM to build a non-ensemble model. A random forest model and several variants of the bagging method is used to construct the ensemble model. A 10-fold cross-validation method is used to validate the model. The experiment results show that the combination of the cluster-based under-sampling and decision tree outperformed the other sampling method when combined with the non-ensemble and the ensemble method. This study provides a baseline to improve existing models for dealing with the class imbalance in the conformational epitope prediction.

sampling-based method, class imbalance, conformational epitope, b-cell, machine learning-based

Published in Jurnal Informatika

ISSN: 2086-9398 (Print); 2579-8901 (Online)
Publisher: Universitas Muhammadiyah Purwokerto
Country of publisher: Indonesia
LCC subjects: Science: Mathematics: Instruments and machines: Electronic computers. Computer science
Website: http://jurnalnasional.ump.ac.id/index.php/JUITA/

About the journal

Abstract

Keywords