ANALISIS PERBANDINGAN KINERJA CART KONVENSIONAL, BAGGING DAN RANDOM FOREST PADA KLASIFIKASI OBJEK:  HASIL DARI DUA SIMULASI

Yogo Aryo Jatmiko; Septiadi Padmadisastra; Anna Chadidjah

doi:10.14710/medstat.12.1.1-12

Media Statistika (Jul 2019)

ANALISIS PERBANDINGAN KINERJA CART KONVENSIONAL, BAGGING DAN RANDOM FOREST PADA KLASIFIKASI OBJEK: HASIL DARI DUA SIMULASI

Yogo Aryo Jatmiko,
Septiadi Padmadisastra,
Anna Chadidjah

Affiliations

Yogo Aryo Jatmiko: Badan Pusat Statistik
Septiadi Padmadisastra: Universitas Padjajaran
Anna Chadidjah: Universitas Padjajaran

DOI: https://doi.org/10.14710/medstat.12.1.1-12
Journal volume & issue: Vol. 12, no. 1
pp. 1 – 12

Abstract

Read online

The conventional CART method is a nonparametric classification method built on categorical response data. Bagging is one of the popular ensemble methods whereas, Random Forests (RF) is one of the relatively new ensemble methods in the decision tree that is the development of the Bagging method. Unlike Bagging, Random Forest was developed with the idea of adding layers to the random resampling process in bagging. Therefore, not only randomly sampled sample data to form a classification tree, but also independent variables are randomly selected and newly selected as the best divider when determining the sorting of trees, which is expected to produce more accurate predictions. Based on the above, the authors are interested to study the three methods by comparing the accuracy of classification on binary and non-binary simulation data to understand the effect of the number of sample sizes, the correlation between independent variables, the presence or absence of certain distribution patterns to the accuracy generated classification method. Results of the research on simulation data show that the Random Forest ensemble method can improve the accuracy of classification.

Published in Media Statistika

ISSN: 1979-3693 (Print); 2477-0647 (Online)
Publisher: Universitas Diponegoro
Country of publisher: Indonesia
LCC subjects: Science: Mathematics: Probabilities. Mathematical statistics
Website: https://ejournal.undip.ac.id/index.php/media_statistika/index

About the journal