Journal of Biostatistics and Epidemiology (Oct 2020)

Review of Random Survival Forest method

  • Majid Rezaei,
  • Leili Tapak,
  • Masoomeh Alimohammadian,
  • Alireza Sadjadi,
  • Mehdi Yaseri

DOI
https://doi.org/10.18502/jbe.v6i1.4760
Journal volume & issue
Vol. 6, no. 1

Abstract

Read online

Background: Over the past years, there has been a great deal of interest in applying statistical machine learning methods to survival analysis. Ensemble-based methods, especially random survival forest, have been developed in various fields, especially medical sciences, due to their high accuracy and non-parametric nature and applicability in high-dimensional data sets. This paper aims to provide a methodological review and how to use random survival forests in the analysis of right-censored survival data. Method: We present a review article based on the latest research in the PubMed database on random survival forest model methodology. Results: This article begins with an introduction to tree-based methods, ensemble algorithms, and random forest (RF) method, followed by random survival forest framework, bootstrapped data and out-of-bag (OOB) ensemble estimators, review of performance evaluation indicators, how to select important variables, and other advanced topics of random survival forests for time-to-event data. Conclusion: When analyzing right-censored survival data with high-dimensional data, while the relationships between variables are complex and their interactions are taken into account, the nonparametric random survival forest (RSF) method determines important variables affecting survival times with high accuracy and speed and also does not need to test the restrictive assumptions.

Keywords