Random Forests for Regression as a Weighted Sum of <inline-formula> <tex-math notation="LaTeX">${k}$ </tex-math></inline-formula>-Potential Nearest Neighbors

Pablo Fernandez-Gonzalez; Concha Bielza; Pedro Larranaga

doi:10.1109/access.2019.2900755

IEEE Access (Jan 2019)

Random Forests for Regression as a Weighted Sum of <inline-formula> <tex-math notation="LaTeX">${k}$ </tex-math></inline-formula>-Potential Nearest Neighbors

Pablo Fernandez-Gonzalez,
Concha Bielza,
Pedro Larranaga

Affiliations

Pablo Fernandez-Gonzalez: ORCiD; Technical University of Madrid, Madrid, Spain
Concha Bielza: Technical University of Madrid, Madrid, Spain
Pedro Larranaga: Technical University of Madrid, Madrid, Spain

DOI: https://doi.org/10.1109/access.2019.2900755
Journal volume & issue: Vol. 7
pp. 25660 – 25672

Abstract

Read online

In this paper, we tackle the problem of random forests for regression expressed as weighted sums of datapoints. We study the theoretical behavior of k-potential nearest neighbors (k-PNNs) under bagging and obtain an upper bound on the weights of a datapoint for random forests with any type of splitting criterion, provided that we use unpruned trees that stop growing only when there are k or less datapoints at their leaves. Moreover, we use the previous bound together with the concept of b-terms (i.e., bootstrap terms) introduced in this paper, to derive the explicit expression of weights for datapoints in a random (k-PNNs) selection setting, a datapoint selection strategy that we also introduce and to build a framework to derive other bagged estimators using a similar procedure. Finally, we derive from our framework the explicit expression of weights of a regression estimate equivalent to a random forest regression estimate with the random splitting criterion and demonstrate its equivalence both theoretically and practically.

Published in IEEE Access

ISSN: 2169-3536 (Online)
Publisher: IEEE
Country of publisher: United States
LCC subjects: Technology: Electrical engineering. Electronics. Nuclear engineering
Website: https://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639

About the journal

Abstract

Keywords