BayesRandomForest: An R implementation of Bayesian Random Forest for Regression Analysis of High-dimensional Data

Random Forest (RF) is a popular method for regression analysis of low or high-dimensional data. RF is often used with the later because it relaxes dimensionality assumption. RF major weakness lies in the fact that it is not governed by a statistical model, hence probabilistic interpretation of its p...

Full description

Bibliographic Details
Main Authors: Oyebayo Ridwan Olaniran, Mohd Asrul Affendi Bin Abdullah
Format: Article
Language:English
Published: Romanian National Institute of Statistics 2018-03-01
Series:Revista Română de Statistică
Subjects:
R
Online Access:http://www.revistadestatistica.ro/wp-content/uploads/2018/03/RRS_1_2018_A07.pdf
Description
Summary:Random Forest (RF) is a popular method for regression analysis of low or high-dimensional data. RF is often used with the later because it relaxes dimensionality assumption. RF major weakness lies in the fact that it is not governed by a statistical model, hence probabilistic interpretation of its prediction is not possible. RF major strengths are distribution free property and wide applicability to most real life problems. Bayesian Additive Regression Trees (BART) implemented in R via package BayesTree or bartMachine offers a bayesian interpretation to random forest but it suffers from high computational time as well as low efficiency when compared to RF in some specific situation. In this paper, we propose a new probabilistic interpretation to random forest called Bayesian Random Forest (BRF) for regression analysis of high-dimensional data. In addition, we present BRF implementation in R called BayesRandomForest. We also demonstrate the applicability of BRF using simulated dataset of varying dimensions. Results from the simulation experiment shows that BRF has improved efficiency over its competitors.
ISSN:1018-046X
1844-7694