BayesRandomForest: An R implementation of Bayesian Random Forest for Regression Analysis of High-dimensional Data
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: | Romanian National Institute of Statistics, 2018-03-01 |
Series: | Revista Română de Statistică |
Online Access: | http://www.revistadestatistica.ro/wp-content/uploads/2018/03/RRS_1_2018_A07.pdf |
Summary: | Random Forest (RF) is a popular method for regression analysis of low- or high-dimensional data. RF is often used with the latter because it relaxes the dimensionality assumption. RF's major weakness is that it is not governed by a statistical model, so its predictions cannot be given a probabilistic interpretation. Its major strengths are its distribution-free property and wide applicability to most real-life problems. Bayesian Additive Regression Trees (BART), implemented in R via the packages BayesTree or bartMachine, offers a Bayesian interpretation of random forest, but it suffers from high computational time as well as low efficiency compared with RF in some specific situations. In this paper, we propose a new probabilistic interpretation of random forest, called Bayesian Random Forest (BRF), for regression analysis of high-dimensional data. In addition, we present an R implementation of BRF called BayesRandomForest. We also demonstrate the applicability of BRF using simulated datasets of varying dimensions. Results from the simulation experiment show that BRF has improved efficiency over its competitors. |
---|---|
ISSN: | 1018-046X 1844-7694 |