BayesRandomForest: An R implementation of Bayesian Random Forest for Regression Analysis of High-dimensional Data

Random Forest (RF) is a popular method for regression analysis of low or high-dimensional data. RF is often used with the later because it relaxes dimensionality assumption. RF major weakness lies in the fact that it is not governed by a statistical model, hence probabilistic interpretation of its p...

Full description

Bibliographic Details
Main Authors: Oyebayo Ridwan Olaniran, Mohd Asrul Affendi Bin Abdullah
Format: Article
Language:English
Published: Romanian National Institute of Statistics 2018-03-01
Series:Revista Română de Statistică
Subjects:
R
Online Access:http://www.revistadestatistica.ro/wp-content/uploads/2018/03/RRS_1_2018_A07.pdf
id doaj-ef0e806a29c548bebbd7fa1afe02475f
record_format Article
spelling doaj-ef0e806a29c548bebbd7fa1afe02475f2020-11-24T21:52:09ZengRomanian National Institute of StatisticsRevista Română de Statistică1018-046X1844-76942018-03-0166195102BayesRandomForest: An R implementation of Bayesian Random Forest for Regression Analysis of High-dimensional DataOyebayo Ridwan Olaniran0Mohd Asrul Affendi Bin Abdullah1Universiti Tun Hussein Onn MalaysiaUniversiti Tun Hussein Onn MalaysiaRandom Forest (RF) is a popular method for regression analysis of low or high-dimensional data. RF is often used with the later because it relaxes dimensionality assumption. RF major weakness lies in the fact that it is not governed by a statistical model, hence probabilistic interpretation of its prediction is not possible. RF major strengths are distribution free property and wide applicability to most real life problems. Bayesian Additive Regression Trees (BART) implemented in R via package BayesTree or bartMachine offers a bayesian interpretation to random forest but it suffers from high computational time as well as low efficiency when compared to RF in some specific situation. In this paper, we propose a new probabilistic interpretation to random forest called Bayesian Random Forest (BRF) for regression analysis of high-dimensional data. In addition, we present BRF implementation in R called BayesRandomForest. We also demonstrate the applicability of BRF using simulated dataset of varying dimensions. Results from the simulation experiment shows that BRF has improved efficiency over its competitors.http://www.revistadestatistica.ro/wp-content/uploads/2018/03/RRS_1_2018_A07.pdfRandom ForestBayesian Additive Regression TreesHigh-dimensionalR
collection DOAJ
language English
format Article
sources DOAJ
author Oyebayo Ridwan Olaniran
Mohd Asrul Affendi Bin Abdullah
spellingShingle Oyebayo Ridwan Olaniran
Mohd Asrul Affendi Bin Abdullah
BayesRandomForest: An R implementation of Bayesian Random Forest for Regression Analysis of High-dimensional Data
Revista Română de Statistică
Random Forest
Bayesian Additive Regression Trees
High-dimensional
R
author_facet Oyebayo Ridwan Olaniran
Mohd Asrul Affendi Bin Abdullah
author_sort Oyebayo Ridwan Olaniran
title BayesRandomForest: An R implementation of Bayesian Random Forest for Regression Analysis of High-dimensional Data
title_short BayesRandomForest: An R implementation of Bayesian Random Forest for Regression Analysis of High-dimensional Data
title_full BayesRandomForest: An R implementation of Bayesian Random Forest for Regression Analysis of High-dimensional Data
title_fullStr BayesRandomForest: An R implementation of Bayesian Random Forest for Regression Analysis of High-dimensional Data
title_full_unstemmed BayesRandomForest: An R implementation of Bayesian Random Forest for Regression Analysis of High-dimensional Data
title_sort bayesrandomforest: an r implementation of bayesian random forest for regression analysis of high-dimensional data
publisher Romanian National Institute of Statistics
series Revista Română de Statistică
issn 1018-046X
1844-7694
publishDate 2018-03-01
description Random Forest (RF) is a popular method for regression analysis of low or high-dimensional data. RF is often used with the later because it relaxes dimensionality assumption. RF major weakness lies in the fact that it is not governed by a statistical model, hence probabilistic interpretation of its prediction is not possible. RF major strengths are distribution free property and wide applicability to most real life problems. Bayesian Additive Regression Trees (BART) implemented in R via package BayesTree or bartMachine offers a bayesian interpretation to random forest but it suffers from high computational time as well as low efficiency when compared to RF in some specific situation. In this paper, we propose a new probabilistic interpretation to random forest called Bayesian Random Forest (BRF) for regression analysis of high-dimensional data. In addition, we present BRF implementation in R called BayesRandomForest. We also demonstrate the applicability of BRF using simulated dataset of varying dimensions. Results from the simulation experiment shows that BRF has improved efficiency over its competitors.
topic Random Forest
Bayesian Additive Regression Trees
High-dimensional
R
url http://www.revistadestatistica.ro/wp-content/uploads/2018/03/RRS_1_2018_A07.pdf
work_keys_str_mv AT oyebayoridwanolaniran bayesrandomforestanrimplementationofbayesianrandomforestforregressionanalysisofhighdimensionaldata
AT mohdasrulaffendibinabdullah bayesrandomforestanrimplementationofbayesianrandomforestforregressionanalysisofhighdimensionaldata
_version_ 1725876587677614080