BayesRandomForest: An R implementation of Bayesian Random Forest for Regression Analysis of High-dimensional Data
Random Forest (RF) is a popular method for regression analysis of low or high-dimensional data. RF is often used with the later because it relaxes dimensionality assumption. RF major weakness lies in the fact that it is not governed by a statistical model, hence probabilistic interpretation of its p...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
Romanian National Institute of Statistics
2018-03-01
|
Series: | Revista Română de Statistică |
Subjects: | |
Online Access: | http://www.revistadestatistica.ro/wp-content/uploads/2018/03/RRS_1_2018_A07.pdf |
id |
doaj-ef0e806a29c548bebbd7fa1afe02475f |
---|---|
record_format |
Article |
spelling |
doaj-ef0e806a29c548bebbd7fa1afe02475f2020-11-24T21:52:09ZengRomanian National Institute of StatisticsRevista Română de Statistică1018-046X1844-76942018-03-0166195102BayesRandomForest: An R implementation of Bayesian Random Forest for Regression Analysis of High-dimensional DataOyebayo Ridwan Olaniran0Mohd Asrul Affendi Bin Abdullah1Universiti Tun Hussein Onn MalaysiaUniversiti Tun Hussein Onn MalaysiaRandom Forest (RF) is a popular method for regression analysis of low or high-dimensional data. RF is often used with the later because it relaxes dimensionality assumption. RF major weakness lies in the fact that it is not governed by a statistical model, hence probabilistic interpretation of its prediction is not possible. RF major strengths are distribution free property and wide applicability to most real life problems. Bayesian Additive Regression Trees (BART) implemented in R via package BayesTree or bartMachine offers a bayesian interpretation to random forest but it suffers from high computational time as well as low efficiency when compared to RF in some specific situation. In this paper, we propose a new probabilistic interpretation to random forest called Bayesian Random Forest (BRF) for regression analysis of high-dimensional data. In addition, we present BRF implementation in R called BayesRandomForest. We also demonstrate the applicability of BRF using simulated dataset of varying dimensions. Results from the simulation experiment shows that BRF has improved efficiency over its competitors.http://www.revistadestatistica.ro/wp-content/uploads/2018/03/RRS_1_2018_A07.pdfRandom ForestBayesian Additive Regression TreesHigh-dimensionalR |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Oyebayo Ridwan Olaniran Mohd Asrul Affendi Bin Abdullah |
spellingShingle |
Oyebayo Ridwan Olaniran Mohd Asrul Affendi Bin Abdullah BayesRandomForest: An R implementation of Bayesian Random Forest for Regression Analysis of High-dimensional Data Revista Română de Statistică Random Forest Bayesian Additive Regression Trees High-dimensional R |
author_facet |
Oyebayo Ridwan Olaniran Mohd Asrul Affendi Bin Abdullah |
author_sort |
Oyebayo Ridwan Olaniran |
title |
BayesRandomForest: An R implementation of Bayesian Random Forest for Regression Analysis of High-dimensional Data |
title_short |
BayesRandomForest: An R implementation of Bayesian Random Forest for Regression Analysis of High-dimensional Data |
title_full |
BayesRandomForest: An R implementation of Bayesian Random Forest for Regression Analysis of High-dimensional Data |
title_fullStr |
BayesRandomForest: An R implementation of Bayesian Random Forest for Regression Analysis of High-dimensional Data |
title_full_unstemmed |
BayesRandomForest: An R implementation of Bayesian Random Forest for Regression Analysis of High-dimensional Data |
title_sort |
bayesrandomforest: an r implementation of bayesian random forest for regression analysis of high-dimensional data |
publisher |
Romanian National Institute of Statistics |
series |
Revista Română de Statistică |
issn |
1018-046X 1844-7694 |
publishDate |
2018-03-01 |
description |
Random Forest (RF) is a popular method for regression analysis of low or high-dimensional data. RF is often used with the later because it relaxes dimensionality assumption. RF major weakness lies in the fact that it is not governed by a statistical model, hence probabilistic interpretation of its prediction is not possible. RF major strengths are distribution free property and wide applicability to most real life problems. Bayesian Additive Regression Trees (BART) implemented in R via package BayesTree or bartMachine offers a bayesian interpretation to random forest but it suffers from high computational time as well as low efficiency when compared to RF in some specific situation. In this paper, we propose a new probabilistic interpretation to random forest called Bayesian Random Forest (BRF) for regression analysis of high-dimensional data. In addition, we present BRF implementation in R called BayesRandomForest. We also demonstrate the applicability of BRF using simulated dataset of varying dimensions. Results from the simulation experiment shows that BRF has improved efficiency over its competitors. |
topic |
Random Forest Bayesian Additive Regression Trees High-dimensional R |
url |
http://www.revistadestatistica.ro/wp-content/uploads/2018/03/RRS_1_2018_A07.pdf |
work_keys_str_mv |
AT oyebayoridwanolaniran bayesrandomforestanrimplementationofbayesianrandomforestforregressionanalysisofhighdimensionaldata AT mohdasrulaffendibinabdullah bayesrandomforestanrimplementationofbayesianrandomforestforregressionanalysisofhighdimensionaldata |
_version_ |
1725876587677614080 |