Applications of Bayesian shrinkage prior models in clinical research with categorical responses

Background: Prediction and classification algorithms are commonly used in clinical research for identifying patients susceptible to clinical conditions such as diabetes, colon cancer, and Alzheimer’s disease. Developing accurate prediction and classification methods benefits personalized medicine. B...

Full description

Bibliographic Details
Main Authors: Bhattacharyya, A. (Author), Mitra, R. (Author), Pal, S. (Author), Rai, S. (Author)
Format: Article
Language:English
Published: BioMed Central Ltd 2022
Subjects:
Online Access:View Fulltext in Publisher
LEADER 03260nam a2200289Ia 4500
001 10.1186-s12874-022-01560-6
008 220510s2022 CNT 000 0 und d
020 |a 14712288 (ISSN) 
245 1 0 |a Applications of Bayesian shrinkage prior models in clinical research with categorical responses 
260 0 |b BioMed Central Ltd  |c 2022 
856 |z View Fulltext in Publisher  |u https://doi.org/10.1186/s12874-022-01560-6 
520 3 |a Background: Prediction and classification algorithms are commonly used in clinical research for identifying patients susceptible to clinical conditions such as diabetes, colon cancer, and Alzheimer’s disease. Developing accurate prediction and classification methods benefits personalized medicine. Building an excellent predictive model involves selecting the features that are most significantly associated with the outcome. These features can include several biological and demographic characteristics, such as genomic biomarkers and health history. Such variable selection becomes challenging when the number of potential predictors is large. Bayesian shrinkage models have emerged as popular and flexible methods of variable selection in regression settings. This work discusses variable selection with three shrinkage priors and illustrates its application to clinical data such as Pima Indians Diabetes, Colon cancer, ADNI, and OASIS Alzheimer’s real-world data. Methods: A unified Bayesian hierarchical framework that implements and compares shrinkage priors in binary and multinomial logistic regression models is presented. The key feature is the representation of the likelihood by a Polya-Gamma data augmentation, which admits a natural integration with a family of shrinkage priors, specifically focusing on Horseshoe, Dirichlet Laplace, and Double Pareto priors. Extensive simulation studies are conducted to assess the performances under different data dimensions and parameter settings. Measures of accuracy, AUC, brier score, L1 error, cross-entropy, and ROC surface plots are used as evaluation criteria comparing the priors with frequentist methods as Lasso, Elastic-Net, and Ridge regression. Results: All three priors can be used for robust prediction on significant metrics, irrespective of their categorical response model choices. Simulation studies could achieve the mean prediction accuracy of 91.6% (95% CI: 88.5, 94.7) and 76.5% (95% CI: 69.3, 83.8) for logistic regression and multinomial logistic models, respectively. The model can identify significant variables for disease risk prediction and is computationally efficient. Conclusions: The models are robust enough to conduct both variable selection and prediction because of their high shrinkage properties and applicability to a broad range of classification problems. © 2022, The Author(s). 
650 0 4 |a ADNI 
650 0 4 |a Data augmentation 
650 0 4 |a Dirichlet Laplace 
650 0 4 |a Horseshoe 
650 0 4 |a Logistic regression 
650 0 4 |a MCMC 
650 0 4 |a Multinomial 
650 0 4 |a Pima 
650 0 4 |a Polya-Gamma 
650 0 4 |a Shrinkage priors 
700 1 |a Bhattacharyya, A.  |e author 
700 1 |a Mitra, R.  |e author 
700 1 |a Pal, S.  |e author 
700 1 |a Rai, S.  |e author 
773 |t BMC Medical Research Methodology