Performance of models for estimating absolute risk difference in multicenter trials with binary outcome

Abstract Background Reporting of absolute risk difference (RD) is recommended for clinical and epidemiological prospective studies. In analyses of multicenter studies, adjustment for center is necessary when randomization is stratified by center or when there is large variation in patients outcomes...

Full description

Bibliographic Details
Main Authors: Claudia Pedroza, Van Thi Thanh Truong
Format: Article
Language:English
Published: BMC 2016-08-01
Series:BMC Medical Research Methodology
Subjects:
Online Access:http://link.springer.com/article/10.1186/s12874-016-0217-0
id doaj-01dd77b93af24bf9ba3c4c90f22d0799
record_format Article
spelling doaj-01dd77b93af24bf9ba3c4c90f22d07992020-11-24T20:57:56ZengBMCBMC Medical Research Methodology1471-22882016-08-0116111210.1186/s12874-016-0217-0Performance of models for estimating absolute risk difference in multicenter trials with binary outcomeClaudia Pedroza0Van Thi Thanh Truong1Center for Clinical Research and Evidence-Based Medicine, McGovern Medical SchoolCenter for Clinical Research and Evidence-Based Medicine, McGovern Medical SchoolAbstract Background Reporting of absolute risk difference (RD) is recommended for clinical and epidemiological prospective studies. In analyses of multicenter studies, adjustment for center is necessary when randomization is stratified by center or when there is large variation in patients outcomes across centers. While regression methods are used to estimate RD adjusted for baseline predictors and clustering, no formal evaluation of their performance has been previously conducted. Methods We performed a simulation study to evaluate 6 regression methods fitted under a generalized estimating equation framework: binomial identity, Poisson identity, Normal identity, log binomial, log Poisson, and logistic regression model. We compared the model estimates to unadjusted estimates. We varied the true response function (identity or log), number of subjects per center, true risk difference, control outcome rate, effect of baseline predictor, and intracenter correlation. We compared the models in terms of convergence, absolute bias and coverage of 95 % confidence intervals for RD. Results The 6 models performed very similar to each other for the majority of scenarios. However, the log binomial model did not converge for a large portion of the scenarios including a baseline predictor. In scenarios with outcome rate close to the parameter boundary, the binomial and Poisson identity models had the best performance, but differences from other models were negligible. The unadjusted method introduced little bias to the RD estimates, but its coverage was larger than the nominal value in some scenarios with an identity response. Under the log response, coverage from the unadjusted method was well below the nominal value (<80 %) for some scenarios. Conclusions We recommend the use of a binomial or Poisson GEE model with identity link to estimate RD for correlated binary outcome data. If these models fail to run, then either a logistic regression, log Poisson regression, or linear regression GEE model can be used.http://link.springer.com/article/10.1186/s12874-016-0217-0Clustered dataCorrelated binary dataGeneralized estimating equationMulticenter trialRisk differenceRobust standard errors
collection DOAJ
language English
format Article
sources DOAJ
author Claudia Pedroza
Van Thi Thanh Truong
spellingShingle Claudia Pedroza
Van Thi Thanh Truong
Performance of models for estimating absolute risk difference in multicenter trials with binary outcome
BMC Medical Research Methodology
Clustered data
Correlated binary data
Generalized estimating equation
Multicenter trial
Risk difference
Robust standard errors
author_facet Claudia Pedroza
Van Thi Thanh Truong
author_sort Claudia Pedroza
title Performance of models for estimating absolute risk difference in multicenter trials with binary outcome
title_short Performance of models for estimating absolute risk difference in multicenter trials with binary outcome
title_full Performance of models for estimating absolute risk difference in multicenter trials with binary outcome
title_fullStr Performance of models for estimating absolute risk difference in multicenter trials with binary outcome
title_full_unstemmed Performance of models for estimating absolute risk difference in multicenter trials with binary outcome
title_sort performance of models for estimating absolute risk difference in multicenter trials with binary outcome
publisher BMC
series BMC Medical Research Methodology
issn 1471-2288
publishDate 2016-08-01
description Abstract Background Reporting of absolute risk difference (RD) is recommended for clinical and epidemiological prospective studies. In analyses of multicenter studies, adjustment for center is necessary when randomization is stratified by center or when there is large variation in patients outcomes across centers. While regression methods are used to estimate RD adjusted for baseline predictors and clustering, no formal evaluation of their performance has been previously conducted. Methods We performed a simulation study to evaluate 6 regression methods fitted under a generalized estimating equation framework: binomial identity, Poisson identity, Normal identity, log binomial, log Poisson, and logistic regression model. We compared the model estimates to unadjusted estimates. We varied the true response function (identity or log), number of subjects per center, true risk difference, control outcome rate, effect of baseline predictor, and intracenter correlation. We compared the models in terms of convergence, absolute bias and coverage of 95 % confidence intervals for RD. Results The 6 models performed very similar to each other for the majority of scenarios. However, the log binomial model did not converge for a large portion of the scenarios including a baseline predictor. In scenarios with outcome rate close to the parameter boundary, the binomial and Poisson identity models had the best performance, but differences from other models were negligible. The unadjusted method introduced little bias to the RD estimates, but its coverage was larger than the nominal value in some scenarios with an identity response. Under the log response, coverage from the unadjusted method was well below the nominal value (<80 %) for some scenarios. Conclusions We recommend the use of a binomial or Poisson GEE model with identity link to estimate RD for correlated binary outcome data. If these models fail to run, then either a logistic regression, log Poisson regression, or linear regression GEE model can be used.
topic Clustered data
Correlated binary data
Generalized estimating equation
Multicenter trial
Risk difference
Robust standard errors
url http://link.springer.com/article/10.1186/s12874-016-0217-0
work_keys_str_mv AT claudiapedroza performanceofmodelsforestimatingabsoluteriskdifferenceinmulticentertrialswithbinaryoutcome
AT vanthithanhtruong performanceofmodelsforestimatingabsoluteriskdifferenceinmulticentertrialswithbinaryoutcome
_version_ 1716787128973656064