Performance of models for estimating absolute risk difference in multicenter trials with binary outcome

Abstract Background Reporting of absolute risk difference (RD) is recommended for clinical and epidemiological prospective studies. In analyses of multicenter studies, adjustment for center is necessary when randomization is stratified by center or when there is large variation in patients outcomes...

Full description

Bibliographic Details
Main Authors:	Claudia Pedroza, Van Thi Thanh Truong
Format:	Article
Language:	English
Published:	BMC 2016-08-01
Series:	BMC Medical Research Methodology
Subjects:	Clustered data Correlated binary data Generalized estimating equation Multicenter trial Risk difference Robust standard errors
Online Access:	http://link.springer.com/article/10.1186/s12874-016-0217-0

id	doaj-01dd77b93af24bf9ba3c4c90f22d0799
record_format	Article
spelling	doaj-01dd77b93af24bf9ba3c4c90f22d07992020-11-24T20:57:56ZengBMCBMC Medical Research Methodology1471-22882016-08-0116111210.1186/s12874-016-0217-0Performance of models for estimating absolute risk difference in multicenter trials with binary outcomeClaudia Pedroza0Van Thi Thanh Truong1Center for Clinical Research and Evidence-Based Medicine, McGovern Medical SchoolCenter for Clinical Research and Evidence-Based Medicine, McGovern Medical SchoolAbstract Background Reporting of absolute risk difference (RD) is recommended for clinical and epidemiological prospective studies. In analyses of multicenter studies, adjustment for center is necessary when randomization is stratified by center or when there is large variation in patients outcomes across centers. While regression methods are used to estimate RD adjusted for baseline predictors and clustering, no formal evaluation of their performance has been previously conducted. Methods We performed a simulation study to evaluate 6 regression methods fitted under a generalized estimating equation framework: binomial identity, Poisson identity, Normal identity, log binomial, log Poisson, and logistic regression model. We compared the model estimates to unadjusted estimates. We varied the true response function (identity or log), number of subjects per center, true risk difference, control outcome rate, effect of baseline predictor, and intracenter correlation. We compared the models in terms of convergence, absolute bias and coverage of 95 % confidence intervals for RD. Results The 6 models performed very similar to each other for the majority of scenarios. However, the log binomial model did not converge for a large portion of the scenarios including a baseline predictor. In scenarios with outcome rate close to the parameter boundary, the binomial and Poisson identity models had the best performance, but differences from other models were negligible. The unadjusted method introduced little bias to the RD estimates, but its coverage was larger than the nominal value in some scenarios with an identity response. Under the log response, coverage from the unadjusted method was well below the nominal value (<80 %) for some scenarios. Conclusions We recommend the use of a binomial or Poisson GEE model with identity link to estimate RD for correlated binary outcome data. If these models fail to run, then either a logistic regression, log Poisson regression, or linear regression GEE model can be used.http://link.springer.com/article/10.1186/s12874-016-0217-0Clustered dataCorrelated binary dataGeneralized estimating equationMulticenter trialRisk differenceRobust standard errors
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Claudia Pedroza Van Thi Thanh Truong
spellingShingle	Claudia Pedroza Van Thi Thanh Truong Performance of models for estimating absolute risk difference in multicenter trials with binary outcome BMC Medical Research Methodology Clustered data Correlated binary data Generalized estimating equation Multicenter trial Risk difference Robust standard errors
author_facet	Claudia Pedroza Van Thi Thanh Truong
author_sort	Claudia Pedroza
title	Performance of models for estimating absolute risk difference in multicenter trials with binary outcome
title_short	Performance of models for estimating absolute risk difference in multicenter trials with binary outcome
title_full	Performance of models for estimating absolute risk difference in multicenter trials with binary outcome
title_fullStr	Performance of models for estimating absolute risk difference in multicenter trials with binary outcome
title_full_unstemmed	Performance of models for estimating absolute risk difference in multicenter trials with binary outcome
title_sort	performance of models for estimating absolute risk difference in multicenter trials with binary outcome
publisher	BMC
series	BMC Medical Research Methodology
issn	1471-2288
publishDate	2016-08-01
description	Abstract Background Reporting of absolute risk difference (RD) is recommended for clinical and epidemiological prospective studies. In analyses of multicenter studies, adjustment for center is necessary when randomization is stratified by center or when there is large variation in patients outcomes across centers. While regression methods are used to estimate RD adjusted for baseline predictors and clustering, no formal evaluation of their performance has been previously conducted. Methods We performed a simulation study to evaluate 6 regression methods fitted under a generalized estimating equation framework: binomial identity, Poisson identity, Normal identity, log binomial, log Poisson, and logistic regression model. We compared the model estimates to unadjusted estimates. We varied the true response function (identity or log), number of subjects per center, true risk difference, control outcome rate, effect of baseline predictor, and intracenter correlation. We compared the models in terms of convergence, absolute bias and coverage of 95 % confidence intervals for RD. Results The 6 models performed very similar to each other for the majority of scenarios. However, the log binomial model did not converge for a large portion of the scenarios including a baseline predictor. In scenarios with outcome rate close to the parameter boundary, the binomial and Poisson identity models had the best performance, but differences from other models were negligible. The unadjusted method introduced little bias to the RD estimates, but its coverage was larger than the nominal value in some scenarios with an identity response. Under the log response, coverage from the unadjusted method was well below the nominal value (<80 %) for some scenarios. Conclusions We recommend the use of a binomial or Poisson GEE model with identity link to estimate RD for correlated binary outcome data. If these models fail to run, then either a logistic regression, log Poisson regression, or linear regression GEE model can be used.
topic	Clustered data Correlated binary data Generalized estimating equation Multicenter trial Risk difference Robust standard errors
url	http://link.springer.com/article/10.1186/s12874-016-0217-0
work_keys_str_mv	AT claudiapedroza performanceofmodelsforestimatingabsoluteriskdifferenceinmulticentertrialswithbinaryoutcome AT vanthithanhtruong performanceofmodelsforestimatingabsoluteriskdifferenceinmulticentertrialswithbinaryoutcome
_version_	1716787128973656064

Performance of models for estimating absolute risk difference in multicenter trials with binary outcome

Similar Items