simcausal R Package: Conducting Transparent and Reproducible Simulation Studies of Causal Effect Estimation with Complex Longitudinal Data

The simcausal R package is a tool for specification and simulation of complex longitudinal data structures that are based on non-parametric structural equation models. The package aims to provide a flexible tool for simplifying the conduct of transparent and reproducible simulation studies, with a p...

Full description

Bibliographic Details
Main Authors: Oleg Sofrygin, Mark J. van der Laan, Romain Neugebauer
Format: Article
Language:English
Published: Foundation for Open Access Statistics 2017-10-01
Series:Journal of Statistical Software
Subjects:
R
Online Access:https://www.jstatsoft.org/index.php/jss/article/view/3285
id doaj-f01162da53c94a0a8267b371095a081c
record_format Article
spelling doaj-f01162da53c94a0a8267b371095a081c2020-11-24T20:45:32ZengFoundation for Open Access StatisticsJournal of Statistical Software1548-76602017-10-0181114710.18637/jss.v081.i021149simcausal R Package: Conducting Transparent and Reproducible Simulation Studies of Causal Effect Estimation with Complex Longitudinal DataOleg SofryginMark J. van der LaanRomain NeugebauerThe simcausal R package is a tool for specification and simulation of complex longitudinal data structures that are based on non-parametric structural equation models. The package aims to provide a flexible tool for simplifying the conduct of transparent and reproducible simulation studies, with a particular emphasis on the types of data and interventions frequently encountered in real-world causal inference problems, such as, observational data with time-dependent confounding, selection bias, and random monitoring processes. The package interface allows for concise expression of complex functional dependencies between a large number of nodes, where each node may represent a measurement at a specific time point. The package allows for specification and simulation of counterfactual data under various user-specified interventions (e.g., static, dynamic, deterministic, or stochastic). In particular, the interventions may represent exposures to treatment regimens, the occurrence or non-occurrence of right-censoring events, or of clinical monitoring events. Finally, the package enables the computation of a selected set of user-specified features of the distribution of the counterfactual data that represent common causal quantities of interest, such as, treatment-specific means, the average treatment effects and coefficients from working marginal structural models. The applicability of simcausal is demonstrated by replicating the results of two published simulation studies.https://www.jstatsoft.org/index.php/jss/article/view/3285causal inferencesimulationmarginal structural modelstructural equation modeldirected acyclic graphcausal modelR
collection DOAJ
language English
format Article
sources DOAJ
author Oleg Sofrygin
Mark J. van der Laan
Romain Neugebauer
spellingShingle Oleg Sofrygin
Mark J. van der Laan
Romain Neugebauer
simcausal R Package: Conducting Transparent and Reproducible Simulation Studies of Causal Effect Estimation with Complex Longitudinal Data
Journal of Statistical Software
causal inference
simulation
marginal structural model
structural equation model
directed acyclic graph
causal model
R
author_facet Oleg Sofrygin
Mark J. van der Laan
Romain Neugebauer
author_sort Oleg Sofrygin
title simcausal R Package: Conducting Transparent and Reproducible Simulation Studies of Causal Effect Estimation with Complex Longitudinal Data
title_short simcausal R Package: Conducting Transparent and Reproducible Simulation Studies of Causal Effect Estimation with Complex Longitudinal Data
title_full simcausal R Package: Conducting Transparent and Reproducible Simulation Studies of Causal Effect Estimation with Complex Longitudinal Data
title_fullStr simcausal R Package: Conducting Transparent and Reproducible Simulation Studies of Causal Effect Estimation with Complex Longitudinal Data
title_full_unstemmed simcausal R Package: Conducting Transparent and Reproducible Simulation Studies of Causal Effect Estimation with Complex Longitudinal Data
title_sort simcausal r package: conducting transparent and reproducible simulation studies of causal effect estimation with complex longitudinal data
publisher Foundation for Open Access Statistics
series Journal of Statistical Software
issn 1548-7660
publishDate 2017-10-01
description The simcausal R package is a tool for specification and simulation of complex longitudinal data structures that are based on non-parametric structural equation models. The package aims to provide a flexible tool for simplifying the conduct of transparent and reproducible simulation studies, with a particular emphasis on the types of data and interventions frequently encountered in real-world causal inference problems, such as, observational data with time-dependent confounding, selection bias, and random monitoring processes. The package interface allows for concise expression of complex functional dependencies between a large number of nodes, where each node may represent a measurement at a specific time point. The package allows for specification and simulation of counterfactual data under various user-specified interventions (e.g., static, dynamic, deterministic, or stochastic). In particular, the interventions may represent exposures to treatment regimens, the occurrence or non-occurrence of right-censoring events, or of clinical monitoring events. Finally, the package enables the computation of a selected set of user-specified features of the distribution of the counterfactual data that represent common causal quantities of interest, such as, treatment-specific means, the average treatment effects and coefficients from working marginal structural models. The applicability of simcausal is demonstrated by replicating the results of two published simulation studies.
topic causal inference
simulation
marginal structural model
structural equation model
directed acyclic graph
causal model
R
url https://www.jstatsoft.org/index.php/jss/article/view/3285
work_keys_str_mv AT olegsofrygin simcausalrpackageconductingtransparentandreproduciblesimulationstudiesofcausaleffectestimationwithcomplexlongitudinaldata
AT markjvanderlaan simcausalrpackageconductingtransparentandreproduciblesimulationstudiesofcausaleffectestimationwithcomplexlongitudinaldata
AT romainneugebauer simcausalrpackageconductingtransparentandreproduciblesimulationstudiesofcausaleffectestimationwithcomplexlongitudinaldata
_version_ 1716814420818001920