Semi-Parametric Estimation and Inference for the Mean Outcome of the Single Time-Point Intervention in a Causally Connected Population

We study the framework for semi-parametric estimation and statistical inference for the sample average treatment-specific mean effects in observational settings where data are collected on a single network of possibly dependent units (e.g., in the presence of interference or spillover). Despite rece...

Full description

Bibliographic Details
Main Authors: Sofrygin Oleg, van der Laan Mark J.
Format: Article
Language:English
Published: De Gruyter 2016-11-01
Series:Journal of Causal Inference
Subjects:
Online Access:https://doi.org/10.1515/jci-2016-0003
id doaj-8fccb86e022b4ec9a56f2d6dfac8fa37
record_format Article
spelling doaj-8fccb86e022b4ec9a56f2d6dfac8fa372021-09-06T19:40:28ZengDe GruyterJournal of Causal Inference2193-36772193-36852016-11-015146548010.1515/jci-2016-0003Semi-Parametric Estimation and Inference for the Mean Outcome of the Single Time-Point Intervention in a Causally Connected PopulationSofrygin Oleg0van der Laan Mark J.1Department of Biostatistics, University of California Berkeley, 101 Haviland Hall, Berkeley, CA 94720, USADepartment of Biostatistics, University of California Berkeley, 101 Haviland Hall, Berkeley, CA 94720, USAWe study the framework for semi-parametric estimation and statistical inference for the sample average treatment-specific mean effects in observational settings where data are collected on a single network of possibly dependent units (e.g., in the presence of interference or spillover). Despite recent advances, many of the current statistical methods rely on estimation techniques that assume a particular parametric model for the outcome, even though some of the important statistical assumptions required by these methods are often violated in observational network settings. In this work we rely on recent methodological advances in the field of targeted maximum likelihood estimation (TMLE) and describe an estimation approach that permits for more realistic classes of data-generative models while providing valid inference in the context of observational network-dependent data. We start by assuming that the true data-generating distribution belongs to a large class of semi-parametric statistical models. We then impose some restrictions on the possible set of such distributions. For example, we assume that the dependence among the observed outcomes can be fully described by an observed network. We then show that under our modeling assumptions, our estimand can be described as a functional of the mixture of the observed data-generating distribution. With this key insight in mind, we describe the TMLE for possibly-dependent units as an iid data algorithm and we demonstrate the validity of our approach with a simulation study. Finally, we extend prior work towards estimation of novel causal parameters such as the unit-specific indirect and direct treatment effects under interference and the effects of interventions that modify the structure of the network.https://doi.org/10.1515/jci-2016-0003networksdependent datasemi-parametric estimationtmleinterference
collection DOAJ
language English
format Article
sources DOAJ
author Sofrygin Oleg
van der Laan Mark J.
spellingShingle Sofrygin Oleg
van der Laan Mark J.
Semi-Parametric Estimation and Inference for the Mean Outcome of the Single Time-Point Intervention in a Causally Connected Population
Journal of Causal Inference
networks
dependent data
semi-parametric estimation
tmle
interference
author_facet Sofrygin Oleg
van der Laan Mark J.
author_sort Sofrygin Oleg
title Semi-Parametric Estimation and Inference for the Mean Outcome of the Single Time-Point Intervention in a Causally Connected Population
title_short Semi-Parametric Estimation and Inference for the Mean Outcome of the Single Time-Point Intervention in a Causally Connected Population
title_full Semi-Parametric Estimation and Inference for the Mean Outcome of the Single Time-Point Intervention in a Causally Connected Population
title_fullStr Semi-Parametric Estimation and Inference for the Mean Outcome of the Single Time-Point Intervention in a Causally Connected Population
title_full_unstemmed Semi-Parametric Estimation and Inference for the Mean Outcome of the Single Time-Point Intervention in a Causally Connected Population
title_sort semi-parametric estimation and inference for the mean outcome of the single time-point intervention in a causally connected population
publisher De Gruyter
series Journal of Causal Inference
issn 2193-3677
2193-3685
publishDate 2016-11-01
description We study the framework for semi-parametric estimation and statistical inference for the sample average treatment-specific mean effects in observational settings where data are collected on a single network of possibly dependent units (e.g., in the presence of interference or spillover). Despite recent advances, many of the current statistical methods rely on estimation techniques that assume a particular parametric model for the outcome, even though some of the important statistical assumptions required by these methods are often violated in observational network settings. In this work we rely on recent methodological advances in the field of targeted maximum likelihood estimation (TMLE) and describe an estimation approach that permits for more realistic classes of data-generative models while providing valid inference in the context of observational network-dependent data. We start by assuming that the true data-generating distribution belongs to a large class of semi-parametric statistical models. We then impose some restrictions on the possible set of such distributions. For example, we assume that the dependence among the observed outcomes can be fully described by an observed network. We then show that under our modeling assumptions, our estimand can be described as a functional of the mixture of the observed data-generating distribution. With this key insight in mind, we describe the TMLE for possibly-dependent units as an iid data algorithm and we demonstrate the validity of our approach with a simulation study. Finally, we extend prior work towards estimation of novel causal parameters such as the unit-specific indirect and direct treatment effects under interference and the effects of interventions that modify the structure of the network.
topic networks
dependent data
semi-parametric estimation
tmle
interference
url https://doi.org/10.1515/jci-2016-0003
work_keys_str_mv AT sofryginoleg semiparametricestimationandinferenceforthemeanoutcomeofthesingletimepointinterventioninacausallyconnectedpopulation
AT vanderlaanmarkj semiparametricestimationandinferenceforthemeanoutcomeofthesingletimepointinterventioninacausallyconnectedpopulation
_version_ 1717768437065842688