Performance of approaches relying on multidimensional intermediary data to decipher causal relationships between the exposome and health: A simulation study under various causal structures
Challenges in the assessment of the health effects of the exposome, defined as encompassing all environmental exposures from the prenatal period onwards, include a possibly high rate of false positive signals. It might be overcome using data dimension reduction techniques. Data from the biological l...
Main Authors: | , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Elsevier
2021-08-01
|
Series: | Environment International |
Subjects: | |
Online Access: | http://www.sciencedirect.com/science/article/pii/S0160412021001343 |
id |
doaj-11706fd5d85e46eb85217f9112f34854 |
---|---|
record_format |
Article |
spelling |
doaj-11706fd5d85e46eb85217f9112f348542021-05-22T04:35:14ZengElsevierEnvironment International0160-41202021-08-01153106509Performance of approaches relying on multidimensional intermediary data to decipher causal relationships between the exposome and health: A simulation study under various causal structuresSolène Cadiou0Xavier Basagaña1Juan R. Gonzalez2Johanna Lepeule3Martine Vrijheid4Valérie Siroux5Rémy Slama6Team of Environmental Epidemiology, IAB, Institute for Advanced Biosciences, Inserm, CNRS, CHU-Grenoble-Alpes, University Grenoble-Alpes, Grenoble, FranceISGlobal, Barcelona Institute for Global Health, Barcelona, Spain; Universitat Pompeu Fabra (UPF), Barcelona, Spain; CIBER Epidemiología y Salud Pública (CIBERESP), SpainISGlobal, Barcelona Institute for Global Health, Barcelona, Spain; Universitat Pompeu Fabra (UPF), Barcelona, Spain; CIBER Epidemiología y Salud Pública (CIBERESP), SpainTeam of Environmental Epidemiology, IAB, Institute for Advanced Biosciences, Inserm, CNRS, CHU-Grenoble-Alpes, University Grenoble-Alpes, Grenoble, FranceISGlobal, Barcelona Institute for Global Health, Barcelona, Spain; Universitat Pompeu Fabra (UPF), Barcelona, Spain; CIBER Epidemiología y Salud Pública (CIBERESP), SpainTeam of Environmental Epidemiology, IAB, Institute for Advanced Biosciences, Inserm, CNRS, CHU-Grenoble-Alpes, University Grenoble-Alpes, Grenoble, FranceTeam of Environmental Epidemiology, IAB, Institute for Advanced Biosciences, Inserm, CNRS, CHU-Grenoble-Alpes, University Grenoble-Alpes, Grenoble, France; Corresponding author at: Team of environmental epidemiology, IAB, Institute for Advanced Biosciences, Inserm, CNRS, CHU-Grenoble-Alpes, University Grenoble-Alpes, Allée de Alpes, Grenoble, France.Challenges in the assessment of the health effects of the exposome, defined as encompassing all environmental exposures from the prenatal period onwards, include a possibly high rate of false positive signals. It might be overcome using data dimension reduction techniques. Data from the biological layers lying between the exposome and its possible health consequences, such as the methylome, may help reducing exposome dimension. We aimed to quantify the performances of approaches relying on the incorporation of an intermediary biological layer to relate the exposome and health, and compare them with agnostic approaches ignoring the intermediary layer. We performed a Monte-Carlo simulation, in which we generated realistic exposome and intermediary layer data by sampling with replacement real data from the Helix exposome project. We generated a Gaussian outcome assuming linear relationships between the three data layers, in 2381 scenarios under five different causal structures, including mediation and reverse causality. We tested 3 agnostic methods considering only the exposome and the health outcome: ExWAS (for Exposome-Wide Association study), DSA, LASSO; and 3 methods relying on an intermediary layer: two implementations of our new oriented Meet-in-the-Middle (oMITM) design, using ExWAS and DSA, and a mediation analysis using ExWAS. Methods’ performances were assessed through their sensitivity and FDP (False-Discovery Proportion). The oMITM-based methods generally had lower FDP than the other approaches, possibly at a cost in terms of sensitivity; FDP was in particular lower under a structure of reverse causality and in some mediation scenarios. The oMITM–DSA implementation showed better performances than oMITM–ExWAS, especially in terms of FDP. Among the agnostic approaches, DSA showed the highest performance. Integrating information from intermediary biological layers can help lowering FDP in studies of the exposome health effects; in particular, oMITM seems less sensitive to reverse causality than agnostic exposome-health association studies.http://www.sciencedirect.com/science/article/pii/S0160412021001343ExposomeVariable selectionMultilayerOmicsSpecificitySensitivity |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Solène Cadiou Xavier Basagaña Juan R. Gonzalez Johanna Lepeule Martine Vrijheid Valérie Siroux Rémy Slama |
spellingShingle |
Solène Cadiou Xavier Basagaña Juan R. Gonzalez Johanna Lepeule Martine Vrijheid Valérie Siroux Rémy Slama Performance of approaches relying on multidimensional intermediary data to decipher causal relationships between the exposome and health: A simulation study under various causal structures Environment International Exposome Variable selection Multilayer Omics Specificity Sensitivity |
author_facet |
Solène Cadiou Xavier Basagaña Juan R. Gonzalez Johanna Lepeule Martine Vrijheid Valérie Siroux Rémy Slama |
author_sort |
Solène Cadiou |
title |
Performance of approaches relying on multidimensional intermediary data to decipher causal relationships between the exposome and health: A simulation study under various causal structures |
title_short |
Performance of approaches relying on multidimensional intermediary data to decipher causal relationships between the exposome and health: A simulation study under various causal structures |
title_full |
Performance of approaches relying on multidimensional intermediary data to decipher causal relationships between the exposome and health: A simulation study under various causal structures |
title_fullStr |
Performance of approaches relying on multidimensional intermediary data to decipher causal relationships between the exposome and health: A simulation study under various causal structures |
title_full_unstemmed |
Performance of approaches relying on multidimensional intermediary data to decipher causal relationships between the exposome and health: A simulation study under various causal structures |
title_sort |
performance of approaches relying on multidimensional intermediary data to decipher causal relationships between the exposome and health: a simulation study under various causal structures |
publisher |
Elsevier |
series |
Environment International |
issn |
0160-4120 |
publishDate |
2021-08-01 |
description |
Challenges in the assessment of the health effects of the exposome, defined as encompassing all environmental exposures from the prenatal period onwards, include a possibly high rate of false positive signals. It might be overcome using data dimension reduction techniques. Data from the biological layers lying between the exposome and its possible health consequences, such as the methylome, may help reducing exposome dimension. We aimed to quantify the performances of approaches relying on the incorporation of an intermediary biological layer to relate the exposome and health, and compare them with agnostic approaches ignoring the intermediary layer. We performed a Monte-Carlo simulation, in which we generated realistic exposome and intermediary layer data by sampling with replacement real data from the Helix exposome project. We generated a Gaussian outcome assuming linear relationships between the three data layers, in 2381 scenarios under five different causal structures, including mediation and reverse causality. We tested 3 agnostic methods considering only the exposome and the health outcome: ExWAS (for Exposome-Wide Association study), DSA, LASSO; and 3 methods relying on an intermediary layer: two implementations of our new oriented Meet-in-the-Middle (oMITM) design, using ExWAS and DSA, and a mediation analysis using ExWAS. Methods’ performances were assessed through their sensitivity and FDP (False-Discovery Proportion). The oMITM-based methods generally had lower FDP than the other approaches, possibly at a cost in terms of sensitivity; FDP was in particular lower under a structure of reverse causality and in some mediation scenarios. The oMITM–DSA implementation showed better performances than oMITM–ExWAS, especially in terms of FDP. Among the agnostic approaches, DSA showed the highest performance. Integrating information from intermediary biological layers can help lowering FDP in studies of the exposome health effects; in particular, oMITM seems less sensitive to reverse causality than agnostic exposome-health association studies. |
topic |
Exposome Variable selection Multilayer Omics Specificity Sensitivity |
url |
http://www.sciencedirect.com/science/article/pii/S0160412021001343 |
work_keys_str_mv |
AT solenecadiou performanceofapproachesrelyingonmultidimensionalintermediarydatatodeciphercausalrelationshipsbetweentheexposomeandhealthasimulationstudyundervariouscausalstructures AT xavierbasagana performanceofapproachesrelyingonmultidimensionalintermediarydatatodeciphercausalrelationshipsbetweentheexposomeandhealthasimulationstudyundervariouscausalstructures AT juanrgonzalez performanceofapproachesrelyingonmultidimensionalintermediarydatatodeciphercausalrelationshipsbetweentheexposomeandhealthasimulationstudyundervariouscausalstructures AT johannalepeule performanceofapproachesrelyingonmultidimensionalintermediarydatatodeciphercausalrelationshipsbetweentheexposomeandhealthasimulationstudyundervariouscausalstructures AT martinevrijheid performanceofapproachesrelyingonmultidimensionalintermediarydatatodeciphercausalrelationshipsbetweentheexposomeandhealthasimulationstudyundervariouscausalstructures AT valeriesiroux performanceofapproachesrelyingonmultidimensionalintermediarydatatodeciphercausalrelationshipsbetweentheexposomeandhealthasimulationstudyundervariouscausalstructures AT remyslama performanceofapproachesrelyingonmultidimensionalintermediarydatatodeciphercausalrelationshipsbetweentheexposomeandhealthasimulationstudyundervariouscausalstructures |
_version_ |
1721430841083559936 |