Two-step approach for assessing the health effects of environmental chemical mixtures: application to simulated datasets and real data from the Navajo Birth Cohort Study

Abstract Background There is increasing interest in examining the consequences of simultaneous exposures to chemical mixtures. However, a consensus or recommendations on how to appropriately select the statistical approach analyzing the health effects of mixture exposures which best aligns with stud...

Full description

Bibliographic Details
Main Authors: Li Luo, Laurie G. Hudson, Johnnye Lewis, Ji-Hyun Lee
Format: Article
Language:English
Published: BMC 2019-05-01
Series:Environmental Health
Subjects:
Online Access:http://link.springer.com/article/10.1186/s12940-019-0482-6
id doaj-285756645f434f978cd7a5723b722bd1
record_format Article
spelling doaj-285756645f434f978cd7a5723b722bd12020-11-25T02:04:33ZengBMCEnvironmental Health1476-069X2019-05-0118111610.1186/s12940-019-0482-6Two-step approach for assessing the health effects of environmental chemical mixtures: application to simulated datasets and real data from the Navajo Birth Cohort StudyLi Luo0Laurie G. Hudson1Johnnye Lewis2Ji-Hyun Lee3Department of Internal Medicine, MSC10-5550, 1 University of New MexicoDepartment of Pharmaceutical Sciences, College of Pharmacy, University of New MexicoCommunity Environmental Health Program, College of Pharmacy, University of New MexicoDepartment of Internal Medicine, MSC10-5550, 1 University of New MexicoAbstract Background There is increasing interest in examining the consequences of simultaneous exposures to chemical mixtures. However, a consensus or recommendations on how to appropriately select the statistical approach analyzing the health effects of mixture exposures which best aligns with study goals has not been well established. We recognize the limitations that existing methods have in effectively reducing data dimension and detecting interaction effects when analyzing chemical mixture exposures collected in high dimensional datasets with varying degrees of variable intercorrelations. In this research, we aim to examine the performance of a two-step statistical approach in addressing the analytical challenges of chemical mixture exposures using two simulated data sets, and an existing data set from the Navajo Birth Cohort Study as a representative case study. Methods We propose to use a two-step approach: a robust variable selection step using the random forest approach followed by adaptive lasso methods that incorporate both dimensionality reduction and quantification of the degree of association between the chemical exposures and the outcome of interest, including interaction terms. We compared the proposed method with other approaches including (1) single step adaptive lasso; and (2) two-step Classification and regression trees (CART) followed by adaptive lasso method. Results Utilizing simulated data sets and applying the method to a real-life dataset from the Navajo Birth Cohort Study, we have demonstrated good performance of the proposed two-step approach. Results from the simulation datasets indicated the effectiveness of variable dimension reduction and reliable identification of a parsimonious model compared to other methods: single-step adaptive lasso or two-step CART followed by adaptive lasso method. Conclusions Our proposed two-step approach provides a robust way of analyzing the effects of high-throughput chemical mixture exposures on health outcomes by combining the strengths of variable selection and adaptive shrinkage strategies.http://link.springer.com/article/10.1186/s12940-019-0482-6Chemical mixturesTwo-step approachRandom ForestAdaptive lasso
collection DOAJ
language English
format Article
sources DOAJ
author Li Luo
Laurie G. Hudson
Johnnye Lewis
Ji-Hyun Lee
spellingShingle Li Luo
Laurie G. Hudson
Johnnye Lewis
Ji-Hyun Lee
Two-step approach for assessing the health effects of environmental chemical mixtures: application to simulated datasets and real data from the Navajo Birth Cohort Study
Environmental Health
Chemical mixtures
Two-step approach
Random Forest
Adaptive lasso
author_facet Li Luo
Laurie G. Hudson
Johnnye Lewis
Ji-Hyun Lee
author_sort Li Luo
title Two-step approach for assessing the health effects of environmental chemical mixtures: application to simulated datasets and real data from the Navajo Birth Cohort Study
title_short Two-step approach for assessing the health effects of environmental chemical mixtures: application to simulated datasets and real data from the Navajo Birth Cohort Study
title_full Two-step approach for assessing the health effects of environmental chemical mixtures: application to simulated datasets and real data from the Navajo Birth Cohort Study
title_fullStr Two-step approach for assessing the health effects of environmental chemical mixtures: application to simulated datasets and real data from the Navajo Birth Cohort Study
title_full_unstemmed Two-step approach for assessing the health effects of environmental chemical mixtures: application to simulated datasets and real data from the Navajo Birth Cohort Study
title_sort two-step approach for assessing the health effects of environmental chemical mixtures: application to simulated datasets and real data from the navajo birth cohort study
publisher BMC
series Environmental Health
issn 1476-069X
publishDate 2019-05-01
description Abstract Background There is increasing interest in examining the consequences of simultaneous exposures to chemical mixtures. However, a consensus or recommendations on how to appropriately select the statistical approach analyzing the health effects of mixture exposures which best aligns with study goals has not been well established. We recognize the limitations that existing methods have in effectively reducing data dimension and detecting interaction effects when analyzing chemical mixture exposures collected in high dimensional datasets with varying degrees of variable intercorrelations. In this research, we aim to examine the performance of a two-step statistical approach in addressing the analytical challenges of chemical mixture exposures using two simulated data sets, and an existing data set from the Navajo Birth Cohort Study as a representative case study. Methods We propose to use a two-step approach: a robust variable selection step using the random forest approach followed by adaptive lasso methods that incorporate both dimensionality reduction and quantification of the degree of association between the chemical exposures and the outcome of interest, including interaction terms. We compared the proposed method with other approaches including (1) single step adaptive lasso; and (2) two-step Classification and regression trees (CART) followed by adaptive lasso method. Results Utilizing simulated data sets and applying the method to a real-life dataset from the Navajo Birth Cohort Study, we have demonstrated good performance of the proposed two-step approach. Results from the simulation datasets indicated the effectiveness of variable dimension reduction and reliable identification of a parsimonious model compared to other methods: single-step adaptive lasso or two-step CART followed by adaptive lasso method. Conclusions Our proposed two-step approach provides a robust way of analyzing the effects of high-throughput chemical mixture exposures on health outcomes by combining the strengths of variable selection and adaptive shrinkage strategies.
topic Chemical mixtures
Two-step approach
Random Forest
Adaptive lasso
url http://link.springer.com/article/10.1186/s12940-019-0482-6
work_keys_str_mv AT liluo twostepapproachforassessingthehealtheffectsofenvironmentalchemicalmixturesapplicationtosimulateddatasetsandrealdatafromthenavajobirthcohortstudy
AT laurieghudson twostepapproachforassessingthehealtheffectsofenvironmentalchemicalmixturesapplicationtosimulateddatasetsandrealdatafromthenavajobirthcohortstudy
AT johnnyelewis twostepapproachforassessingthehealtheffectsofenvironmentalchemicalmixturesapplicationtosimulateddatasetsandrealdatafromthenavajobirthcohortstudy
AT jihyunlee twostepapproachforassessingthehealtheffectsofenvironmentalchemicalmixturesapplicationtosimulateddatasetsandrealdatafromthenavajobirthcohortstudy
_version_ 1724942567486783488