A family of partial-linear single-index models for analyzing complex environmental exposures with continuous, categorical, time-to-event, and longitudinal health outcomes

Abstract Background Statistical methods to study the joint effects of environmental factors are of great importance to understand the impact of correlated exposures that may act synergistically or antagonistically on health outcomes. This study proposes a family of statistical models under a unified...

Full description

Bibliographic Details
Main Authors: Yuyan Wang, Yinxiang Wu, Melanie H. Jacobson, Myeonggyun Lee, Peng Jin, Leonardo Trasande, Mengling Liu
Format: Article
Language:English
Published: BMC 2020-09-01
Series:Environmental Health
Subjects:
Online Access:http://link.springer.com/article/10.1186/s12940-020-00644-4
id doaj-102a70f4376b49dabe0e37d559ab1548
record_format Article
spelling doaj-102a70f4376b49dabe0e37d559ab15482020-11-25T03:27:16ZengBMCEnvironmental Health1476-069X2020-09-0119111610.1186/s12940-020-00644-4A family of partial-linear single-index models for analyzing complex environmental exposures with continuous, categorical, time-to-event, and longitudinal health outcomesYuyan Wang0Yinxiang Wu1Melanie H. Jacobson2Myeonggyun Lee3Peng Jin4Leonardo Trasande5Mengling Liu6Department of Population Health, NYU Langone HealthDepartment of Population Health, NYU Langone HealthDepartment of Pediatrics, NYU Langone HealthDepartment of Population Health, NYU Langone HealthDepartment of Population Health, NYU Langone HealthDepartment of Population Health, NYU Langone HealthDepartment of Population Health, NYU Langone HealthAbstract Background Statistical methods to study the joint effects of environmental factors are of great importance to understand the impact of correlated exposures that may act synergistically or antagonistically on health outcomes. This study proposes a family of statistical models under a unified partial-linear single-index (PLSI) modeling framework, to assess the joint effects of environmental factors for continuous, categorical, time-to-event, and longitudinal outcomes. All PLSI models consist of a linear combination of exposures into a single index for practical interpretability of relative direction and importance, and a nonparametric link function for modeling flexibility. Methods We presented PLSI linear regression and PLSI quantile regression for continuous outcome, PLSI generalized linear regression for categorical outcome, PLSI proportional hazards model for time-to-event outcome, and PLSI mixed-effects model for longitudinal outcome. These models were demonstrated using a dataset of 800 subjects from NHANES 2003–2004 survey including 8 environmental factors. Serum triglyceride concentration was analyzed as a continuous outcome and then dichotomized as a binary outcome. Simulations were conducted to demonstrate the PLSI proportional hazards model and PLSI mixed-effects model. The performance of PLSI models was compared with their counterpart parametric models. Results PLSI linear, quantile, and logistic regressions showed similar results that the 8 environmental factors had both positive and negative associations with triglycerides, with a-Tocopherol having the most positive and trans-b-carotene having the most negative association. For the time-to-event and longitudinal settings, simulations showed that PLSI models could correctly identify directions and relative importance for the 8 environmental factors. Compared with parametric models, PLSI models got similar results when the link function was close to linear, but clearly outperformed in simulations with nonlinear effects. Conclusions We presented a unified family of PLSI models to assess the joint effects of exposures on four commonly-used types of outcomes in environmental research, and demonstrated their modeling flexibility and effectiveness, especially for studying environmental factors with mixed directional effects and/or nonlinear effects. Our study has expanded the analytical toolbox for investigating the complex effects of environmental factors. A practical contribution also included a coherent algorithm for all proposed PLSI models with R codes available.http://link.springer.com/article/10.1186/s12940-020-00644-4Environmental mixturesNHANESSemiparametric modelTriglyceride
collection DOAJ
language English
format Article
sources DOAJ
author Yuyan Wang
Yinxiang Wu
Melanie H. Jacobson
Myeonggyun Lee
Peng Jin
Leonardo Trasande
Mengling Liu
spellingShingle Yuyan Wang
Yinxiang Wu
Melanie H. Jacobson
Myeonggyun Lee
Peng Jin
Leonardo Trasande
Mengling Liu
A family of partial-linear single-index models for analyzing complex environmental exposures with continuous, categorical, time-to-event, and longitudinal health outcomes
Environmental Health
Environmental mixtures
NHANES
Semiparametric model
Triglyceride
author_facet Yuyan Wang
Yinxiang Wu
Melanie H. Jacobson
Myeonggyun Lee
Peng Jin
Leonardo Trasande
Mengling Liu
author_sort Yuyan Wang
title A family of partial-linear single-index models for analyzing complex environmental exposures with continuous, categorical, time-to-event, and longitudinal health outcomes
title_short A family of partial-linear single-index models for analyzing complex environmental exposures with continuous, categorical, time-to-event, and longitudinal health outcomes
title_full A family of partial-linear single-index models for analyzing complex environmental exposures with continuous, categorical, time-to-event, and longitudinal health outcomes
title_fullStr A family of partial-linear single-index models for analyzing complex environmental exposures with continuous, categorical, time-to-event, and longitudinal health outcomes
title_full_unstemmed A family of partial-linear single-index models for analyzing complex environmental exposures with continuous, categorical, time-to-event, and longitudinal health outcomes
title_sort family of partial-linear single-index models for analyzing complex environmental exposures with continuous, categorical, time-to-event, and longitudinal health outcomes
publisher BMC
series Environmental Health
issn 1476-069X
publishDate 2020-09-01
description Abstract Background Statistical methods to study the joint effects of environmental factors are of great importance to understand the impact of correlated exposures that may act synergistically or antagonistically on health outcomes. This study proposes a family of statistical models under a unified partial-linear single-index (PLSI) modeling framework, to assess the joint effects of environmental factors for continuous, categorical, time-to-event, and longitudinal outcomes. All PLSI models consist of a linear combination of exposures into a single index for practical interpretability of relative direction and importance, and a nonparametric link function for modeling flexibility. Methods We presented PLSI linear regression and PLSI quantile regression for continuous outcome, PLSI generalized linear regression for categorical outcome, PLSI proportional hazards model for time-to-event outcome, and PLSI mixed-effects model for longitudinal outcome. These models were demonstrated using a dataset of 800 subjects from NHANES 2003–2004 survey including 8 environmental factors. Serum triglyceride concentration was analyzed as a continuous outcome and then dichotomized as a binary outcome. Simulations were conducted to demonstrate the PLSI proportional hazards model and PLSI mixed-effects model. The performance of PLSI models was compared with their counterpart parametric models. Results PLSI linear, quantile, and logistic regressions showed similar results that the 8 environmental factors had both positive and negative associations with triglycerides, with a-Tocopherol having the most positive and trans-b-carotene having the most negative association. For the time-to-event and longitudinal settings, simulations showed that PLSI models could correctly identify directions and relative importance for the 8 environmental factors. Compared with parametric models, PLSI models got similar results when the link function was close to linear, but clearly outperformed in simulations with nonlinear effects. Conclusions We presented a unified family of PLSI models to assess the joint effects of exposures on four commonly-used types of outcomes in environmental research, and demonstrated their modeling flexibility and effectiveness, especially for studying environmental factors with mixed directional effects and/or nonlinear effects. Our study has expanded the analytical toolbox for investigating the complex effects of environmental factors. A practical contribution also included a coherent algorithm for all proposed PLSI models with R codes available.
topic Environmental mixtures
NHANES
Semiparametric model
Triglyceride
url http://link.springer.com/article/10.1186/s12940-020-00644-4
work_keys_str_mv AT yuyanwang afamilyofpartiallinearsingleindexmodelsforanalyzingcomplexenvironmentalexposureswithcontinuouscategoricaltimetoeventandlongitudinalhealthoutcomes
AT yinxiangwu afamilyofpartiallinearsingleindexmodelsforanalyzingcomplexenvironmentalexposureswithcontinuouscategoricaltimetoeventandlongitudinalhealthoutcomes
AT melaniehjacobson afamilyofpartiallinearsingleindexmodelsforanalyzingcomplexenvironmentalexposureswithcontinuouscategoricaltimetoeventandlongitudinalhealthoutcomes
AT myeonggyunlee afamilyofpartiallinearsingleindexmodelsforanalyzingcomplexenvironmentalexposureswithcontinuouscategoricaltimetoeventandlongitudinalhealthoutcomes
AT pengjin afamilyofpartiallinearsingleindexmodelsforanalyzingcomplexenvironmentalexposureswithcontinuouscategoricaltimetoeventandlongitudinalhealthoutcomes
AT leonardotrasande afamilyofpartiallinearsingleindexmodelsforanalyzingcomplexenvironmentalexposureswithcontinuouscategoricaltimetoeventandlongitudinalhealthoutcomes
AT menglingliu afamilyofpartiallinearsingleindexmodelsforanalyzingcomplexenvironmentalexposureswithcontinuouscategoricaltimetoeventandlongitudinalhealthoutcomes
AT yuyanwang familyofpartiallinearsingleindexmodelsforanalyzingcomplexenvironmentalexposureswithcontinuouscategoricaltimetoeventandlongitudinalhealthoutcomes
AT yinxiangwu familyofpartiallinearsingleindexmodelsforanalyzingcomplexenvironmentalexposureswithcontinuouscategoricaltimetoeventandlongitudinalhealthoutcomes
AT melaniehjacobson familyofpartiallinearsingleindexmodelsforanalyzingcomplexenvironmentalexposureswithcontinuouscategoricaltimetoeventandlongitudinalhealthoutcomes
AT myeonggyunlee familyofpartiallinearsingleindexmodelsforanalyzingcomplexenvironmentalexposureswithcontinuouscategoricaltimetoeventandlongitudinalhealthoutcomes
AT pengjin familyofpartiallinearsingleindexmodelsforanalyzingcomplexenvironmentalexposureswithcontinuouscategoricaltimetoeventandlongitudinalhealthoutcomes
AT leonardotrasande familyofpartiallinearsingleindexmodelsforanalyzingcomplexenvironmentalexposureswithcontinuouscategoricaltimetoeventandlongitudinalhealthoutcomes
AT menglingliu familyofpartiallinearsingleindexmodelsforanalyzingcomplexenvironmentalexposureswithcontinuouscategoricaltimetoeventandlongitudinalhealthoutcomes
_version_ 1724588700123267072