Modeling Community Health with Areal Data: Bayesian Inference with Survey Standard Errors and Spatial Structure

Epidemiologists and health geographers routinely use small-area survey estimates as covariates to model areal and even individual health outcomes. American Community Survey (ACS) estimates are accompanied by standard errors (SEs), but it is not yet standard practice to use them for evaluating or mod...

Full description

Bibliographic Details
Main Authors: Connor Donegan, Yongwan Chun, Daniel A. Griffith
Format: Article
Language:English
Published: MDPI AG 2021-06-01
Series:International Journal of Environmental Research and Public Health
Subjects:
Online Access:https://www.mdpi.com/1660-4601/18/13/6856
id doaj-a1a3e1a60b4344b994b8e851f4fa6c7d
record_format Article
spelling doaj-a1a3e1a60b4344b994b8e851f4fa6c7d2021-07-15T15:34:46ZengMDPI AGInternational Journal of Environmental Research and Public Health1661-78271660-46012021-06-01186856685610.3390/ijerph18136856Modeling Community Health with Areal Data: Bayesian Inference with Survey Standard Errors and Spatial StructureConnor Donegan0Yongwan Chun1Daniel A. Griffith2Geospatial Information Sciences, the University of Texas at Dallas, 800 W. Campbell Rd., Richardson, TX 75080-3021, USAGeospatial Information Sciences, the University of Texas at Dallas, 800 W. Campbell Rd., Richardson, TX 75080-3021, USAGeospatial Information Sciences, the University of Texas at Dallas, 800 W. Campbell Rd., Richardson, TX 75080-3021, USAEpidemiologists and health geographers routinely use small-area survey estimates as covariates to model areal and even individual health outcomes. American Community Survey (ACS) estimates are accompanied by standard errors (SEs), but it is not yet standard practice to use them for evaluating or modeling data reliability. ACS SEs vary systematically across regions, neighborhoods, socioeconomic characteristics, and variables. Failure to consider probable observational error may have substantial impact on the large bodies of literature relying on small-area estimates, including inferential biases and over-confidence in results. The issue is particularly salient for predictive models employed to prioritize communities for service provision or funding allocation. Leveraging the tenets of plausible reasoning and Bayes’ theorem, we propose a conceptual framework and workflow for spatial data analysis with areal survey data, including visual diagnostics and model specifications. To illustrate, we follow Krieger et al.’s (2018) call to routinely use the Index of Concentration at the Extremes (ICE) to monitor spatial inequalities in health and mortality. We construct and examine SEs for the ICE, use visual diagnostics to evaluate our observational error model for the ICE, and then estimate an ICE–mortality gradient by incorporating the latter model into our model of sex-specific, midlife (ages 55–64), all-cause United States county mortality rates. We urge researchers to consider data quality as a criterion for variable selection prior to modeling, and to incorporate data reliability information into their models whenever possible.https://www.mdpi.com/1660-4601/18/13/6856spatial epidemiologyhealth disparitiesBayesian inferencemortality ratesmeasurement errorspatial autocorrelation
collection DOAJ
language English
format Article
sources DOAJ
author Connor Donegan
Yongwan Chun
Daniel A. Griffith
spellingShingle Connor Donegan
Yongwan Chun
Daniel A. Griffith
Modeling Community Health with Areal Data: Bayesian Inference with Survey Standard Errors and Spatial Structure
International Journal of Environmental Research and Public Health
spatial epidemiology
health disparities
Bayesian inference
mortality rates
measurement error
spatial autocorrelation
author_facet Connor Donegan
Yongwan Chun
Daniel A. Griffith
author_sort Connor Donegan
title Modeling Community Health with Areal Data: Bayesian Inference with Survey Standard Errors and Spatial Structure
title_short Modeling Community Health with Areal Data: Bayesian Inference with Survey Standard Errors and Spatial Structure
title_full Modeling Community Health with Areal Data: Bayesian Inference with Survey Standard Errors and Spatial Structure
title_fullStr Modeling Community Health with Areal Data: Bayesian Inference with Survey Standard Errors and Spatial Structure
title_full_unstemmed Modeling Community Health with Areal Data: Bayesian Inference with Survey Standard Errors and Spatial Structure
title_sort modeling community health with areal data: bayesian inference with survey standard errors and spatial structure
publisher MDPI AG
series International Journal of Environmental Research and Public Health
issn 1661-7827
1660-4601
publishDate 2021-06-01
description Epidemiologists and health geographers routinely use small-area survey estimates as covariates to model areal and even individual health outcomes. American Community Survey (ACS) estimates are accompanied by standard errors (SEs), but it is not yet standard practice to use them for evaluating or modeling data reliability. ACS SEs vary systematically across regions, neighborhoods, socioeconomic characteristics, and variables. Failure to consider probable observational error may have substantial impact on the large bodies of literature relying on small-area estimates, including inferential biases and over-confidence in results. The issue is particularly salient for predictive models employed to prioritize communities for service provision or funding allocation. Leveraging the tenets of plausible reasoning and Bayes’ theorem, we propose a conceptual framework and workflow for spatial data analysis with areal survey data, including visual diagnostics and model specifications. To illustrate, we follow Krieger et al.’s (2018) call to routinely use the Index of Concentration at the Extremes (ICE) to monitor spatial inequalities in health and mortality. We construct and examine SEs for the ICE, use visual diagnostics to evaluate our observational error model for the ICE, and then estimate an ICE–mortality gradient by incorporating the latter model into our model of sex-specific, midlife (ages 55–64), all-cause United States county mortality rates. We urge researchers to consider data quality as a criterion for variable selection prior to modeling, and to incorporate data reliability information into their models whenever possible.
topic spatial epidemiology
health disparities
Bayesian inference
mortality rates
measurement error
spatial autocorrelation
url https://www.mdpi.com/1660-4601/18/13/6856
work_keys_str_mv AT connordonegan modelingcommunityhealthwitharealdatabayesianinferencewithsurveystandarderrorsandspatialstructure
AT yongwanchun modelingcommunityhealthwitharealdatabayesianinferencewithsurveystandarderrorsandspatialstructure
AT danielagriffith modelingcommunityhealthwitharealdatabayesianinferencewithsurveystandarderrorsandspatialstructure
_version_ 1721299586410086400