Health-Related Data Sources Accessible to Health Researchers From the US Government: Mapping Review

BackgroundBig data from large, government-sponsored surveys and data sets offers researchers opportunities to conduct population-based studies of important health issues in the United States, as well as develop preliminary data to support proposed future work. Yet, navigating...

Full description

Bibliographic Details
Published in:Journal of Medical Internet Research
Main Authors: Ann Annis, Crista Reaves, Jessica Sender, Sherry Bumpus
Format: Article
Language:English
Published: JMIR Publications 2023-04-01
Online Access:https://www.jmir.org/2023/1/e43802
_version_ 1852698259317850112
author Ann Annis
Crista Reaves
Jessica Sender
Sherry Bumpus
author_facet Ann Annis
Crista Reaves
Jessica Sender
Sherry Bumpus
author_sort Ann Annis
collection DOAJ
container_title Journal of Medical Internet Research
description BackgroundBig data from large, government-sponsored surveys and data sets offers researchers opportunities to conduct population-based studies of important health issues in the United States, as well as develop preliminary data to support proposed future work. Yet, navigating these national data sources is challenging. Despite the widespread availability of national data, there is little guidance for researchers on how to access and evaluate the use of these resources. ObjectiveOur aim was to identify and summarize a comprehensive list of federally sponsored, health- and health care–related data sources that are accessible in the public domain in order to facilitate their use by researchers. MethodsWe conducted a systematic mapping review of government sources of health-related data on US populations and with active or recent (previous 10 years) data collection. The key measures were government sponsor, overview and purpose of data, population of interest, sampling design, sample size, data collection methodology, type and description of data, and cost to obtain data. Convergent synthesis was used to aggregate findings. ResultsAmong 106 unique data sources, 57 met the inclusion criteria. Data sources were classified as survey or assessment data (n=30, 53%), trends data (n=27, 47%), summative processed data (n=27, 47%), primary registry data (n=17, 30%), and evaluative data (n=11, 19%). Most (n=39, 68%) served more than 1 purpose. The population of interest included individuals/patients (n=40, 70%), providers (n=15, 26%), and health care sites and systems (n=14, 25%). The sources collected data on demographic (n=44, 77%) and clinical information (n=35, 61%), health behaviors (n=24, 42%), provider or practice characteristics (n=22, 39%), health care costs (n=17, 30%), and laboratory tests (n=8, 14%). Most (n=43, 75%) offered free data sets. ConclusionsA broad scope of national health data is accessible to researchers. These data provide insights into important health issues and the nation’s health care system while eliminating the burden of primary data collection. Data standardization and uniformity were uncommon across government entities, highlighting a need to improve data consistency. Secondary analyses of national data are a feasible, cost-efficient means to address national health concerns.
format Article
id doaj-art-6ba2e8d041fb47ea976c2e45f45fa62d
institution Directory of Open Access Journals
issn 1438-8871
language English
publishDate 2023-04-01
publisher JMIR Publications
record_format Article
spelling doaj-art-6ba2e8d041fb47ea976c2e45f45fa62d2025-08-19T21:21:33ZengJMIR PublicationsJournal of Medical Internet Research1438-88712023-04-0125e4380210.2196/43802Health-Related Data Sources Accessible to Health Researchers From the US Government: Mapping ReviewAnn Annishttps://orcid.org/0000-0003-4243-8441Crista Reaveshttps://orcid.org/0000-0002-6547-291XJessica Senderhttps://orcid.org/0000-0001-9249-4981Sherry Bumpushttps://orcid.org/0000-0002-7812-5183 BackgroundBig data from large, government-sponsored surveys and data sets offers researchers opportunities to conduct population-based studies of important health issues in the United States, as well as develop preliminary data to support proposed future work. Yet, navigating these national data sources is challenging. Despite the widespread availability of national data, there is little guidance for researchers on how to access and evaluate the use of these resources. ObjectiveOur aim was to identify and summarize a comprehensive list of federally sponsored, health- and health care–related data sources that are accessible in the public domain in order to facilitate their use by researchers. MethodsWe conducted a systematic mapping review of government sources of health-related data on US populations and with active or recent (previous 10 years) data collection. The key measures were government sponsor, overview and purpose of data, population of interest, sampling design, sample size, data collection methodology, type and description of data, and cost to obtain data. Convergent synthesis was used to aggregate findings. ResultsAmong 106 unique data sources, 57 met the inclusion criteria. Data sources were classified as survey or assessment data (n=30, 53%), trends data (n=27, 47%), summative processed data (n=27, 47%), primary registry data (n=17, 30%), and evaluative data (n=11, 19%). Most (n=39, 68%) served more than 1 purpose. The population of interest included individuals/patients (n=40, 70%), providers (n=15, 26%), and health care sites and systems (n=14, 25%). The sources collected data on demographic (n=44, 77%) and clinical information (n=35, 61%), health behaviors (n=24, 42%), provider or practice characteristics (n=22, 39%), health care costs (n=17, 30%), and laboratory tests (n=8, 14%). Most (n=43, 75%) offered free data sets. ConclusionsA broad scope of national health data is accessible to researchers. These data provide insights into important health issues and the nation’s health care system while eliminating the burden of primary data collection. Data standardization and uniformity were uncommon across government entities, highlighting a need to improve data consistency. Secondary analyses of national data are a feasible, cost-efficient means to address national health concerns.https://www.jmir.org/2023/1/e43802
spellingShingle Ann Annis
Crista Reaves
Jessica Sender
Sherry Bumpus
Health-Related Data Sources Accessible to Health Researchers From the US Government: Mapping Review
title Health-Related Data Sources Accessible to Health Researchers From the US Government: Mapping Review
title_full Health-Related Data Sources Accessible to Health Researchers From the US Government: Mapping Review
title_fullStr Health-Related Data Sources Accessible to Health Researchers From the US Government: Mapping Review
title_full_unstemmed Health-Related Data Sources Accessible to Health Researchers From the US Government: Mapping Review
title_short Health-Related Data Sources Accessible to Health Researchers From the US Government: Mapping Review
title_sort health related data sources accessible to health researchers from the us government mapping review
url https://www.jmir.org/2023/1/e43802
work_keys_str_mv AT annannis healthrelateddatasourcesaccessibletohealthresearchersfromtheusgovernmentmappingreview
AT cristareaves healthrelateddatasourcesaccessibletohealthresearchersfromtheusgovernmentmappingreview
AT jessicasender healthrelateddatasourcesaccessibletohealthresearchersfromtheusgovernmentmappingreview
AT sherrybumpus healthrelateddatasourcesaccessibletohealthresearchersfromtheusgovernmentmappingreview