Building a research partnership between computer scientists and health service researchers for access and analysis of population-level health datasets: what are we learning?
Background and rationale There is widespread enthusiasm to improve health through the application of artificial intelligence and machine learning (AI/ML) methods to large population-level health datasets. Achieving this may require successful collaboration between institutions as well as between...
Main Authors: | , , , , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Swansea University
2019-11-01
|
Series: | International Journal of Population Data Science |
Online Access: | https://ijpds.org/article/view/1208 |
id |
doaj-153f6904cc534c38bfcb7dae33283d81 |
---|---|
record_format |
Article |
spelling |
doaj-153f6904cc534c38bfcb7dae33283d812020-11-25T01:44:34ZengSwansea UniversityInternational Journal of Population Data Science2399-49082019-11-014310.23889/ijpds.v4i3.1208Building a research partnership between computer scientists and health service researchers for access and analysis of population-level health datasets: what are we learning?Michael Schull0Michael Brudno1Marzyeh Ghassemi2Garth Gibson3Anna Goldenbrg4Alison Paprica5Laura Rosella6Therese Stukel7Charles Victor8Carl Virtanen9ICESHPC4HVector InstituteVector InstituteVector InstituteVector InstituteICESICESICESHPC4H Background and rationale There is widespread enthusiasm to improve health through the application of artificial intelligence and machine learning (AI/ML) methods to large population-level health datasets. Achieving this may require successful collaboration between institutions as well as between computer scientists (CS), machine learning researchers (MLR) and health service researchers (HSR). Main Aim Describe lessons learned in creating the Health Artificial Data and Analysis Platform (HAIDAP) in Ontario, Canada. Methods/Approach A partnership between a HSR institute (ICES), an AI/ML institute (Vector) and a high-performance computing center (HPC4H) was initiated in 2017 to enable the application of AI/ML methods to population-level health data for the province of Ontario (population 14M). We describe lessons learned (and being learned) following the HAIDAP’s launch. Results The HAIDAP was launched in 2019. Major learnings include: 1) importance of institutional partnerships and alignment with institutional strategies; 2) potential of joint institutional risk-sharing models; 3) need for scientific collaborations bridging disciplines around joint research projects; 4) sensitivity to different scientific cultures (e.g., academic prestige of conference proceedings for MLR vs journal publications for HSR; traditional statistical vs. ML model assumptions); 5) differences in research timeline expectations; 6) different experience with and expectations for access to de-identified routinely collected data (e.g., need for research ethics committee project approvals and privacy impact assessments); 7) developing data access models that enable greater flexibility (e.g., importing code or using open source tools); 8) broadening data access models to allow modern high-dimensional exploratory data analysis; 9) obtaining support of information/privacy regulator; 10) the hardware is the (relatively) easy part compared to other success factors. Conclusion The HAIDAP has enabled multi-disciplinary collaborations and novel AI/ML research of Ontario’s population-level health data. Collectively we have learned that additional effort is required to develop systems and processes enabling more efficient access to data and analytic tools for the analysis of administrative health data. https://ijpds.org/article/view/1208 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Michael Schull Michael Brudno Marzyeh Ghassemi Garth Gibson Anna Goldenbrg Alison Paprica Laura Rosella Therese Stukel Charles Victor Carl Virtanen |
spellingShingle |
Michael Schull Michael Brudno Marzyeh Ghassemi Garth Gibson Anna Goldenbrg Alison Paprica Laura Rosella Therese Stukel Charles Victor Carl Virtanen Building a research partnership between computer scientists and health service researchers for access and analysis of population-level health datasets: what are we learning? International Journal of Population Data Science |
author_facet |
Michael Schull Michael Brudno Marzyeh Ghassemi Garth Gibson Anna Goldenbrg Alison Paprica Laura Rosella Therese Stukel Charles Victor Carl Virtanen |
author_sort |
Michael Schull |
title |
Building a research partnership between computer scientists and health service researchers for access and analysis of population-level health datasets: what are we learning? |
title_short |
Building a research partnership between computer scientists and health service researchers for access and analysis of population-level health datasets: what are we learning? |
title_full |
Building a research partnership between computer scientists and health service researchers for access and analysis of population-level health datasets: what are we learning? |
title_fullStr |
Building a research partnership between computer scientists and health service researchers for access and analysis of population-level health datasets: what are we learning? |
title_full_unstemmed |
Building a research partnership between computer scientists and health service researchers for access and analysis of population-level health datasets: what are we learning? |
title_sort |
building a research partnership between computer scientists and health service researchers for access and analysis of population-level health datasets: what are we learning? |
publisher |
Swansea University |
series |
International Journal of Population Data Science |
issn |
2399-4908 |
publishDate |
2019-11-01 |
description |
Background and rationale
There is widespread enthusiasm to improve health through the application of artificial intelligence and machine learning (AI/ML) methods to large population-level health datasets. Achieving this may require successful collaboration between institutions as well as between computer scientists (CS), machine learning researchers (MLR) and health service researchers (HSR).
Main Aim
Describe lessons learned in creating the Health Artificial Data and Analysis Platform (HAIDAP) in Ontario, Canada.
Methods/Approach
A partnership between a HSR institute (ICES), an AI/ML institute (Vector) and a high-performance computing center (HPC4H) was initiated in 2017 to enable the application of AI/ML methods to population-level health data for the province of Ontario (population 14M). We describe lessons learned (and being learned) following the HAIDAP’s launch.
Results
The HAIDAP was launched in 2019. Major learnings include: 1) importance of institutional partnerships and alignment with institutional strategies; 2) potential of joint institutional risk-sharing models; 3) need for scientific collaborations bridging disciplines around joint research projects; 4) sensitivity to different scientific cultures (e.g., academic prestige of conference proceedings for MLR vs journal publications for HSR; traditional statistical vs. ML model assumptions); 5) differences in research timeline expectations; 6) different experience with and expectations for access to de-identified routinely collected data (e.g., need for research ethics committee project approvals and privacy impact assessments); 7) developing data access models that enable greater flexibility (e.g., importing code or using open source tools); 8) broadening data access models to allow modern high-dimensional exploratory data analysis; 9) obtaining support of information/privacy regulator; 10) the hardware is the (relatively) easy part compared to other success factors.
Conclusion
The HAIDAP has enabled multi-disciplinary collaborations and novel AI/ML research of Ontario’s population-level health data. Collectively we have learned that additional effort is required to develop systems and processes enabling more efficient access to data and analytic tools for the analysis of administrative health data.
|
url |
https://ijpds.org/article/view/1208 |
work_keys_str_mv |
AT michaelschull buildingaresearchpartnershipbetweencomputerscientistsandhealthserviceresearchersforaccessandanalysisofpopulationlevelhealthdatasetswhatarewelearning AT michaelbrudno buildingaresearchpartnershipbetweencomputerscientistsandhealthserviceresearchersforaccessandanalysisofpopulationlevelhealthdatasetswhatarewelearning AT marzyehghassemi buildingaresearchpartnershipbetweencomputerscientistsandhealthserviceresearchersforaccessandanalysisofpopulationlevelhealthdatasetswhatarewelearning AT garthgibson buildingaresearchpartnershipbetweencomputerscientistsandhealthserviceresearchersforaccessandanalysisofpopulationlevelhealthdatasetswhatarewelearning AT annagoldenbrg buildingaresearchpartnershipbetweencomputerscientistsandhealthserviceresearchersforaccessandanalysisofpopulationlevelhealthdatasetswhatarewelearning AT alisonpaprica buildingaresearchpartnershipbetweencomputerscientistsandhealthserviceresearchersforaccessandanalysisofpopulationlevelhealthdatasetswhatarewelearning AT laurarosella buildingaresearchpartnershipbetweencomputerscientistsandhealthserviceresearchersforaccessandanalysisofpopulationlevelhealthdatasetswhatarewelearning AT theresestukel buildingaresearchpartnershipbetweencomputerscientistsandhealthserviceresearchersforaccessandanalysisofpopulationlevelhealthdatasetswhatarewelearning AT charlesvictor buildingaresearchpartnershipbetweencomputerscientistsandhealthserviceresearchersforaccessandanalysisofpopulationlevelhealthdatasetswhatarewelearning AT carlvirtanen buildingaresearchpartnershipbetweencomputerscientistsandhealthserviceresearchersforaccessandanalysisofpopulationlevelhealthdatasetswhatarewelearning |
_version_ |
1725027922442452992 |