Machine learning to refine decision making within a syndromic surveillance service

Abstract Background Worldwide, syndromic surveillance is increasingly used for improved and timely situational awareness and early identification of public health threats. Syndromic data streams are fed into detection algorithms, which produce statistical alarms highlighting potential activity of pu...

Full description

Bibliographic Details
Main Authors: I. R. Lake, F. J. Colón-González, G. C. Barker, R. A. Morbey, G. E. Smith, A. J. Elliot
Format: Article
Language:English
Published: BMC 2019-05-01
Series:BMC Public Health
Subjects:
Online Access:http://link.springer.com/article/10.1186/s12889-019-6916-9
id doaj-f86b64e017ae45a49ebe8204b7d377b3
record_format Article
spelling doaj-f86b64e017ae45a49ebe8204b7d377b32020-11-25T02:38:22ZengBMCBMC Public Health1471-24582019-05-0119111210.1186/s12889-019-6916-9Machine learning to refine decision making within a syndromic surveillance serviceI. R. Lake0F. J. Colón-González1G. C. Barker2R. A. Morbey3G. E. Smith4A. J. Elliot5School of Environmental Sciences, University of East AngliaSchool of Environmental Sciences, University of East AngliaNational Institute for Health Research Health Protection Research Unit in Emergency Preparedness and ResponseNational Institute for Health Research Health Protection Research Unit in Emergency Preparedness and ResponseNational Institute for Health Research Health Protection Research Unit in Emergency Preparedness and ResponseNational Institute for Health Research Health Protection Research Unit in Emergency Preparedness and ResponseAbstract Background Worldwide, syndromic surveillance is increasingly used for improved and timely situational awareness and early identification of public health threats. Syndromic data streams are fed into detection algorithms, which produce statistical alarms highlighting potential activity of public health importance. All alarms must be assessed to confirm whether they are of public health importance. In England, approximately 100 alarms are generated daily and, although their analysis is formalised through a risk assessment process, the process requires notable time, training, and maintenance of an expertise base to determine which alarms are of public health importance. The process is made more complicated by the observation that only 0.1% of statistical alarms are deemed to be of public health importance. Therefore, the aims of this study were to evaluate machine learning as a tool for computer-assisted human decision-making when assessing statistical alarms. Methods A record of the risk assessment process was obtained from Public Health England for all 67,505 statistical alarms between August 2013 and October 2015. This record contained information on the characteristics of the alarm (e.g. size, location). We used three Bayesian classifiers- naïve Bayes, tree-augmented naïve Bayes and Multinets - to examine the risk assessment record in England with respect to the final ‘Decision’ outcome made by an epidemiologist of ‘Alert’, ‘Monitor’ or ‘No-action’. Two further classifications based upon tree-augmented naïve Bayes and Multinets were implemented to account for the predominance of ‘No-action’ outcomes. Results The attributes of each individual risk assessment were linked to the final decision made by an epidemiologist, providing confidence in the current process. The naïve Bayesian classifier performed best, correctly classifying 51.5% of ‘Alert’ outcomes. If the ‘Alert’ and ‘Monitor’ actions are combined then performance increases to 82.6% correctly classified. We demonstrate how a decision support system based upon a naïve Bayes classifier could be operationalised within an operational syndromic surveillance system. Conclusions Within syndromic surveillance systems, machine learning techniques have the potential to make risk assessment following statistical alarms more automated, robust, and rigorous. However, our results also highlight the importance of specialist human input to the process.http://link.springer.com/article/10.1186/s12889-019-6916-9Syndromic surveillancePublic healthDecision makingBayes’ theoremMachine learningArtificial intelligence
collection DOAJ
language English
format Article
sources DOAJ
author I. R. Lake
F. J. Colón-González
G. C. Barker
R. A. Morbey
G. E. Smith
A. J. Elliot
spellingShingle I. R. Lake
F. J. Colón-González
G. C. Barker
R. A. Morbey
G. E. Smith
A. J. Elliot
Machine learning to refine decision making within a syndromic surveillance service
BMC Public Health
Syndromic surveillance
Public health
Decision making
Bayes’ theorem
Machine learning
Artificial intelligence
author_facet I. R. Lake
F. J. Colón-González
G. C. Barker
R. A. Morbey
G. E. Smith
A. J. Elliot
author_sort I. R. Lake
title Machine learning to refine decision making within a syndromic surveillance service
title_short Machine learning to refine decision making within a syndromic surveillance service
title_full Machine learning to refine decision making within a syndromic surveillance service
title_fullStr Machine learning to refine decision making within a syndromic surveillance service
title_full_unstemmed Machine learning to refine decision making within a syndromic surveillance service
title_sort machine learning to refine decision making within a syndromic surveillance service
publisher BMC
series BMC Public Health
issn 1471-2458
publishDate 2019-05-01
description Abstract Background Worldwide, syndromic surveillance is increasingly used for improved and timely situational awareness and early identification of public health threats. Syndromic data streams are fed into detection algorithms, which produce statistical alarms highlighting potential activity of public health importance. All alarms must be assessed to confirm whether they are of public health importance. In England, approximately 100 alarms are generated daily and, although their analysis is formalised through a risk assessment process, the process requires notable time, training, and maintenance of an expertise base to determine which alarms are of public health importance. The process is made more complicated by the observation that only 0.1% of statistical alarms are deemed to be of public health importance. Therefore, the aims of this study were to evaluate machine learning as a tool for computer-assisted human decision-making when assessing statistical alarms. Methods A record of the risk assessment process was obtained from Public Health England for all 67,505 statistical alarms between August 2013 and October 2015. This record contained information on the characteristics of the alarm (e.g. size, location). We used three Bayesian classifiers- naïve Bayes, tree-augmented naïve Bayes and Multinets - to examine the risk assessment record in England with respect to the final ‘Decision’ outcome made by an epidemiologist of ‘Alert’, ‘Monitor’ or ‘No-action’. Two further classifications based upon tree-augmented naïve Bayes and Multinets were implemented to account for the predominance of ‘No-action’ outcomes. Results The attributes of each individual risk assessment were linked to the final decision made by an epidemiologist, providing confidence in the current process. The naïve Bayesian classifier performed best, correctly classifying 51.5% of ‘Alert’ outcomes. If the ‘Alert’ and ‘Monitor’ actions are combined then performance increases to 82.6% correctly classified. We demonstrate how a decision support system based upon a naïve Bayes classifier could be operationalised within an operational syndromic surveillance system. Conclusions Within syndromic surveillance systems, machine learning techniques have the potential to make risk assessment following statistical alarms more automated, robust, and rigorous. However, our results also highlight the importance of specialist human input to the process.
topic Syndromic surveillance
Public health
Decision making
Bayes’ theorem
Machine learning
Artificial intelligence
url http://link.springer.com/article/10.1186/s12889-019-6916-9
work_keys_str_mv AT irlake machinelearningtorefinedecisionmakingwithinasyndromicsurveillanceservice
AT fjcolongonzalez machinelearningtorefinedecisionmakingwithinasyndromicsurveillanceservice
AT gcbarker machinelearningtorefinedecisionmakingwithinasyndromicsurveillanceservice
AT ramorbey machinelearningtorefinedecisionmakingwithinasyndromicsurveillanceservice
AT gesmith machinelearningtorefinedecisionmakingwithinasyndromicsurveillanceservice
AT ajelliot machinelearningtorefinedecisionmakingwithinasyndromicsurveillanceservice
_version_ 1724791373157105664