Predicting mental health problems in adolescence using machine learning techniques.

BACKGROUND:Predicting which children will go on to develop mental health symptoms as adolescents is critical for early intervention and preventing future, severe negative outcomes. Although many aspects of a child's life, personality, and symptoms have been flagged as indicators, there is curre...

Full description

Bibliographic Details
Main Authors: Ashley E Tate, Ryan C McCabe, Henrik Larsson, Sebastian Lundström, Paul Lichtenstein, Ralf Kuja-Halkola
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2020-01-01
Series:PLoS ONE
Online Access:https://doi.org/10.1371/journal.pone.0230389
id doaj-fc7606ecb8ca430bbbc6b05ebe72d981
record_format Article
spelling doaj-fc7606ecb8ca430bbbc6b05ebe72d9812021-03-03T21:38:02ZengPublic Library of Science (PLoS)PLoS ONE1932-62032020-01-01154e023038910.1371/journal.pone.0230389Predicting mental health problems in adolescence using machine learning techniques.Ashley E TateRyan C McCabeHenrik LarssonSebastian LundströmPaul LichtensteinRalf Kuja-HalkolaBACKGROUND:Predicting which children will go on to develop mental health symptoms as adolescents is critical for early intervention and preventing future, severe negative outcomes. Although many aspects of a child's life, personality, and symptoms have been flagged as indicators, there is currently no model created to screen the general population for the risk of developing mental health problems. Additionally, the advent of machine learning techniques represents an exciting way to potentially improve upon the standard prediction modelling technique, logistic regression. Therefore, we aimed to I.) develop a model that can predict mental health problems in mid-adolescence II.) investigate if machine learning techniques (random forest, support vector machines, neural network, and XGBoost) will outperform logistic regression. METHODS:In 7,638 twins from the Child and Adolescent Twin Study in Sweden we used 474 predictors derived from parental report and register data. The outcome, mental health problems, was determined by the Strengths and Difficulties Questionnaire. Model performance was determined by the area under the receiver operating characteristic curve (AUC). RESULTS:Although model performance varied somewhat, the confidence interval overlapped for each model indicating non-significant superiority for the random forest model (AUC = 0.739, 95% CI 0.708-0.769), followed closely by support vector machines (AUC = 0.735, 95% CI 0.707-0.764). CONCLUSION:Ultimately, our top performing model would not be suitable for clinical use, however it lays important groundwork for future models seeking to predict general mental health outcomes. Future studies should make use of parent-rated assessments when possible. Additionally, it may not be necessary for similar studies to forgo logistic regression in favor of other more complex methods.https://doi.org/10.1371/journal.pone.0230389
collection DOAJ
language English
format Article
sources DOAJ
author Ashley E Tate
Ryan C McCabe
Henrik Larsson
Sebastian Lundström
Paul Lichtenstein
Ralf Kuja-Halkola
spellingShingle Ashley E Tate
Ryan C McCabe
Henrik Larsson
Sebastian Lundström
Paul Lichtenstein
Ralf Kuja-Halkola
Predicting mental health problems in adolescence using machine learning techniques.
PLoS ONE
author_facet Ashley E Tate
Ryan C McCabe
Henrik Larsson
Sebastian Lundström
Paul Lichtenstein
Ralf Kuja-Halkola
author_sort Ashley E Tate
title Predicting mental health problems in adolescence using machine learning techniques.
title_short Predicting mental health problems in adolescence using machine learning techniques.
title_full Predicting mental health problems in adolescence using machine learning techniques.
title_fullStr Predicting mental health problems in adolescence using machine learning techniques.
title_full_unstemmed Predicting mental health problems in adolescence using machine learning techniques.
title_sort predicting mental health problems in adolescence using machine learning techniques.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2020-01-01
description BACKGROUND:Predicting which children will go on to develop mental health symptoms as adolescents is critical for early intervention and preventing future, severe negative outcomes. Although many aspects of a child's life, personality, and symptoms have been flagged as indicators, there is currently no model created to screen the general population for the risk of developing mental health problems. Additionally, the advent of machine learning techniques represents an exciting way to potentially improve upon the standard prediction modelling technique, logistic regression. Therefore, we aimed to I.) develop a model that can predict mental health problems in mid-adolescence II.) investigate if machine learning techniques (random forest, support vector machines, neural network, and XGBoost) will outperform logistic regression. METHODS:In 7,638 twins from the Child and Adolescent Twin Study in Sweden we used 474 predictors derived from parental report and register data. The outcome, mental health problems, was determined by the Strengths and Difficulties Questionnaire. Model performance was determined by the area under the receiver operating characteristic curve (AUC). RESULTS:Although model performance varied somewhat, the confidence interval overlapped for each model indicating non-significant superiority for the random forest model (AUC = 0.739, 95% CI 0.708-0.769), followed closely by support vector machines (AUC = 0.735, 95% CI 0.707-0.764). CONCLUSION:Ultimately, our top performing model would not be suitable for clinical use, however it lays important groundwork for future models seeking to predict general mental health outcomes. Future studies should make use of parent-rated assessments when possible. Additionally, it may not be necessary for similar studies to forgo logistic regression in favor of other more complex methods.
url https://doi.org/10.1371/journal.pone.0230389
work_keys_str_mv AT ashleyetate predictingmentalhealthproblemsinadolescenceusingmachinelearningtechniques
AT ryancmccabe predictingmentalhealthproblemsinadolescenceusingmachinelearningtechniques
AT henriklarsson predictingmentalhealthproblemsinadolescenceusingmachinelearningtechniques
AT sebastianlundstrom predictingmentalhealthproblemsinadolescenceusingmachinelearningtechniques
AT paullichtenstein predictingmentalhealthproblemsinadolescenceusingmachinelearningtechniques
AT ralfkujahalkola predictingmentalhealthproblemsinadolescenceusingmachinelearningtechniques
_version_ 1714815962363985920