Improving risk prediction accuracy for new soldiers in the U.S. Army by adding self-report survey data to administrative data

Abstract Background High rates of mental disorders, suicidality, and interpersonal violence early in the military career have raised interest in implementing preventive interventions with high-risk new enlistees. The Army Study to Assess Risk and Resilience in Servicemembers (STARRS) developed risk-...

Full description

Bibliographic Details
Main Authors: Samantha L. Bernecker, Anthony J. Rosellini, Matthew K. Nock, Wai Tat Chiu, Peter M. Gutierrez, Irving Hwang, Thomas E. Joiner, James A. Naifeh, Nancy A. Sampson, Alan M. Zaslavsky, Murray B. Stein, Robert J. Ursano, Ronald C. Kessler
Format: Article
Language:English
Published: BMC 2018-04-01
Series:BMC Psychiatry
Subjects:
Online Access:http://link.springer.com/article/10.1186/s12888-018-1656-4
id doaj-543602403e3742d1878e995ad9a04882
record_format Article
collection DOAJ
language English
format Article
sources DOAJ
author Samantha L. Bernecker
Anthony J. Rosellini
Matthew K. Nock
Wai Tat Chiu
Peter M. Gutierrez
Irving Hwang
Thomas E. Joiner
James A. Naifeh
Nancy A. Sampson
Alan M. Zaslavsky
Murray B. Stein
Robert J. Ursano
Ronald C. Kessler
spellingShingle Samantha L. Bernecker
Anthony J. Rosellini
Matthew K. Nock
Wai Tat Chiu
Peter M. Gutierrez
Irving Hwang
Thomas E. Joiner
James A. Naifeh
Nancy A. Sampson
Alan M. Zaslavsky
Murray B. Stein
Robert J. Ursano
Ronald C. Kessler
Improving risk prediction accuracy for new soldiers in the U.S. Army by adding self-report survey data to administrative data
BMC Psychiatry
Army
Military
Predictive modeling
Risk assessment
Violence
Sexual assault
author_facet Samantha L. Bernecker
Anthony J. Rosellini
Matthew K. Nock
Wai Tat Chiu
Peter M. Gutierrez
Irving Hwang
Thomas E. Joiner
James A. Naifeh
Nancy A. Sampson
Alan M. Zaslavsky
Murray B. Stein
Robert J. Ursano
Ronald C. Kessler
author_sort Samantha L. Bernecker
title Improving risk prediction accuracy for new soldiers in the U.S. Army by adding self-report survey data to administrative data
title_short Improving risk prediction accuracy for new soldiers in the U.S. Army by adding self-report survey data to administrative data
title_full Improving risk prediction accuracy for new soldiers in the U.S. Army by adding self-report survey data to administrative data
title_fullStr Improving risk prediction accuracy for new soldiers in the U.S. Army by adding self-report survey data to administrative data
title_full_unstemmed Improving risk prediction accuracy for new soldiers in the U.S. Army by adding self-report survey data to administrative data
title_sort improving risk prediction accuracy for new soldiers in the u.s. army by adding self-report survey data to administrative data
publisher BMC
series BMC Psychiatry
issn 1471-244X
publishDate 2018-04-01
description Abstract Background High rates of mental disorders, suicidality, and interpersonal violence early in the military career have raised interest in implementing preventive interventions with high-risk new enlistees. The Army Study to Assess Risk and Resilience in Servicemembers (STARRS) developed risk-targeting systems for these outcomes based on machine learning methods using administrative data predictors. However, administrative data omit many risk factors, raising the question whether risk targeting could be improved by adding self-report survey data to prediction models. If so, the Army may gain from routinely administering surveys that assess additional risk factors. Methods The STARRS New Soldier Survey was administered to 21,790 Regular Army soldiers who agreed to have survey data linked to administrative records. As reported previously, machine learning models using administrative data as predictors found that small proportions of high-risk soldiers accounted for high proportions of negative outcomes. Other machine learning models using self-report survey data as predictors were developed previously for three of these outcomes: major physical violence and sexual violence perpetration among men and sexual violence victimization among women. Here we examined the extent to which this survey information increases prediction accuracy, over models based solely on administrative data, for those three outcomes. We used discrete-time survival analysis to estimate a series of models predicting first occurrence, assessing how model fit improved and concentration of risk increased when adding the predicted risk score based on survey data to the predicted risk score based on administrative data. Results The addition of survey data improved prediction significantly for all outcomes. In the most extreme case, the percentage of reported sexual violence victimization among the 5% of female soldiers with highest predicted risk increased from 17.5% using only administrative predictors to 29.4% adding survey predictors, a 67.9% proportional increase in prediction accuracy. Other proportional increases in concentration of risk ranged from 4.8% to 49.5% (median = 26.0%). Conclusions Data from an ongoing New Soldier Survey could substantially improve accuracy of risk models compared to models based exclusively on administrative predictors. Depending upon the characteristics of interventions used, the increase in targeting accuracy from survey data might offset survey administration costs.
topic Army
Military
Predictive modeling
Risk assessment
Violence
Sexual assault
url http://link.springer.com/article/10.1186/s12888-018-1656-4
work_keys_str_mv AT samanthalbernecker improvingriskpredictionaccuracyfornewsoldiersintheusarmybyaddingselfreportsurveydatatoadministrativedata
AT anthonyjrosellini improvingriskpredictionaccuracyfornewsoldiersintheusarmybyaddingselfreportsurveydatatoadministrativedata
AT matthewknock improvingriskpredictionaccuracyfornewsoldiersintheusarmybyaddingselfreportsurveydatatoadministrativedata
AT waitatchiu improvingriskpredictionaccuracyfornewsoldiersintheusarmybyaddingselfreportsurveydatatoadministrativedata
AT petermgutierrez improvingriskpredictionaccuracyfornewsoldiersintheusarmybyaddingselfreportsurveydatatoadministrativedata
AT irvinghwang improvingriskpredictionaccuracyfornewsoldiersintheusarmybyaddingselfreportsurveydatatoadministrativedata
AT thomasejoiner improvingriskpredictionaccuracyfornewsoldiersintheusarmybyaddingselfreportsurveydatatoadministrativedata
AT jamesanaifeh improvingriskpredictionaccuracyfornewsoldiersintheusarmybyaddingselfreportsurveydatatoadministrativedata
AT nancyasampson improvingriskpredictionaccuracyfornewsoldiersintheusarmybyaddingselfreportsurveydatatoadministrativedata
AT alanmzaslavsky improvingriskpredictionaccuracyfornewsoldiersintheusarmybyaddingselfreportsurveydatatoadministrativedata
AT murraybstein improvingriskpredictionaccuracyfornewsoldiersintheusarmybyaddingselfreportsurveydatatoadministrativedata
AT robertjursano improvingriskpredictionaccuracyfornewsoldiersintheusarmybyaddingselfreportsurveydatatoadministrativedata
AT ronaldckessler improvingriskpredictionaccuracyfornewsoldiersintheusarmybyaddingselfreportsurveydatatoadministrativedata
_version_ 1725397165042302976
spelling doaj-543602403e3742d1878e995ad9a048822020-11-25T00:12:51ZengBMCBMC Psychiatry1471-244X2018-04-0118111210.1186/s12888-018-1656-4Improving risk prediction accuracy for new soldiers in the U.S. Army by adding self-report survey data to administrative dataSamantha L. Bernecker0Anthony J. Rosellini1Matthew K. Nock2Wai Tat Chiu3Peter M. Gutierrez4Irving Hwang5Thomas E. Joiner6James A. Naifeh7Nancy A. Sampson8Alan M. Zaslavsky9Murray B. Stein10Robert J. Ursano11Ronald C. Kessler12Department of Psychology, Harvard UniversityDepartment of Psychological and Brain Sciences, Center for Anxiety and Related Disorders, Boston UniversityDepartment of Psychology, Harvard UniversityDepartment of Health Care Policy, Harvard Medical SchoolDepartment of Psychiatry, University of Colorado School of Medicine, and Rocky Mountain Mental Illness Research, Education, and Clinical Center, Denver Veterans Affairs Medical CenterDepartment of Health Care Policy, Harvard Medical SchoolDepartment of Psychology, Florida State UniversityCenter for the Study of Traumatic Stress, Department of Psychiatry, Uniformed Services University School of MedicineDepartment of Health Care Policy, Harvard Medical SchoolDepartment of Health Care Policy, Harvard Medical SchoolDepartments of Psychiatry and Family Medicine and Public Health, University of California San DiegoCenter for the Study of Traumatic Stress, Department of Psychiatry, Uniformed Services University School of MedicineDepartment of Health Care Policy, Harvard Medical SchoolAbstract Background High rates of mental disorders, suicidality, and interpersonal violence early in the military career have raised interest in implementing preventive interventions with high-risk new enlistees. The Army Study to Assess Risk and Resilience in Servicemembers (STARRS) developed risk-targeting systems for these outcomes based on machine learning methods using administrative data predictors. However, administrative data omit many risk factors, raising the question whether risk targeting could be improved by adding self-report survey data to prediction models. If so, the Army may gain from routinely administering surveys that assess additional risk factors. Methods The STARRS New Soldier Survey was administered to 21,790 Regular Army soldiers who agreed to have survey data linked to administrative records. As reported previously, machine learning models using administrative data as predictors found that small proportions of high-risk soldiers accounted for high proportions of negative outcomes. Other machine learning models using self-report survey data as predictors were developed previously for three of these outcomes: major physical violence and sexual violence perpetration among men and sexual violence victimization among women. Here we examined the extent to which this survey information increases prediction accuracy, over models based solely on administrative data, for those three outcomes. We used discrete-time survival analysis to estimate a series of models predicting first occurrence, assessing how model fit improved and concentration of risk increased when adding the predicted risk score based on survey data to the predicted risk score based on administrative data. Results The addition of survey data improved prediction significantly for all outcomes. In the most extreme case, the percentage of reported sexual violence victimization among the 5% of female soldiers with highest predicted risk increased from 17.5% using only administrative predictors to 29.4% adding survey predictors, a 67.9% proportional increase in prediction accuracy. Other proportional increases in concentration of risk ranged from 4.8% to 49.5% (median = 26.0%). Conclusions Data from an ongoing New Soldier Survey could substantially improve accuracy of risk models compared to models based exclusively on administrative predictors. Depending upon the characteristics of interventions used, the increase in targeting accuracy from survey data might offset survey administration costs.http://link.springer.com/article/10.1186/s12888-018-1656-4ArmyMilitaryPredictive modelingRisk assessmentViolenceSexual assault