Utilization of Machine Learning-Based Computer Vision and Voice Analysis to Derive Digital Biomarkers of Cognitive Functioning in Trauma Survivors
Background: Alterations in multiple domains of cognition have been observed in individuals who have experienced a traumatic stressor. These domains may provide important insights in identifying underlying neurobiological dysfunction driving an individual’s clinical response to trauma. However, such...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Karger Publishers
2020-12-01
|
Series: | Digital Biomarkers |
Subjects: | |
Online Access: | https://www.karger.com/Article/FullText/512394 |
id |
doaj-a7ab38cf2ec1409cb25c629d06d47931 |
---|---|
record_format |
Article |
spelling |
doaj-a7ab38cf2ec1409cb25c629d06d479312021-01-28T15:17:52ZengKarger PublishersDigital Biomarkers2504-110X2020-12-0151162310.1159/000512394512394Utilization of Machine Learning-Based Computer Vision and Voice Analysis to Derive Digital Biomarkers of Cognitive Functioning in Trauma SurvivorsKatharina SchultebraucksVijay YadavIsaac R. Galatzer-LevyBackground: Alterations in multiple domains of cognition have been observed in individuals who have experienced a traumatic stressor. These domains may provide important insights in identifying underlying neurobiological dysfunction driving an individual’s clinical response to trauma. However, such assessments are burdensome, costly, and time-consuming. To overcome barriers, efforts have emerged to measure multiple domains of cognitive functioning through the application of machine learning (ML) models to passive data sources. Methods: We utilized automated computer vision and voice analysis methods to extract facial, movement, and speech characteristics from semi-structured clinical interviews in 81 trauma survivors who additionally completed a cognitive assessment battery. A ML-based regression framework was used to identify variance in visual and auditory measures that relate to multiple cognitive domains. Results: Models derived from visual and auditory measures collectively accounted for a large variance in multiple domains of cognitive functioning, including motor coordination (R2 = 0.52), processing speed (R2 = 0.42), emotional bias (R2 = 0.52), sustained attention (R2 = 0.51), controlled attention (R2 = 0.44), cognitive flexibility (R2 = 0.43), cognitive inhibition (R2 = 0.64), and executive functioning (R2 = 0.63), consistent with the high test-retest reliability of traditional cognitive assessments. Face, voice, speech content, and movement have all significantly contributed to explaining the variance in predicting functioning in all cognitive domains. Conclusions: The results demonstrate the feasibility of automated measurement of reliable proxies of cognitive functioning through low-burden passive patient evaluations. This makes it easier to monitor cognitive functions and to intervene earlier and at a lower threshold without requiring a time-consuming neurocognitive assessment by, for instance, a licensed psychologist with specialized training in neuropsychology.https://www.karger.com/Article/FullText/512394computer visiondeep learningvoice analysisemergency departmentdigital biomarkerscognitive functioning |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Katharina Schultebraucks Vijay Yadav Isaac R. Galatzer-Levy |
spellingShingle |
Katharina Schultebraucks Vijay Yadav Isaac R. Galatzer-Levy Utilization of Machine Learning-Based Computer Vision and Voice Analysis to Derive Digital Biomarkers of Cognitive Functioning in Trauma Survivors Digital Biomarkers computer vision deep learning voice analysis emergency department digital biomarkers cognitive functioning |
author_facet |
Katharina Schultebraucks Vijay Yadav Isaac R. Galatzer-Levy |
author_sort |
Katharina Schultebraucks |
title |
Utilization of Machine Learning-Based Computer Vision and Voice Analysis to Derive Digital Biomarkers of Cognitive Functioning in Trauma Survivors |
title_short |
Utilization of Machine Learning-Based Computer Vision and Voice Analysis to Derive Digital Biomarkers of Cognitive Functioning in Trauma Survivors |
title_full |
Utilization of Machine Learning-Based Computer Vision and Voice Analysis to Derive Digital Biomarkers of Cognitive Functioning in Trauma Survivors |
title_fullStr |
Utilization of Machine Learning-Based Computer Vision and Voice Analysis to Derive Digital Biomarkers of Cognitive Functioning in Trauma Survivors |
title_full_unstemmed |
Utilization of Machine Learning-Based Computer Vision and Voice Analysis to Derive Digital Biomarkers of Cognitive Functioning in Trauma Survivors |
title_sort |
utilization of machine learning-based computer vision and voice analysis to derive digital biomarkers of cognitive functioning in trauma survivors |
publisher |
Karger Publishers |
series |
Digital Biomarkers |
issn |
2504-110X |
publishDate |
2020-12-01 |
description |
Background: Alterations in multiple domains of cognition have been observed in individuals who have experienced a traumatic stressor. These domains may provide important insights in identifying underlying neurobiological dysfunction driving an individual’s clinical response to trauma. However, such assessments are burdensome, costly, and time-consuming. To overcome barriers, efforts have emerged to measure multiple domains of cognitive functioning through the application of machine learning (ML) models to passive data sources. Methods: We utilized automated computer vision and voice analysis methods to extract facial, movement, and speech characteristics from semi-structured clinical interviews in 81 trauma survivors who additionally completed a cognitive assessment battery. A ML-based regression framework was used to identify variance in visual and auditory measures that relate to multiple cognitive domains. Results: Models derived from visual and auditory measures collectively accounted for a large variance in multiple domains of cognitive functioning, including motor coordination (R2 = 0.52), processing speed (R2 = 0.42), emotional bias (R2 = 0.52), sustained attention (R2 = 0.51), controlled attention (R2 = 0.44), cognitive flexibility (R2 = 0.43), cognitive inhibition (R2 = 0.64), and executive functioning (R2 = 0.63), consistent with the high test-retest reliability of traditional cognitive assessments. Face, voice, speech content, and movement have all significantly contributed to explaining the variance in predicting functioning in all cognitive domains. Conclusions: The results demonstrate the feasibility of automated measurement of reliable proxies of cognitive functioning through low-burden passive patient evaluations. This makes it easier to monitor cognitive functions and to intervene earlier and at a lower threshold without requiring a time-consuming neurocognitive assessment by, for instance, a licensed psychologist with specialized training in neuropsychology. |
topic |
computer vision deep learning voice analysis emergency department digital biomarkers cognitive functioning |
url |
https://www.karger.com/Article/FullText/512394 |
work_keys_str_mv |
AT katharinaschultebraucks utilizationofmachinelearningbasedcomputervisionandvoiceanalysistoderivedigitalbiomarkersofcognitivefunctioningintraumasurvivors AT vijayyadav utilizationofmachinelearningbasedcomputervisionandvoiceanalysistoderivedigitalbiomarkersofcognitivefunctioningintraumasurvivors AT isaacrgalatzerlevy utilizationofmachinelearningbasedcomputervisionandvoiceanalysistoderivedigitalbiomarkersofcognitivefunctioningintraumasurvivors |
_version_ |
1724319662412398592 |