Improving Outcome Prediction for Traumatic Brain Injury From Imbalanced Datasets Using RUSBoosted Trees on Electroencephalography Spectral Power

Reliable prediction of traumatic brain injury (TBI) outcomes based on machine learning (ML) that is derived from quantitative electroencephalography (EEG) features has renewed interest in recent years. Nevertheless, the approach has suffered from imbalanced datasets. Hence, to get a reliable predict...

Full description

Bibliographic Details
Main Authors: Nor Safira Elaina Mohd Noor, Haidi Ibrahim, Muhammad Hanif Che Lah, Jafri Malin Abdullah
Format: Article
Language:English
Published: IEEE 2021-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9527215/
Description
Summary:Reliable prediction of traumatic brain injury (TBI) outcomes based on machine learning (ML) that is derived from quantitative electroencephalography (EEG) features has renewed interest in recent years. Nevertheless, the approach has suffered from imbalanced datasets. Hence, to get a reliable predictive model for predicting outcomes, specifically in a high proportion of moderate TBI with good outcomes, could be challenging. This work proposes an improved outcome predictive model that combines the absolute power spectral density (PSD) as input features for training random under-sampling boosting decision trees (RUSBoosted Trees) as a classifier. Resting-state, eyes-closed EEG data were obtained from 27 moderate TBI patients with follow-up visits. Patient outcome at 4&#x2013;10 weeks to 12-month was dichotomized based on the Glasgow Outcome Scale as poor (GOS score &#x2264; 4) and good outcomes (GOS score = 5). The predictive values of absolute PSD from five frequency bands: <inline-formula> <tex-math notation="LaTeX">$\delta $ </tex-math></inline-formula> (0.5-4Hz), <inline-formula> <tex-math notation="LaTeX">$\theta $ </tex-math></inline-formula> (4-7Hz), <inline-formula> <tex-math notation="LaTeX">$\alpha $ </tex-math></inline-formula> (7-13Hz), <inline-formula> <tex-math notation="LaTeX">$\beta $ </tex-math></inline-formula> (13-30Hz) and <inline-formula> <tex-math notation="LaTeX">$\gamma $ </tex-math></inline-formula> (30&#x2013;100Hz) were evaluated to identify the most informative predictors for reliable prediction outcomes. RUSBoosted Trees performed best at discriminating patients into two outcomes categories (G-Mean &#x003D; 92.95&#x0025;, TP<sub>rate</sub> &#x003D; 100&#x0025;, TN<sub>rate</sub> &#x003D; 86.4&#x0025;) of absolute PSD in <inline-formula> <tex-math notation="LaTeX">$\delta $ </tex-math></inline-formula> and <inline-formula> <tex-math notation="LaTeX">$\gamma $ </tex-math></inline-formula> bands, which was excellent compared to the other state-of-the-art methods. The highest area under the curve (AUC) of absolute PSD in <inline-formula> <tex-math notation="LaTeX">$\delta $ </tex-math></inline-formula> (AUC<inline-formula> <tex-math notation="LaTeX">$_\delta =0.97$ </tex-math></inline-formula>) and <inline-formula> <tex-math notation="LaTeX">$\gamma $ </tex-math></inline-formula> (AUC<inline-formula> <tex-math notation="LaTeX">$_\gamma =0.95$ </tex-math></inline-formula>) revealed their predictive values as robust prognostic markers for prediction outcomes. The RUSBoosted Trees presents a promising result in prognosis prediction of highly imbalanced data, making it an accessible prediction tool for clinical decision-making, unlike the black-box approaches.
ISSN:2169-3536