Multinomial modelling of TB/HIV co-infection yields a robust predictive signature and generates hypotheses about the HIV+TB+ disease state.

<h4>Background</h4>Current diagnostics are inadequate to meet the challenges presented by co-infection with Mycobacterium tuberculosis (Mtb) and HIV, the leading cause of death for HIV-infected individuals. Improved characterization of Mtb/HIV coinfection as a distinct disease state may...

Full description

Bibliographic Details
Main Authors: Fergal J Duffy, Ethan G Thompson, Thomas J Scriba, Daniel E Zak
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2019-01-01
Series:PLoS ONE
Online Access:https://doi.org/10.1371/journal.pone.0219322
id doaj-d16ac91a98e849afb223cdf4eb8da3f4
record_format Article
spelling doaj-d16ac91a98e849afb223cdf4eb8da3f42021-03-04T10:28:02ZengPublic Library of Science (PLoS)PLoS ONE1932-62032019-01-01147e021932210.1371/journal.pone.0219322Multinomial modelling of TB/HIV co-infection yields a robust predictive signature and generates hypotheses about the HIV+TB+ disease state.Fergal J DuffyEthan G ThompsonThomas J ScribaDaniel E Zak<h4>Background</h4>Current diagnostics are inadequate to meet the challenges presented by co-infection with Mycobacterium tuberculosis (Mtb) and HIV, the leading cause of death for HIV-infected individuals. Improved characterization of Mtb/HIV coinfection as a distinct disease state may lead to better identification and treatment of affected individuals.<h4>Methods</h4>Four previously-published TB and HIV co-infection related datasets were used to train and validate multinomial machine learning classifiers that simultaneously predict TB and HIV status. Classifier predictive performance was measured using leave-one-out cross validation on the training set and blind predictive performance on multiple test sets using area under the ROC curve (AUC) as the performance metric. Linear modelling of signature gene expression was applied to systematically classify genes as TB-only, HIV-only or combined TB/HIV.<h4>Results</h4>The optimal signature discovered was a 10-gene random forest multinomial signature that robustly discriminated active tuberculosis (TB) from other non-TB disease states with improved performance compared with previously published signatures (AUC: 0.87), and specifically discriminated active TB/HIV co-infection from all other conditions (AUC: 0.88). Signature genes exhibited a variety of transcriptional patterns including both TB-only and HIV-only response genes and genes with expression patterns driven by interactions between HIV and TB infection states, including the CD8+ T-cell receptor LAG3 and the apoptosis-related gene CERKL.<h4>Conclusions</h4>By explicitly including distinct disease states within the machine learning analysis framework, we developed a compact and highly diagnostic signature that simultaneously discriminates multiple disease states associated with Mtb/HIV co-infection. Examination of the expression patterns of signature genes suggests mechanisms underlying the unique inflammatory conditions associated with active TB in the presence of HIV. In particular, we observed that dysregulation of CD8+ effector T-cell and NK-cell associated genes may be an important feature of Mtb/HIV co-infection.https://doi.org/10.1371/journal.pone.0219322
collection DOAJ
language English
format Article
sources DOAJ
author Fergal J Duffy
Ethan G Thompson
Thomas J Scriba
Daniel E Zak
spellingShingle Fergal J Duffy
Ethan G Thompson
Thomas J Scriba
Daniel E Zak
Multinomial modelling of TB/HIV co-infection yields a robust predictive signature and generates hypotheses about the HIV+TB+ disease state.
PLoS ONE
author_facet Fergal J Duffy
Ethan G Thompson
Thomas J Scriba
Daniel E Zak
author_sort Fergal J Duffy
title Multinomial modelling of TB/HIV co-infection yields a robust predictive signature and generates hypotheses about the HIV+TB+ disease state.
title_short Multinomial modelling of TB/HIV co-infection yields a robust predictive signature and generates hypotheses about the HIV+TB+ disease state.
title_full Multinomial modelling of TB/HIV co-infection yields a robust predictive signature and generates hypotheses about the HIV+TB+ disease state.
title_fullStr Multinomial modelling of TB/HIV co-infection yields a robust predictive signature and generates hypotheses about the HIV+TB+ disease state.
title_full_unstemmed Multinomial modelling of TB/HIV co-infection yields a robust predictive signature and generates hypotheses about the HIV+TB+ disease state.
title_sort multinomial modelling of tb/hiv co-infection yields a robust predictive signature and generates hypotheses about the hiv+tb+ disease state.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2019-01-01
description <h4>Background</h4>Current diagnostics are inadequate to meet the challenges presented by co-infection with Mycobacterium tuberculosis (Mtb) and HIV, the leading cause of death for HIV-infected individuals. Improved characterization of Mtb/HIV coinfection as a distinct disease state may lead to better identification and treatment of affected individuals.<h4>Methods</h4>Four previously-published TB and HIV co-infection related datasets were used to train and validate multinomial machine learning classifiers that simultaneously predict TB and HIV status. Classifier predictive performance was measured using leave-one-out cross validation on the training set and blind predictive performance on multiple test sets using area under the ROC curve (AUC) as the performance metric. Linear modelling of signature gene expression was applied to systematically classify genes as TB-only, HIV-only or combined TB/HIV.<h4>Results</h4>The optimal signature discovered was a 10-gene random forest multinomial signature that robustly discriminated active tuberculosis (TB) from other non-TB disease states with improved performance compared with previously published signatures (AUC: 0.87), and specifically discriminated active TB/HIV co-infection from all other conditions (AUC: 0.88). Signature genes exhibited a variety of transcriptional patterns including both TB-only and HIV-only response genes and genes with expression patterns driven by interactions between HIV and TB infection states, including the CD8+ T-cell receptor LAG3 and the apoptosis-related gene CERKL.<h4>Conclusions</h4>By explicitly including distinct disease states within the machine learning analysis framework, we developed a compact and highly diagnostic signature that simultaneously discriminates multiple disease states associated with Mtb/HIV co-infection. Examination of the expression patterns of signature genes suggests mechanisms underlying the unique inflammatory conditions associated with active TB in the presence of HIV. In particular, we observed that dysregulation of CD8+ effector T-cell and NK-cell associated genes may be an important feature of Mtb/HIV co-infection.
url https://doi.org/10.1371/journal.pone.0219322
work_keys_str_mv AT fergaljduffy multinomialmodellingoftbhivcoinfectionyieldsarobustpredictivesignatureandgenerateshypothesesaboutthehivtbdiseasestate
AT ethangthompson multinomialmodellingoftbhivcoinfectionyieldsarobustpredictivesignatureandgenerateshypothesesaboutthehivtbdiseasestate
AT thomasjscriba multinomialmodellingoftbhivcoinfectionyieldsarobustpredictivesignatureandgenerateshypothesesaboutthehivtbdiseasestate
AT danielezak multinomialmodellingoftbhivcoinfectionyieldsarobustpredictivesignatureandgenerateshypothesesaboutthehivtbdiseasestate
_version_ 1714806003149570048