A generative-discriminative framework that integrates imaging, genetic, and diagnosis into coupled low dimensional space

We propose a novel optimization framework that integrates imaging and genetics data for simultaneous biomarker identification and disease classification. The generative component of our model uses a dictionary learning framework to project the imaging and genetic data into a shared low dimensional s...

Full description

Bibliographic Details
Main Authors: Sayan Ghosal, Qiang Chen, Giulio Pergola, Aaron L. Goldman, William Ulrich, Karen F. Berman, Giuseppe Blasi, Leonardo Fazio, Antonio Rampino, Alessandro Bertolino, Daniel R. Weinberger, Venkata S. Mattay, Archana Venkataraman
Format: Article
Language:English
Published: Elsevier 2021-09-01
Series:NeuroImage
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S1053811921004778
id doaj-cf018b37d14c4dbc9f8f24dc45d26ce6
record_format Article
spelling doaj-cf018b37d14c4dbc9f8f24dc45d26ce62021-07-25T04:41:57ZengElsevierNeuroImage1095-95722021-09-01238118200A generative-discriminative framework that integrates imaging, genetic, and diagnosis into coupled low dimensional spaceSayan Ghosal0Qiang Chen1Giulio Pergola2Aaron L. Goldman3William Ulrich4Karen F. Berman5Giuseppe Blasi6Leonardo Fazio7Antonio Rampino8Alessandro Bertolino9Daniel R. Weinberger10Venkata S. Mattay11Archana Venkataraman12Corresponding author.; Department of Electrical and Computer Engineering, Johns Hopkins University, USALieber Institute for Brain Development, USALieber Institute for Brain Development, USA; Group of Psychiatric Neuroscience, Department of Basic Medical Sciences, Neuroscience and Sense Organs, University of Bari Aldo Moro, Bari, ItalyLieber Institute for Brain Development, USALieber Institute for Brain Development, USAClinical and Translational Neuroscience Branch, NIMH, NIH, USAGroup of Psychiatric Neuroscience, Department of Basic Medical Sciences, Neuroscience and Sense Organs, University of Bari Aldo Moro, Bari, Italy; Azienda Ospedaliero-Universitaria Consorziale Policlinico, Bari, ItalyGroup of Psychiatric Neuroscience, Department of Basic Medical Sciences, Neuroscience and Sense Organs, University of Bari Aldo Moro, Bari, Italy; 4IRCCS Casa Sollievo della Sofferenza, San Giovanni Rotondo (FG), ItalyGroup of Psychiatric Neuroscience, Department of Basic Medical Sciences, Neuroscience and Sense Organs, University of Bari Aldo Moro, Bari, Italy; Azienda Ospedaliero-Universitaria Consorziale Policlinico, Bari, ItalyGroup of Psychiatric Neuroscience, Department of Basic Medical Sciences, Neuroscience and Sense Organs, University of Bari Aldo Moro, Bari, Italy; Azienda Ospedaliero-Universitaria Consorziale Policlinico, Bari, ItalyLieber Institute for Brain Development, USA; Department of Psychiatry, Neurology and Neuroscience, Johns Hopkins University School of Medicine, USALieber Institute for Brain Development, USA; Department of Neurology and Radiology, Johns Hopkins University School of Medicine, USADepartment of Electrical and Computer Engineering, Johns Hopkins University, USAWe propose a novel optimization framework that integrates imaging and genetics data for simultaneous biomarker identification and disease classification. The generative component of our model uses a dictionary learning framework to project the imaging and genetic data into a shared low dimensional space. We have coupled both the data modalities by tying the linear projection coefficients to the same latent space. The discriminative component of our model uses logistic regression on the projection vectors for disease diagnosis. This prediction task implicitly guides our framework to find interpretable biomarkers that are substantially different between a healthy and disease population. We exploit the interconnectedness of different brain regions by incorporating a graph regularization penalty into the joint objective function. We also use a group sparsity penalty to find a representative set of genetic basis vectors that span a low dimensional space where subjects are easily separable between patients and controls. We have evaluated our model on a population study of schizophrenia that includes two task fMRI paradigms and single nucleotide polymorphism (SNP) data. Using ten-fold cross validation, we compare our generative-discriminative framework with canonical correlation analysis (CCA) of imaging and genetics data, parallel independent component analysis (pICA) of imaging and genetics data, random forest (RF) classification, and a linear support vector machine (SVM). We also quantify the reproducibility of the imaging and genetics biomarkers via subsampling. Our framework achieves higher class prediction accuracy and identifies robust biomarkers. Moreover, the implicated brain regions and genetic variants underlie the well documented deficits in schizophrenia.http://www.sciencedirect.com/science/article/pii/S1053811921004778Imaging-geneticsClinical diagnosisLow dimensional subspaceGraph regularization
collection DOAJ
language English
format Article
sources DOAJ
author Sayan Ghosal
Qiang Chen
Giulio Pergola
Aaron L. Goldman
William Ulrich
Karen F. Berman
Giuseppe Blasi
Leonardo Fazio
Antonio Rampino
Alessandro Bertolino
Daniel R. Weinberger
Venkata S. Mattay
Archana Venkataraman
spellingShingle Sayan Ghosal
Qiang Chen
Giulio Pergola
Aaron L. Goldman
William Ulrich
Karen F. Berman
Giuseppe Blasi
Leonardo Fazio
Antonio Rampino
Alessandro Bertolino
Daniel R. Weinberger
Venkata S. Mattay
Archana Venkataraman
A generative-discriminative framework that integrates imaging, genetic, and diagnosis into coupled low dimensional space
NeuroImage
Imaging-genetics
Clinical diagnosis
Low dimensional subspace
Graph regularization
author_facet Sayan Ghosal
Qiang Chen
Giulio Pergola
Aaron L. Goldman
William Ulrich
Karen F. Berman
Giuseppe Blasi
Leonardo Fazio
Antonio Rampino
Alessandro Bertolino
Daniel R. Weinberger
Venkata S. Mattay
Archana Venkataraman
author_sort Sayan Ghosal
title A generative-discriminative framework that integrates imaging, genetic, and diagnosis into coupled low dimensional space
title_short A generative-discriminative framework that integrates imaging, genetic, and diagnosis into coupled low dimensional space
title_full A generative-discriminative framework that integrates imaging, genetic, and diagnosis into coupled low dimensional space
title_fullStr A generative-discriminative framework that integrates imaging, genetic, and diagnosis into coupled low dimensional space
title_full_unstemmed A generative-discriminative framework that integrates imaging, genetic, and diagnosis into coupled low dimensional space
title_sort generative-discriminative framework that integrates imaging, genetic, and diagnosis into coupled low dimensional space
publisher Elsevier
series NeuroImage
issn 1095-9572
publishDate 2021-09-01
description We propose a novel optimization framework that integrates imaging and genetics data for simultaneous biomarker identification and disease classification. The generative component of our model uses a dictionary learning framework to project the imaging and genetic data into a shared low dimensional space. We have coupled both the data modalities by tying the linear projection coefficients to the same latent space. The discriminative component of our model uses logistic regression on the projection vectors for disease diagnosis. This prediction task implicitly guides our framework to find interpretable biomarkers that are substantially different between a healthy and disease population. We exploit the interconnectedness of different brain regions by incorporating a graph regularization penalty into the joint objective function. We also use a group sparsity penalty to find a representative set of genetic basis vectors that span a low dimensional space where subjects are easily separable between patients and controls. We have evaluated our model on a population study of schizophrenia that includes two task fMRI paradigms and single nucleotide polymorphism (SNP) data. Using ten-fold cross validation, we compare our generative-discriminative framework with canonical correlation analysis (CCA) of imaging and genetics data, parallel independent component analysis (pICA) of imaging and genetics data, random forest (RF) classification, and a linear support vector machine (SVM). We also quantify the reproducibility of the imaging and genetics biomarkers via subsampling. Our framework achieves higher class prediction accuracy and identifies robust biomarkers. Moreover, the implicated brain regions and genetic variants underlie the well documented deficits in schizophrenia.
topic Imaging-genetics
Clinical diagnosis
Low dimensional subspace
Graph regularization
url http://www.sciencedirect.com/science/article/pii/S1053811921004778
work_keys_str_mv AT sayanghosal agenerativediscriminativeframeworkthatintegratesimaginggeneticanddiagnosisintocoupledlowdimensionalspace
AT qiangchen agenerativediscriminativeframeworkthatintegratesimaginggeneticanddiagnosisintocoupledlowdimensionalspace
AT giuliopergola agenerativediscriminativeframeworkthatintegratesimaginggeneticanddiagnosisintocoupledlowdimensionalspace
AT aaronlgoldman agenerativediscriminativeframeworkthatintegratesimaginggeneticanddiagnosisintocoupledlowdimensionalspace
AT williamulrich agenerativediscriminativeframeworkthatintegratesimaginggeneticanddiagnosisintocoupledlowdimensionalspace
AT karenfberman agenerativediscriminativeframeworkthatintegratesimaginggeneticanddiagnosisintocoupledlowdimensionalspace
AT giuseppeblasi agenerativediscriminativeframeworkthatintegratesimaginggeneticanddiagnosisintocoupledlowdimensionalspace
AT leonardofazio agenerativediscriminativeframeworkthatintegratesimaginggeneticanddiagnosisintocoupledlowdimensionalspace
AT antoniorampino agenerativediscriminativeframeworkthatintegratesimaginggeneticanddiagnosisintocoupledlowdimensionalspace
AT alessandrobertolino agenerativediscriminativeframeworkthatintegratesimaginggeneticanddiagnosisintocoupledlowdimensionalspace
AT danielrweinberger agenerativediscriminativeframeworkthatintegratesimaginggeneticanddiagnosisintocoupledlowdimensionalspace
AT venkatasmattay agenerativediscriminativeframeworkthatintegratesimaginggeneticanddiagnosisintocoupledlowdimensionalspace
AT archanavenkataraman agenerativediscriminativeframeworkthatintegratesimaginggeneticanddiagnosisintocoupledlowdimensionalspace
AT sayanghosal generativediscriminativeframeworkthatintegratesimaginggeneticanddiagnosisintocoupledlowdimensionalspace
AT qiangchen generativediscriminativeframeworkthatintegratesimaginggeneticanddiagnosisintocoupledlowdimensionalspace
AT giuliopergola generativediscriminativeframeworkthatintegratesimaginggeneticanddiagnosisintocoupledlowdimensionalspace
AT aaronlgoldman generativediscriminativeframeworkthatintegratesimaginggeneticanddiagnosisintocoupledlowdimensionalspace
AT williamulrich generativediscriminativeframeworkthatintegratesimaginggeneticanddiagnosisintocoupledlowdimensionalspace
AT karenfberman generativediscriminativeframeworkthatintegratesimaginggeneticanddiagnosisintocoupledlowdimensionalspace
AT giuseppeblasi generativediscriminativeframeworkthatintegratesimaginggeneticanddiagnosisintocoupledlowdimensionalspace
AT leonardofazio generativediscriminativeframeworkthatintegratesimaginggeneticanddiagnosisintocoupledlowdimensionalspace
AT antoniorampino generativediscriminativeframeworkthatintegratesimaginggeneticanddiagnosisintocoupledlowdimensionalspace
AT alessandrobertolino generativediscriminativeframeworkthatintegratesimaginggeneticanddiagnosisintocoupledlowdimensionalspace
AT danielrweinberger generativediscriminativeframeworkthatintegratesimaginggeneticanddiagnosisintocoupledlowdimensionalspace
AT venkatasmattay generativediscriminativeframeworkthatintegratesimaginggeneticanddiagnosisintocoupledlowdimensionalspace
AT archanavenkataraman generativediscriminativeframeworkthatintegratesimaginggeneticanddiagnosisintocoupledlowdimensionalspace
_version_ 1721283813553733632