Cell-Type-Specific Proteogenomic Signal Diffusion for Integrating Multi-Omics Data Predicts Novel Schizophrenia Risk Genes

Summary: Accumulation of diverse types of omics data on schizophrenia (SCZ) requires a systems approach to model the interplay between genome, transcriptome, and proteome. We introduce Markov affinity-based proteogenomic signal diffusion (MAPSD), a method to model intra-cellular protein trafficking...

Full description

Bibliographic Details
Main Authors: Abolfazl Doostparast Torshizi, Jubao Duan, Kai Wang
Format: Article
Language:English
Published: Elsevier 2020-09-01
Series:Patterns
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2666389920301197
id doaj-76c646fca3af49788ccc823a2ed43ce6
record_format Article
spelling doaj-76c646fca3af49788ccc823a2ed43ce62020-12-03T04:32:33ZengElsevierPatterns2666-38992020-09-0116100091Cell-Type-Specific Proteogenomic Signal Diffusion for Integrating Multi-Omics Data Predicts Novel Schizophrenia Risk GenesAbolfazl Doostparast Torshizi0Jubao Duan1Kai Wang2Raymond G. Perelman Center for Cellular and Molecular Therapeutics, Children's Hospital of Philadelphia, Philadelphia, PA 19104, USACenter for Psychiatric Genetics, North Shore University Health System, Evanston, IL 60201, USA; Department of Psychiatry and Behavioral Neurosciences, University of Chicago, Chicago, IL 60637, USARaymond G. Perelman Center for Cellular and Molecular Therapeutics, Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA; Department of Pathology and Laboratory Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA; Corresponding authorSummary: Accumulation of diverse types of omics data on schizophrenia (SCZ) requires a systems approach to model the interplay between genome, transcriptome, and proteome. We introduce Markov affinity-based proteogenomic signal diffusion (MAPSD), a method to model intra-cellular protein trafficking paradigms and tissue-wise single-cell protein abundances. MAPSD integrates multi-omics data to amplify the signals at SCZ risk loci with small effect sizes, and reveal convergent disease-associated gene modules in the brain. We predicted a set of high-confidence SCZ risk loci followed by characterizing the subcellular localization of proteins encoded by candidate SCZ risk genes, and illustrated that most are enriched in neuronal cells in the cerebral cortex as well as Purkinje cells in the cerebellum. We demonstrated how the identified genes may be involved in neurodevelopment, how they may alter SCZ-related biological pathways, and how they facilitate drug repurposing. MAPSD is applicable in other polygenic diseases and can facilitate our understanding of disease mechanisms. The Bigger Picture: Proteins constitute the functional machinery in a cell. Genetic aberrations may cause disrupting the normal functionality of the proteins. On the other hand, biophysical and biochemical properties of proteins vary in distinct tissues mandating separate modeling of proteomic features given the tissue being studied, e.g. brain in case of schizophrenia. Using the concept of signal diffusion in graph theory, we proposed a model, termed MAPSD, which enables us to leverage proteomic properties of different tissues at single cell resolution along with genomic and epigenomic features of a disease in order to predict potential risk genes which cannot be annotated using common univariate approaches. Taking this approach helps create novel therapeutic hypotheses for precision medicine so that more effective treatments with less side effects on other organs can be developed. Application of MAPSD is not restricted to schizophrenia and most of complex diseases can benefit from the method.http://www.sciencedirect.com/science/article/pii/S2666389920301197signal diffusionprotein-protein interactioninteraction networksproteomemulti-omicsschizophrenia
collection DOAJ
language English
format Article
sources DOAJ
author Abolfazl Doostparast Torshizi
Jubao Duan
Kai Wang
spellingShingle Abolfazl Doostparast Torshizi
Jubao Duan
Kai Wang
Cell-Type-Specific Proteogenomic Signal Diffusion for Integrating Multi-Omics Data Predicts Novel Schizophrenia Risk Genes
Patterns
signal diffusion
protein-protein interaction
interaction networks
proteome
multi-omics
schizophrenia
author_facet Abolfazl Doostparast Torshizi
Jubao Duan
Kai Wang
author_sort Abolfazl Doostparast Torshizi
title Cell-Type-Specific Proteogenomic Signal Diffusion for Integrating Multi-Omics Data Predicts Novel Schizophrenia Risk Genes
title_short Cell-Type-Specific Proteogenomic Signal Diffusion for Integrating Multi-Omics Data Predicts Novel Schizophrenia Risk Genes
title_full Cell-Type-Specific Proteogenomic Signal Diffusion for Integrating Multi-Omics Data Predicts Novel Schizophrenia Risk Genes
title_fullStr Cell-Type-Specific Proteogenomic Signal Diffusion for Integrating Multi-Omics Data Predicts Novel Schizophrenia Risk Genes
title_full_unstemmed Cell-Type-Specific Proteogenomic Signal Diffusion for Integrating Multi-Omics Data Predicts Novel Schizophrenia Risk Genes
title_sort cell-type-specific proteogenomic signal diffusion for integrating multi-omics data predicts novel schizophrenia risk genes
publisher Elsevier
series Patterns
issn 2666-3899
publishDate 2020-09-01
description Summary: Accumulation of diverse types of omics data on schizophrenia (SCZ) requires a systems approach to model the interplay between genome, transcriptome, and proteome. We introduce Markov affinity-based proteogenomic signal diffusion (MAPSD), a method to model intra-cellular protein trafficking paradigms and tissue-wise single-cell protein abundances. MAPSD integrates multi-omics data to amplify the signals at SCZ risk loci with small effect sizes, and reveal convergent disease-associated gene modules in the brain. We predicted a set of high-confidence SCZ risk loci followed by characterizing the subcellular localization of proteins encoded by candidate SCZ risk genes, and illustrated that most are enriched in neuronal cells in the cerebral cortex as well as Purkinje cells in the cerebellum. We demonstrated how the identified genes may be involved in neurodevelopment, how they may alter SCZ-related biological pathways, and how they facilitate drug repurposing. MAPSD is applicable in other polygenic diseases and can facilitate our understanding of disease mechanisms. The Bigger Picture: Proteins constitute the functional machinery in a cell. Genetic aberrations may cause disrupting the normal functionality of the proteins. On the other hand, biophysical and biochemical properties of proteins vary in distinct tissues mandating separate modeling of proteomic features given the tissue being studied, e.g. brain in case of schizophrenia. Using the concept of signal diffusion in graph theory, we proposed a model, termed MAPSD, which enables us to leverage proteomic properties of different tissues at single cell resolution along with genomic and epigenomic features of a disease in order to predict potential risk genes which cannot be annotated using common univariate approaches. Taking this approach helps create novel therapeutic hypotheses for precision medicine so that more effective treatments with less side effects on other organs can be developed. Application of MAPSD is not restricted to schizophrenia and most of complex diseases can benefit from the method.
topic signal diffusion
protein-protein interaction
interaction networks
proteome
multi-omics
schizophrenia
url http://www.sciencedirect.com/science/article/pii/S2666389920301197
work_keys_str_mv AT abolfazldoostparasttorshizi celltypespecificproteogenomicsignaldiffusionforintegratingmultiomicsdatapredictsnovelschizophreniariskgenes
AT jubaoduan celltypespecificproteogenomicsignaldiffusionforintegratingmultiomicsdatapredictsnovelschizophreniariskgenes
AT kaiwang celltypespecificproteogenomicsignaldiffusionforintegratingmultiomicsdatapredictsnovelschizophreniariskgenes
_version_ 1724401391007432704