Developing a network view of type 2 diabetes risk pathways through integration of genetic, genomic and functional data

Abstract Background Genome-wide association studies (GWAS) have identified several hundred susceptibility loci for type 2 diabetes (T2D). One critical, but unresolved, issue concerns the extent to which the mechanisms through which these diverse signals influencing T2D predisposition converge on a l...

Full description

Bibliographic Details
Main Authors: Juan Fernández-Tajes, Kyle J. Gaulton, Martijn van de Bunt, Jason Torres, Matthias Thurner, Anubha Mahajan, Anna L. Gloyn, Kasper Lage, Mark I. McCarthy
Format: Article
Language:English
Published: BMC 2019-03-01
Series:Genome Medicine
Online Access:http://link.springer.com/article/10.1186/s13073-019-0628-8
id doaj-6be6fea4470f48a6bbf9cc44fdf798cf
record_format Article
spelling doaj-6be6fea4470f48a6bbf9cc44fdf798cf2020-11-25T02:40:35ZengBMCGenome Medicine1756-994X2019-03-0111111410.1186/s13073-019-0628-8Developing a network view of type 2 diabetes risk pathways through integration of genetic, genomic and functional dataJuan Fernández-Tajes0Kyle J. Gaulton1Martijn van de Bunt2Jason Torres3Matthias Thurner4Anubha Mahajan5Anna L. Gloyn6Kasper Lage7Mark I. McCarthy8Wellcome Centre for Human Genetics, University of OxfordDepartment of Pediatrics, University of CaliforniaWellcome Centre for Human Genetics, University of OxfordWellcome Centre for Human Genetics, University of OxfordWellcome Centre for Human Genetics, University of OxfordWellcome Centre for Human Genetics, University of OxfordWellcome Centre for Human Genetics, University of OxfordDepartment of Surgery, Massachusetts, General HospitalWellcome Centre for Human Genetics, University of OxfordAbstract Background Genome-wide association studies (GWAS) have identified several hundred susceptibility loci for type 2 diabetes (T2D). One critical, but unresolved, issue concerns the extent to which the mechanisms through which these diverse signals influencing T2D predisposition converge on a limited set of biological processes. However, the causal variants identified by GWAS mostly fall into a non-coding sequence, complicating the task of defining the effector transcripts through which they operate. Methods Here, we describe implementation of an analytical pipeline to address this question. First, we integrate multiple sources of genetic, genomic and biological data to assign positional candidacy scores to the genes that map to T2D GWAS signals. Second, we introduce genes with high scores as seeds within a network optimization algorithm (the asymmetric prize-collecting Steiner tree approach) which uses external, experimentally confirmed protein-protein interaction (PPI) data to generate high-confidence sub-networks. Third, we use GWAS data to test the T2D association enrichment of the “non-seed” proteins introduced into the network, as a measure of the overall functional connectivity of the network. Results We find (a) non-seed proteins in the T2D protein-interaction network so generated (comprising 705 nodes) are enriched for association to T2D (p = 0.0014) but not control traits, (b) stronger T2D-enrichment for islets than other tissues when we use RNA expression data to generate tissue-specific PPI networks and (c) enhanced enrichment (p = 3.9 × 10− 5) when we combine the analysis of the islet-specific PPI network with a focus on the subset of T2D GWAS loci which act through defective insulin secretion. Conclusions These analyses reveal a pattern of non-random functional connectivity between candidate causal genes at T2D GWAS loci and highlight the products of genes including YWHAG, SMAD4 or CDK2 as potential contributors to T2D-relevant islet dysfunction. The approach we describe can be applied to other complex genetic and genomic datasets, facilitating integration of diverse data types into disease-associated networks.http://link.springer.com/article/10.1186/s13073-019-0628-8
collection DOAJ
language English
format Article
sources DOAJ
author Juan Fernández-Tajes
Kyle J. Gaulton
Martijn van de Bunt
Jason Torres
Matthias Thurner
Anubha Mahajan
Anna L. Gloyn
Kasper Lage
Mark I. McCarthy
spellingShingle Juan Fernández-Tajes
Kyle J. Gaulton
Martijn van de Bunt
Jason Torres
Matthias Thurner
Anubha Mahajan
Anna L. Gloyn
Kasper Lage
Mark I. McCarthy
Developing a network view of type 2 diabetes risk pathways through integration of genetic, genomic and functional data
Genome Medicine
author_facet Juan Fernández-Tajes
Kyle J. Gaulton
Martijn van de Bunt
Jason Torres
Matthias Thurner
Anubha Mahajan
Anna L. Gloyn
Kasper Lage
Mark I. McCarthy
author_sort Juan Fernández-Tajes
title Developing a network view of type 2 diabetes risk pathways through integration of genetic, genomic and functional data
title_short Developing a network view of type 2 diabetes risk pathways through integration of genetic, genomic and functional data
title_full Developing a network view of type 2 diabetes risk pathways through integration of genetic, genomic and functional data
title_fullStr Developing a network view of type 2 diabetes risk pathways through integration of genetic, genomic and functional data
title_full_unstemmed Developing a network view of type 2 diabetes risk pathways through integration of genetic, genomic and functional data
title_sort developing a network view of type 2 diabetes risk pathways through integration of genetic, genomic and functional data
publisher BMC
series Genome Medicine
issn 1756-994X
publishDate 2019-03-01
description Abstract Background Genome-wide association studies (GWAS) have identified several hundred susceptibility loci for type 2 diabetes (T2D). One critical, but unresolved, issue concerns the extent to which the mechanisms through which these diverse signals influencing T2D predisposition converge on a limited set of biological processes. However, the causal variants identified by GWAS mostly fall into a non-coding sequence, complicating the task of defining the effector transcripts through which they operate. Methods Here, we describe implementation of an analytical pipeline to address this question. First, we integrate multiple sources of genetic, genomic and biological data to assign positional candidacy scores to the genes that map to T2D GWAS signals. Second, we introduce genes with high scores as seeds within a network optimization algorithm (the asymmetric prize-collecting Steiner tree approach) which uses external, experimentally confirmed protein-protein interaction (PPI) data to generate high-confidence sub-networks. Third, we use GWAS data to test the T2D association enrichment of the “non-seed” proteins introduced into the network, as a measure of the overall functional connectivity of the network. Results We find (a) non-seed proteins in the T2D protein-interaction network so generated (comprising 705 nodes) are enriched for association to T2D (p = 0.0014) but not control traits, (b) stronger T2D-enrichment for islets than other tissues when we use RNA expression data to generate tissue-specific PPI networks and (c) enhanced enrichment (p = 3.9 × 10− 5) when we combine the analysis of the islet-specific PPI network with a focus on the subset of T2D GWAS loci which act through defective insulin secretion. Conclusions These analyses reveal a pattern of non-random functional connectivity between candidate causal genes at T2D GWAS loci and highlight the products of genes including YWHAG, SMAD4 or CDK2 as potential contributors to T2D-relevant islet dysfunction. The approach we describe can be applied to other complex genetic and genomic datasets, facilitating integration of diverse data types into disease-associated networks.
url http://link.springer.com/article/10.1186/s13073-019-0628-8
work_keys_str_mv AT juanfernandeztajes developinganetworkviewoftype2diabetesriskpathwaysthroughintegrationofgeneticgenomicandfunctionaldata
AT kylejgaulton developinganetworkviewoftype2diabetesriskpathwaysthroughintegrationofgeneticgenomicandfunctionaldata
AT martijnvandebunt developinganetworkviewoftype2diabetesriskpathwaysthroughintegrationofgeneticgenomicandfunctionaldata
AT jasontorres developinganetworkviewoftype2diabetesriskpathwaysthroughintegrationofgeneticgenomicandfunctionaldata
AT matthiasthurner developinganetworkviewoftype2diabetesriskpathwaysthroughintegrationofgeneticgenomicandfunctionaldata
AT anubhamahajan developinganetworkviewoftype2diabetesriskpathwaysthroughintegrationofgeneticgenomicandfunctionaldata
AT annalgloyn developinganetworkviewoftype2diabetesriskpathwaysthroughintegrationofgeneticgenomicandfunctionaldata
AT kasperlage developinganetworkviewoftype2diabetesriskpathwaysthroughintegrationofgeneticgenomicandfunctionaldata
AT markimccarthy developinganetworkviewoftype2diabetesriskpathwaysthroughintegrationofgeneticgenomicandfunctionaldata
_version_ 1724780659695681536