Unified feature association networks through integration of transcriptomic and proteomic data.
High-throughput multi-omics studies and corresponding network analyses of multi-omic data have rapidly expanded their impact over the last 10 years. As biological features of different types (e.g. transcripts, proteins, metabolites) interact within cellular systems, the greatest amount of knowledge...
Main Authors: | , , , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Public Library of Science (PLoS)
2019-09-01
|
Series: | PLoS Computational Biology |
Online Access: | https://doi.org/10.1371/journal.pcbi.1007241 |
id |
doaj-d85ab01185d84cdcb765feb025575f87 |
---|---|
record_format |
Article |
spelling |
doaj-d85ab01185d84cdcb765feb025575f872021-04-21T15:10:13ZengPublic Library of Science (PLoS)PLoS Computational Biology1553-734X1553-73582019-09-01159e100724110.1371/journal.pcbi.1007241Unified feature association networks through integration of transcriptomic and proteomic data.Ryan S McClureJason P WendlerJoshua N AdkinsJesica SwanstromRalph BaricBrooke L Deatherage KaiserKristie L OxfordKatrina M WatersJason E McDermottHigh-throughput multi-omics studies and corresponding network analyses of multi-omic data have rapidly expanded their impact over the last 10 years. As biological features of different types (e.g. transcripts, proteins, metabolites) interact within cellular systems, the greatest amount of knowledge can be gained from networks that incorporate multiple types of -omic data. However, biological and technical sources of variation diminish the ability to detect cross-type associations, yielding networks dominated by communities comprised of nodes of the same type. We describe here network building methods that can maximize edges between nodes of different data types leading to integrated networks, networks that have a large number of edges that link nodes of different-omic types (transcripts, proteins, lipids etc). We systematically rank several network inference methods and demonstrate that, in many cases, using a random forest method, GENIE3, produces the most integrated networks. This increase in integration does not come at the cost of accuracy as GENIE3 produces networks of approximately the same quality as the other network inference methods tested here. Using GENIE3, we also infer networks representing antibody-mediated Dengue virus cell invasion and receptor-mediated Dengue virus invasion. A number of functional pathways showed centrality differences between the two networks including genes responding to both GM-CSF and IL-4, which had a higher centrality value in an antibody-mediated vs. receptor-mediated Dengue network. Because a biological system involves the interplay of many different types of molecules, incorporating multiple data types into networks will improve their use as models of biological systems. The methods explored here are some of the first to specifically highlight and address the challenges associated with how such multi-omic networks can be assembled and how the greatest number of interactions can be inferred from different data types. The resulting networks can lead to the discovery of new host response patterns and interactions during viral infection, generate new hypotheses of pathogenic mechanisms and confirm mechanisms of disease.https://doi.org/10.1371/journal.pcbi.1007241 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Ryan S McClure Jason P Wendler Joshua N Adkins Jesica Swanstrom Ralph Baric Brooke L Deatherage Kaiser Kristie L Oxford Katrina M Waters Jason E McDermott |
spellingShingle |
Ryan S McClure Jason P Wendler Joshua N Adkins Jesica Swanstrom Ralph Baric Brooke L Deatherage Kaiser Kristie L Oxford Katrina M Waters Jason E McDermott Unified feature association networks through integration of transcriptomic and proteomic data. PLoS Computational Biology |
author_facet |
Ryan S McClure Jason P Wendler Joshua N Adkins Jesica Swanstrom Ralph Baric Brooke L Deatherage Kaiser Kristie L Oxford Katrina M Waters Jason E McDermott |
author_sort |
Ryan S McClure |
title |
Unified feature association networks through integration of transcriptomic and proteomic data. |
title_short |
Unified feature association networks through integration of transcriptomic and proteomic data. |
title_full |
Unified feature association networks through integration of transcriptomic and proteomic data. |
title_fullStr |
Unified feature association networks through integration of transcriptomic and proteomic data. |
title_full_unstemmed |
Unified feature association networks through integration of transcriptomic and proteomic data. |
title_sort |
unified feature association networks through integration of transcriptomic and proteomic data. |
publisher |
Public Library of Science (PLoS) |
series |
PLoS Computational Biology |
issn |
1553-734X 1553-7358 |
publishDate |
2019-09-01 |
description |
High-throughput multi-omics studies and corresponding network analyses of multi-omic data have rapidly expanded their impact over the last 10 years. As biological features of different types (e.g. transcripts, proteins, metabolites) interact within cellular systems, the greatest amount of knowledge can be gained from networks that incorporate multiple types of -omic data. However, biological and technical sources of variation diminish the ability to detect cross-type associations, yielding networks dominated by communities comprised of nodes of the same type. We describe here network building methods that can maximize edges between nodes of different data types leading to integrated networks, networks that have a large number of edges that link nodes of different-omic types (transcripts, proteins, lipids etc). We systematically rank several network inference methods and demonstrate that, in many cases, using a random forest method, GENIE3, produces the most integrated networks. This increase in integration does not come at the cost of accuracy as GENIE3 produces networks of approximately the same quality as the other network inference methods tested here. Using GENIE3, we also infer networks representing antibody-mediated Dengue virus cell invasion and receptor-mediated Dengue virus invasion. A number of functional pathways showed centrality differences between the two networks including genes responding to both GM-CSF and IL-4, which had a higher centrality value in an antibody-mediated vs. receptor-mediated Dengue network. Because a biological system involves the interplay of many different types of molecules, incorporating multiple data types into networks will improve their use as models of biological systems. The methods explored here are some of the first to specifically highlight and address the challenges associated with how such multi-omic networks can be assembled and how the greatest number of interactions can be inferred from different data types. The resulting networks can lead to the discovery of new host response patterns and interactions during viral infection, generate new hypotheses of pathogenic mechanisms and confirm mechanisms of disease. |
url |
https://doi.org/10.1371/journal.pcbi.1007241 |
work_keys_str_mv |
AT ryansmcclure unifiedfeatureassociationnetworksthroughintegrationoftranscriptomicandproteomicdata AT jasonpwendler unifiedfeatureassociationnetworksthroughintegrationoftranscriptomicandproteomicdata AT joshuanadkins unifiedfeatureassociationnetworksthroughintegrationoftranscriptomicandproteomicdata AT jesicaswanstrom unifiedfeatureassociationnetworksthroughintegrationoftranscriptomicandproteomicdata AT ralphbaric unifiedfeatureassociationnetworksthroughintegrationoftranscriptomicandproteomicdata AT brookeldeatheragekaiser unifiedfeatureassociationnetworksthroughintegrationoftranscriptomicandproteomicdata AT kristieloxford unifiedfeatureassociationnetworksthroughintegrationoftranscriptomicandproteomicdata AT katrinamwaters unifiedfeatureassociationnetworksthroughintegrationoftranscriptomicandproteomicdata AT jasonemcdermott unifiedfeatureassociationnetworksthroughintegrationoftranscriptomicandproteomicdata |
_version_ |
1714667837591650304 |