Solving the influence maximization problem reveals regulatory organization of the yeast cell cycle.

The Influence Maximization Problem (IMP) aims to discover the set of nodes with the greatest influence on network dynamics. The problem has previously been applied in epidemiology and social network analysis. Here, we demonstrate the application to cell cycle regulatory network analysis for Saccharo...

Full description

Bibliographic Details
Main Authors: David L Gibbs, Ilya Shmulevich
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2017-06-01
Series:PLoS Computational Biology
Online Access:http://europepmc.org/articles/PMC5495484?pdf=render
id doaj-05cc0e173eaf40d88c33437a22393e2b
record_format Article
spelling doaj-05cc0e173eaf40d88c33437a22393e2b2020-11-25T01:34:03ZengPublic Library of Science (PLoS)PLoS Computational Biology1553-734X1553-73582017-06-01136e100559110.1371/journal.pcbi.1005591Solving the influence maximization problem reveals regulatory organization of the yeast cell cycle.David L GibbsIlya ShmulevichThe Influence Maximization Problem (IMP) aims to discover the set of nodes with the greatest influence on network dynamics. The problem has previously been applied in epidemiology and social network analysis. Here, we demonstrate the application to cell cycle regulatory network analysis for Saccharomyces cerevisiae. Fundamentally, gene regulation is linked to the flow of information. Therefore, our implementation of the IMP was framed as an information theoretic problem using network diffusion. Utilizing more than 26,000 regulatory edges from YeastMine, gene expression dynamics were encoded as edge weights using time lagged transfer entropy, a method for quantifying information transfer between variables. By picking a set of source nodes, a diffusion process covers a portion of the network. The size of the network cover relates to the influence of the source nodes. The set of nodes that maximizes influence is the solution to the IMP. By solving the IMP over different numbers of source nodes, an influence ranking on genes was produced. The influence ranking was compared to other metrics of network centrality. Although the top genes from each centrality ranking contained well-known cell cycle regulators, there was little agreement and no clear winner. However, it was found that influential genes tend to directly regulate or sit upstream of genes ranked by other centrality measures. The influential nodes act as critical sources of information flow, potentially having a large impact on the state of the network. Biological events that affect influential nodes and thereby affect information flow could have a strong effect on network dynamics, potentially leading to disease. Code and data can be found at: https://github.com/gibbsdavidl/miergolf.http://europepmc.org/articles/PMC5495484?pdf=render
collection DOAJ
language English
format Article
sources DOAJ
author David L Gibbs
Ilya Shmulevich
spellingShingle David L Gibbs
Ilya Shmulevich
Solving the influence maximization problem reveals regulatory organization of the yeast cell cycle.
PLoS Computational Biology
author_facet David L Gibbs
Ilya Shmulevich
author_sort David L Gibbs
title Solving the influence maximization problem reveals regulatory organization of the yeast cell cycle.
title_short Solving the influence maximization problem reveals regulatory organization of the yeast cell cycle.
title_full Solving the influence maximization problem reveals regulatory organization of the yeast cell cycle.
title_fullStr Solving the influence maximization problem reveals regulatory organization of the yeast cell cycle.
title_full_unstemmed Solving the influence maximization problem reveals regulatory organization of the yeast cell cycle.
title_sort solving the influence maximization problem reveals regulatory organization of the yeast cell cycle.
publisher Public Library of Science (PLoS)
series PLoS Computational Biology
issn 1553-734X
1553-7358
publishDate 2017-06-01
description The Influence Maximization Problem (IMP) aims to discover the set of nodes with the greatest influence on network dynamics. The problem has previously been applied in epidemiology and social network analysis. Here, we demonstrate the application to cell cycle regulatory network analysis for Saccharomyces cerevisiae. Fundamentally, gene regulation is linked to the flow of information. Therefore, our implementation of the IMP was framed as an information theoretic problem using network diffusion. Utilizing more than 26,000 regulatory edges from YeastMine, gene expression dynamics were encoded as edge weights using time lagged transfer entropy, a method for quantifying information transfer between variables. By picking a set of source nodes, a diffusion process covers a portion of the network. The size of the network cover relates to the influence of the source nodes. The set of nodes that maximizes influence is the solution to the IMP. By solving the IMP over different numbers of source nodes, an influence ranking on genes was produced. The influence ranking was compared to other metrics of network centrality. Although the top genes from each centrality ranking contained well-known cell cycle regulators, there was little agreement and no clear winner. However, it was found that influential genes tend to directly regulate or sit upstream of genes ranked by other centrality measures. The influential nodes act as critical sources of information flow, potentially having a large impact on the state of the network. Biological events that affect influential nodes and thereby affect information flow could have a strong effect on network dynamics, potentially leading to disease. Code and data can be found at: https://github.com/gibbsdavidl/miergolf.
url http://europepmc.org/articles/PMC5495484?pdf=render
work_keys_str_mv AT davidlgibbs solvingtheinfluencemaximizationproblemrevealsregulatoryorganizationoftheyeastcellcycle
AT ilyashmulevich solvingtheinfluencemaximizationproblemrevealsregulatoryorganizationoftheyeastcellcycle
_version_ 1725073980746891264