Multi-study inference of regulatory networks for more accurate models of gene regulation.

Gene regulatory networks are composed of sub-networks that are often shared across biological processes, cell-types, and organisms. Leveraging multiple sources of information, such as publicly available gene expression datasets, could therefore be helpful when learning a network of interest. Integra...

Full description

Bibliographic Details
Main Authors:	Dayanne M Castro, Nicholas R de Veaux, Emily R Miraldi, Richard Bonneau
Format:	Article
Language:	English
Published:	Public Library of Science (PLoS) 2019-01-01
Series:	PLoS Computational Biology
Online Access:	https://doi.org/10.1371/journal.pcbi.1006591

id	doaj-8601b05fb3514647844886f1952d35e8
record_format	Article
spelling	doaj-8601b05fb3514647844886f1952d35e82021-04-21T15:11:59ZengPublic Library of Science (PLoS)PLoS Computational Biology1553-734X1553-73582019-01-01151e100659110.1371/journal.pcbi.1006591Multi-study inference of regulatory networks for more accurate models of gene regulation.Dayanne M CastroNicholas R de VeauxEmily R MiraldiRichard BonneauGene regulatory networks are composed of sub-networks that are often shared across biological processes, cell-types, and organisms. Leveraging multiple sources of information, such as publicly available gene expression datasets, could therefore be helpful when learning a network of interest. Integrating data across different studies, however, raises numerous technical concerns. Hence, a common approach in network inference, and broadly in genomics research, is to separately learn models from each dataset and combine the results. Individual models, however, often suffer from under-sampling, poor generalization and limited network recovery. In this study, we explore previous integration strategies, such as batch-correction and model ensembles, and introduce a new multitask learning approach for joint network inference across several datasets. Our method initially estimates the activities of transcription factors, and subsequently, infers the relevant network topology. As regulatory interactions are context-dependent, we estimate model coefficients as a combination of both dataset-specific and conserved components. In addition, adaptive penalties may be used to favor models that include interactions derived from multiple sources of prior knowledge including orthogonal genomics experiments. We evaluate generalization and network recovery using examples from Bacillus subtilis and Saccharomyces cerevisiae, and show that sharing information across models improves network reconstruction. Finally, we demonstrate robustness to both false positives in the prior information and heterogeneity among datasets.https://doi.org/10.1371/journal.pcbi.1006591
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Dayanne M Castro Nicholas R de Veaux Emily R Miraldi Richard Bonneau
spellingShingle	Dayanne M Castro Nicholas R de Veaux Emily R Miraldi Richard Bonneau Multi-study inference of regulatory networks for more accurate models of gene regulation. PLoS Computational Biology
author_facet	Dayanne M Castro Nicholas R de Veaux Emily R Miraldi Richard Bonneau
author_sort	Dayanne M Castro
title	Multi-study inference of regulatory networks for more accurate models of gene regulation.
title_short	Multi-study inference of regulatory networks for more accurate models of gene regulation.
title_full	Multi-study inference of regulatory networks for more accurate models of gene regulation.
title_fullStr	Multi-study inference of regulatory networks for more accurate models of gene regulation.
title_full_unstemmed	Multi-study inference of regulatory networks for more accurate models of gene regulation.
title_sort	multi-study inference of regulatory networks for more accurate models of gene regulation.
publisher	Public Library of Science (PLoS)
series	PLoS Computational Biology
issn	1553-734X 1553-7358
publishDate	2019-01-01
description	Gene regulatory networks are composed of sub-networks that are often shared across biological processes, cell-types, and organisms. Leveraging multiple sources of information, such as publicly available gene expression datasets, could therefore be helpful when learning a network of interest. Integrating data across different studies, however, raises numerous technical concerns. Hence, a common approach in network inference, and broadly in genomics research, is to separately learn models from each dataset and combine the results. Individual models, however, often suffer from under-sampling, poor generalization and limited network recovery. In this study, we explore previous integration strategies, such as batch-correction and model ensembles, and introduce a new multitask learning approach for joint network inference across several datasets. Our method initially estimates the activities of transcription factors, and subsequently, infers the relevant network topology. As regulatory interactions are context-dependent, we estimate model coefficients as a combination of both dataset-specific and conserved components. In addition, adaptive penalties may be used to favor models that include interactions derived from multiple sources of prior knowledge including orthogonal genomics experiments. We evaluate generalization and network recovery using examples from Bacillus subtilis and Saccharomyces cerevisiae, and show that sharing information across models improves network reconstruction. Finally, we demonstrate robustness to both false positives in the prior information and heterogeneity among datasets.
url	https://doi.org/10.1371/journal.pcbi.1006591
work_keys_str_mv	AT dayannemcastro multistudyinferenceofregulatorynetworksformoreaccuratemodelsofgeneregulation AT nicholasrdeveaux multistudyinferenceofregulatorynetworksformoreaccuratemodelsofgeneregulation AT emilyrmiraldi multistudyinferenceofregulatorynetworksformoreaccuratemodelsofgeneregulation AT richardbonneau multistudyinferenceofregulatorynetworksformoreaccuratemodelsofgeneregulation
_version_	1714667796311310336

Multi-study inference of regulatory networks for more accurate models of gene regulation.

Similar Items