Multi-task learning for the simultaneous reconstruction of the human and mouse gene regulatory networks

Abstract The reconstruction of Gene Regulatory Networks (GRNs) from gene expression data, supported by machine learning approaches, has received increasing attention in recent years. The task at hand is to identify regulatory links between genes in a network. However, existing methods often suffer w...

Full description

Bibliographic Details
Main Authors: Paolo Mignone, Gianvito Pio, Sašo Džeroski, Michelangelo Ceci
Format: Article
Language:English
Published: Nature Publishing Group 2020-12-01
Series:Scientific Reports
Online Access:https://doi.org/10.1038/s41598-020-78033-7
id doaj-3fa04fcf691b4bbcbf6dcf37d380ecce
record_format Article
spelling doaj-3fa04fcf691b4bbcbf6dcf37d380ecce2020-12-20T12:32:08ZengNature Publishing GroupScientific Reports2045-23222020-12-0110111510.1038/s41598-020-78033-7Multi-task learning for the simultaneous reconstruction of the human and mouse gene regulatory networksPaolo Mignone0Gianvito Pio1Sašo Džeroski2Michelangelo Ceci3Department of Computer Science, University of Bari Aldo MoroDepartment of Computer Science, University of Bari Aldo MoroDepartment of Knowledge Technologies, Jožef Stefan InstituteDepartment of Computer Science, University of Bari Aldo MoroAbstract The reconstruction of Gene Regulatory Networks (GRNs) from gene expression data, supported by machine learning approaches, has received increasing attention in recent years. The task at hand is to identify regulatory links between genes in a network. However, existing methods often suffer when the number of labeled examples is low or when no negative examples are available. In this paper we propose a multi-task method that is able to simultaneously reconstruct the human and the mouse GRNs using the similarities between the two. This is done by exploiting, in a transfer learning approach, possible dependencies that may exist among them. Simultaneously, we solve the issues arising from the limited availability of examples of links by relying on a novel clustering-based approach, able to estimate the degree of certainty of unlabeled examples of links, so that they can be exploited during the training together with the labeled examples. Our experiments show that the proposed method can reconstruct both the human and the mouse GRNs more effectively compared to reconstructing each network separately. Moreover, it significantly outperforms three state-of-the-art transfer learning approaches that, analogously to our method, can exploit the knowledge coming from both organisms. Finally, a specific robustness analysis reveals that, even when the number of labeled examples is very low with respect to the number of unlabeled examples, the proposed method is almost always able to outperform its single-task counterpart.https://doi.org/10.1038/s41598-020-78033-7
collection DOAJ
language English
format Article
sources DOAJ
author Paolo Mignone
Gianvito Pio
Sašo Džeroski
Michelangelo Ceci
spellingShingle Paolo Mignone
Gianvito Pio
Sašo Džeroski
Michelangelo Ceci
Multi-task learning for the simultaneous reconstruction of the human and mouse gene regulatory networks
Scientific Reports
author_facet Paolo Mignone
Gianvito Pio
Sašo Džeroski
Michelangelo Ceci
author_sort Paolo Mignone
title Multi-task learning for the simultaneous reconstruction of the human and mouse gene regulatory networks
title_short Multi-task learning for the simultaneous reconstruction of the human and mouse gene regulatory networks
title_full Multi-task learning for the simultaneous reconstruction of the human and mouse gene regulatory networks
title_fullStr Multi-task learning for the simultaneous reconstruction of the human and mouse gene regulatory networks
title_full_unstemmed Multi-task learning for the simultaneous reconstruction of the human and mouse gene regulatory networks
title_sort multi-task learning for the simultaneous reconstruction of the human and mouse gene regulatory networks
publisher Nature Publishing Group
series Scientific Reports
issn 2045-2322
publishDate 2020-12-01
description Abstract The reconstruction of Gene Regulatory Networks (GRNs) from gene expression data, supported by machine learning approaches, has received increasing attention in recent years. The task at hand is to identify regulatory links between genes in a network. However, existing methods often suffer when the number of labeled examples is low or when no negative examples are available. In this paper we propose a multi-task method that is able to simultaneously reconstruct the human and the mouse GRNs using the similarities between the two. This is done by exploiting, in a transfer learning approach, possible dependencies that may exist among them. Simultaneously, we solve the issues arising from the limited availability of examples of links by relying on a novel clustering-based approach, able to estimate the degree of certainty of unlabeled examples of links, so that they can be exploited during the training together with the labeled examples. Our experiments show that the proposed method can reconstruct both the human and the mouse GRNs more effectively compared to reconstructing each network separately. Moreover, it significantly outperforms three state-of-the-art transfer learning approaches that, analogously to our method, can exploit the knowledge coming from both organisms. Finally, a specific robustness analysis reveals that, even when the number of labeled examples is very low with respect to the number of unlabeled examples, the proposed method is almost always able to outperform its single-task counterpart.
url https://doi.org/10.1038/s41598-020-78033-7
work_keys_str_mv AT paolomignone multitasklearningforthesimultaneousreconstructionofthehumanandmousegeneregulatorynetworks
AT gianvitopio multitasklearningforthesimultaneousreconstructionofthehumanandmousegeneregulatorynetworks
AT sasodzeroski multitasklearningforthesimultaneousreconstructionofthehumanandmousegeneregulatorynetworks
AT michelangeloceci multitasklearningforthesimultaneousreconstructionofthehumanandmousegeneregulatorynetworks
_version_ 1724376471310434304