SalMotifDB: a tool for analyzing putative transcription factor binding sites in salmonid genomes

Abstract Background Recently developed genome resources in Salmonid fish provides tools for studying the genomics underlying a wide range of properties including life history trait variation in the wild, economically important traits in aquaculture and the evolutionary consequences of whole genome d...

Full description

Bibliographic Details
Main Authors: Teshome Dagne Mulugeta, Torfinn Nome, Thu-Hien To, Manu Kumar Gundappa, Daniel J. Macqueen, Dag Inge Våge, Simen Rød Sandve, Torgeir R. Hvidsten
Format: Article
Language:English
Published: BMC 2019-09-01
Series:BMC Genomics
Subjects:
Online Access:http://link.springer.com/article/10.1186/s12864-019-6051-0
id doaj-28d34678ab7444a3b6aa715aa83a66f1
record_format Article
spelling doaj-28d34678ab7444a3b6aa715aa83a66f12020-11-25T02:31:44ZengBMCBMC Genomics1471-21642019-09-012011810.1186/s12864-019-6051-0SalMotifDB: a tool for analyzing putative transcription factor binding sites in salmonid genomesTeshome Dagne Mulugeta0Torfinn Nome1Thu-Hien To2Manu Kumar Gundappa3Daniel J. Macqueen4Dag Inge Våge5Simen Rød Sandve6Torgeir R. Hvidsten7Centre for Integrative Genetics (CIGENE), Department of Animal and Aquacultural Sciences, Faculty of Biosciences, Norwegian University of Life SciencesCentre for Integrative Genetics (CIGENE), Department of Animal and Aquacultural Sciences, Faculty of Biosciences, Norwegian University of Life SciencesCentre for Integrative Genetics (CIGENE), Department of Animal and Aquacultural Sciences, Faculty of Biosciences, Norwegian University of Life SciencesThe Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of EdinburghThe Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of EdinburghCentre for Integrative Genetics (CIGENE), Department of Animal and Aquacultural Sciences, Faculty of Biosciences, Norwegian University of Life SciencesCentre for Integrative Genetics (CIGENE), Department of Animal and Aquacultural Sciences, Faculty of Biosciences, Norwegian University of Life SciencesFaculty of Chemistry, Biotechnology and Food Science, Norwegian University of Life SciencesAbstract Background Recently developed genome resources in Salmonid fish provides tools for studying the genomics underlying a wide range of properties including life history trait variation in the wild, economically important traits in aquaculture and the evolutionary consequences of whole genome duplications. Although genome assemblies now exist for a number of salmonid species, the lack of regulatory annotations are holding back our mechanistic understanding of how genetic variation in non-coding regulatory regions affect gene expression and the downstream phenotypic effects. Results We present SalMotifDB, a database and associated web and R interface for the analysis of transcription factors (TFs) and their cis-regulatory binding sites in five salmonid genomes. SalMotifDB integrates TF-binding site information for 3072 non-redundant DNA patterns (motifs) assembled from a large number of metazoan motif databases. Through motif matching and TF prediction, we have used these multi-species databases to construct putative regulatory networks in salmonid species. The utility of SalMotifDB is demonstrated by showing that key lipid metabolism regulators are predicted to regulate a set of genes affected by different lipid and fatty acid content in the feed, and by showing that our motif database explains a significant proportion of gene expression divergence in gene duplicates originating from the salmonid specific whole genome duplication. Conclusions SalMotifDB is an effective tool for analyzing transcription factors, their binding sites and the resulting gene regulatory networks in salmonid species, and will be an important tool for gaining a better mechanistic understanding of gene regulation and the associated phenotypes in salmonids. SalMotifDB is available at https://salmobase.org/apps/SalMotifDB.http://link.springer.com/article/10.1186/s12864-019-6051-0Transcription factor binding sitesRegulatory networksSalmonid genomicsGene regulationWeb tool
collection DOAJ
language English
format Article
sources DOAJ
author Teshome Dagne Mulugeta
Torfinn Nome
Thu-Hien To
Manu Kumar Gundappa
Daniel J. Macqueen
Dag Inge Våge
Simen Rød Sandve
Torgeir R. Hvidsten
spellingShingle Teshome Dagne Mulugeta
Torfinn Nome
Thu-Hien To
Manu Kumar Gundappa
Daniel J. Macqueen
Dag Inge Våge
Simen Rød Sandve
Torgeir R. Hvidsten
SalMotifDB: a tool for analyzing putative transcription factor binding sites in salmonid genomes
BMC Genomics
Transcription factor binding sites
Regulatory networks
Salmonid genomics
Gene regulation
Web tool
author_facet Teshome Dagne Mulugeta
Torfinn Nome
Thu-Hien To
Manu Kumar Gundappa
Daniel J. Macqueen
Dag Inge Våge
Simen Rød Sandve
Torgeir R. Hvidsten
author_sort Teshome Dagne Mulugeta
title SalMotifDB: a tool for analyzing putative transcription factor binding sites in salmonid genomes
title_short SalMotifDB: a tool for analyzing putative transcription factor binding sites in salmonid genomes
title_full SalMotifDB: a tool for analyzing putative transcription factor binding sites in salmonid genomes
title_fullStr SalMotifDB: a tool for analyzing putative transcription factor binding sites in salmonid genomes
title_full_unstemmed SalMotifDB: a tool for analyzing putative transcription factor binding sites in salmonid genomes
title_sort salmotifdb: a tool for analyzing putative transcription factor binding sites in salmonid genomes
publisher BMC
series BMC Genomics
issn 1471-2164
publishDate 2019-09-01
description Abstract Background Recently developed genome resources in Salmonid fish provides tools for studying the genomics underlying a wide range of properties including life history trait variation in the wild, economically important traits in aquaculture and the evolutionary consequences of whole genome duplications. Although genome assemblies now exist for a number of salmonid species, the lack of regulatory annotations are holding back our mechanistic understanding of how genetic variation in non-coding regulatory regions affect gene expression and the downstream phenotypic effects. Results We present SalMotifDB, a database and associated web and R interface for the analysis of transcription factors (TFs) and their cis-regulatory binding sites in five salmonid genomes. SalMotifDB integrates TF-binding site information for 3072 non-redundant DNA patterns (motifs) assembled from a large number of metazoan motif databases. Through motif matching and TF prediction, we have used these multi-species databases to construct putative regulatory networks in salmonid species. The utility of SalMotifDB is demonstrated by showing that key lipid metabolism regulators are predicted to regulate a set of genes affected by different lipid and fatty acid content in the feed, and by showing that our motif database explains a significant proportion of gene expression divergence in gene duplicates originating from the salmonid specific whole genome duplication. Conclusions SalMotifDB is an effective tool for analyzing transcription factors, their binding sites and the resulting gene regulatory networks in salmonid species, and will be an important tool for gaining a better mechanistic understanding of gene regulation and the associated phenotypes in salmonids. SalMotifDB is available at https://salmobase.org/apps/SalMotifDB.
topic Transcription factor binding sites
Regulatory networks
Salmonid genomics
Gene regulation
Web tool
url http://link.springer.com/article/10.1186/s12864-019-6051-0
work_keys_str_mv AT teshomedagnemulugeta salmotifdbatoolforanalyzingputativetranscriptionfactorbindingsitesinsalmonidgenomes
AT torfinnnome salmotifdbatoolforanalyzingputativetranscriptionfactorbindingsitesinsalmonidgenomes
AT thuhiento salmotifdbatoolforanalyzingputativetranscriptionfactorbindingsitesinsalmonidgenomes
AT manukumargundappa salmotifdbatoolforanalyzingputativetranscriptionfactorbindingsitesinsalmonidgenomes
AT danieljmacqueen salmotifdbatoolforanalyzingputativetranscriptionfactorbindingsitesinsalmonidgenomes
AT dagingevage salmotifdbatoolforanalyzingputativetranscriptionfactorbindingsitesinsalmonidgenomes
AT simenrødsandve salmotifdbatoolforanalyzingputativetranscriptionfactorbindingsitesinsalmonidgenomes
AT torgeirrhvidsten salmotifdbatoolforanalyzingputativetranscriptionfactorbindingsitesinsalmonidgenomes
_version_ 1724822345611214848