Reference-free population genomics from next-generation transcriptome data and the vertebrate-invertebrate gap.

In animals, the population genomic literature is dominated by two taxa, namely mammals and drosophilids, in which fully sequenced, well-annotated genomes have been available for years. Data from other metazoan phyla are scarce, probably because the vast majority of living species still lack a closel...

Full description

Bibliographic Details
Main Authors: Philippe Gayral, José Melo-Ferreira, Sylvain Glémin, Nicolas Bierne, Miguel Carneiro, Benoit Nabholz, Joao M Lourenco, Paulo C Alves, Marion Ballenghien, Nicolas Faivre, Khalid Belkhir, Vincent Cahais, Etienne Loire, Aurélien Bernard, Nicolas Galtier
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2013-04-01
Series:PLoS Genetics
Online Access:http://europepmc.org/articles/PMC3623758?pdf=render
id doaj-e8d1932589f54ffa8de430a8edb39e09
record_format Article
spelling doaj-e8d1932589f54ffa8de430a8edb39e092020-11-25T01:16:11ZengPublic Library of Science (PLoS)PLoS Genetics1553-73901553-74042013-04-0194e100345710.1371/journal.pgen.1003457Reference-free population genomics from next-generation transcriptome data and the vertebrate-invertebrate gap.Philippe GayralJosé Melo-FerreiraSylvain GléminNicolas BierneMiguel CarneiroBenoit NabholzJoao M LourencoPaulo C AlvesMarion BallenghienNicolas FaivreKhalid BelkhirVincent CahaisEtienne LoireAurélien BernardNicolas GaltierIn animals, the population genomic literature is dominated by two taxa, namely mammals and drosophilids, in which fully sequenced, well-annotated genomes have been available for years. Data from other metazoan phyla are scarce, probably because the vast majority of living species still lack a closely related reference genome. Here we achieve de novo, reference-free population genomic analysis from wild samples in five non-model animal species, based on next-generation sequencing transcriptome data. We introduce a pipe-line for cDNA assembly, read mapping, SNP/genotype calling, and data cleaning, with specific focus on the issue of hidden paralogy detection. In two species for which a reference genome is available, similar results were obtained whether the reference was used or not, demonstrating the robustness of our de novo inferences. The population genomic profile of a hare, a turtle, an oyster, a tunicate, and a termite were found to be intermediate between those of human and Drosophila, indicating that the discordant genomic diversity patterns that have been reported between these two species do not reflect a generalized vertebrate versus invertebrate gap. The genomic average diversity was generally higher in invertebrates than in vertebrates (with the notable exception of termite), in agreement with the notion that population size tends to be larger in the former than in the latter. The non-synonymous to synonymous ratio, however, did not differ significantly between vertebrates and invertebrates, even though it was negatively correlated with genetic diversity within each of the two groups. This study opens promising perspective regarding genome-wide population analyses of non-model organisms and the influence of population size on non-synonymous versus synonymous diversity.http://europepmc.org/articles/PMC3623758?pdf=render
collection DOAJ
language English
format Article
sources DOAJ
author Philippe Gayral
José Melo-Ferreira
Sylvain Glémin
Nicolas Bierne
Miguel Carneiro
Benoit Nabholz
Joao M Lourenco
Paulo C Alves
Marion Ballenghien
Nicolas Faivre
Khalid Belkhir
Vincent Cahais
Etienne Loire
Aurélien Bernard
Nicolas Galtier
spellingShingle Philippe Gayral
José Melo-Ferreira
Sylvain Glémin
Nicolas Bierne
Miguel Carneiro
Benoit Nabholz
Joao M Lourenco
Paulo C Alves
Marion Ballenghien
Nicolas Faivre
Khalid Belkhir
Vincent Cahais
Etienne Loire
Aurélien Bernard
Nicolas Galtier
Reference-free population genomics from next-generation transcriptome data and the vertebrate-invertebrate gap.
PLoS Genetics
author_facet Philippe Gayral
José Melo-Ferreira
Sylvain Glémin
Nicolas Bierne
Miguel Carneiro
Benoit Nabholz
Joao M Lourenco
Paulo C Alves
Marion Ballenghien
Nicolas Faivre
Khalid Belkhir
Vincent Cahais
Etienne Loire
Aurélien Bernard
Nicolas Galtier
author_sort Philippe Gayral
title Reference-free population genomics from next-generation transcriptome data and the vertebrate-invertebrate gap.
title_short Reference-free population genomics from next-generation transcriptome data and the vertebrate-invertebrate gap.
title_full Reference-free population genomics from next-generation transcriptome data and the vertebrate-invertebrate gap.
title_fullStr Reference-free population genomics from next-generation transcriptome data and the vertebrate-invertebrate gap.
title_full_unstemmed Reference-free population genomics from next-generation transcriptome data and the vertebrate-invertebrate gap.
title_sort reference-free population genomics from next-generation transcriptome data and the vertebrate-invertebrate gap.
publisher Public Library of Science (PLoS)
series PLoS Genetics
issn 1553-7390
1553-7404
publishDate 2013-04-01
description In animals, the population genomic literature is dominated by two taxa, namely mammals and drosophilids, in which fully sequenced, well-annotated genomes have been available for years. Data from other metazoan phyla are scarce, probably because the vast majority of living species still lack a closely related reference genome. Here we achieve de novo, reference-free population genomic analysis from wild samples in five non-model animal species, based on next-generation sequencing transcriptome data. We introduce a pipe-line for cDNA assembly, read mapping, SNP/genotype calling, and data cleaning, with specific focus on the issue of hidden paralogy detection. In two species for which a reference genome is available, similar results were obtained whether the reference was used or not, demonstrating the robustness of our de novo inferences. The population genomic profile of a hare, a turtle, an oyster, a tunicate, and a termite were found to be intermediate between those of human and Drosophila, indicating that the discordant genomic diversity patterns that have been reported between these two species do not reflect a generalized vertebrate versus invertebrate gap. The genomic average diversity was generally higher in invertebrates than in vertebrates (with the notable exception of termite), in agreement with the notion that population size tends to be larger in the former than in the latter. The non-synonymous to synonymous ratio, however, did not differ significantly between vertebrates and invertebrates, even though it was negatively correlated with genetic diversity within each of the two groups. This study opens promising perspective regarding genome-wide population analyses of non-model organisms and the influence of population size on non-synonymous versus synonymous diversity.
url http://europepmc.org/articles/PMC3623758?pdf=render
work_keys_str_mv AT philippegayral referencefreepopulationgenomicsfromnextgenerationtranscriptomedataandthevertebrateinvertebrategap
AT josemeloferreira referencefreepopulationgenomicsfromnextgenerationtranscriptomedataandthevertebrateinvertebrategap
AT sylvainglemin referencefreepopulationgenomicsfromnextgenerationtranscriptomedataandthevertebrateinvertebrategap
AT nicolasbierne referencefreepopulationgenomicsfromnextgenerationtranscriptomedataandthevertebrateinvertebrategap
AT miguelcarneiro referencefreepopulationgenomicsfromnextgenerationtranscriptomedataandthevertebrateinvertebrategap
AT benoitnabholz referencefreepopulationgenomicsfromnextgenerationtranscriptomedataandthevertebrateinvertebrategap
AT joaomlourenco referencefreepopulationgenomicsfromnextgenerationtranscriptomedataandthevertebrateinvertebrategap
AT paulocalves referencefreepopulationgenomicsfromnextgenerationtranscriptomedataandthevertebrateinvertebrategap
AT marionballenghien referencefreepopulationgenomicsfromnextgenerationtranscriptomedataandthevertebrateinvertebrategap
AT nicolasfaivre referencefreepopulationgenomicsfromnextgenerationtranscriptomedataandthevertebrateinvertebrategap
AT khalidbelkhir referencefreepopulationgenomicsfromnextgenerationtranscriptomedataandthevertebrateinvertebrategap
AT vincentcahais referencefreepopulationgenomicsfromnextgenerationtranscriptomedataandthevertebrateinvertebrategap
AT etienneloire referencefreepopulationgenomicsfromnextgenerationtranscriptomedataandthevertebrateinvertebrategap
AT aurelienbernard referencefreepopulationgenomicsfromnextgenerationtranscriptomedataandthevertebrateinvertebrategap
AT nicolasgaltier referencefreepopulationgenomicsfromnextgenerationtranscriptomedataandthevertebrateinvertebrategap
_version_ 1725150821585256448