Reference-free population genomics from next-generation transcriptome data and the vertebrate-invertebrate gap.
In animals, the population genomic literature is dominated by two taxa, namely mammals and drosophilids, in which fully sequenced, well-annotated genomes have been available for years. Data from other metazoan phyla are scarce, probably because the vast majority of living species still lack a closel...
Main Authors: | , , , , , , , , , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Public Library of Science (PLoS)
2013-04-01
|
Series: | PLoS Genetics |
Online Access: | http://europepmc.org/articles/PMC3623758?pdf=render |
id |
doaj-e8d1932589f54ffa8de430a8edb39e09 |
---|---|
record_format |
Article |
spelling |
doaj-e8d1932589f54ffa8de430a8edb39e092020-11-25T01:16:11ZengPublic Library of Science (PLoS)PLoS Genetics1553-73901553-74042013-04-0194e100345710.1371/journal.pgen.1003457Reference-free population genomics from next-generation transcriptome data and the vertebrate-invertebrate gap.Philippe GayralJosé Melo-FerreiraSylvain GléminNicolas BierneMiguel CarneiroBenoit NabholzJoao M LourencoPaulo C AlvesMarion BallenghienNicolas FaivreKhalid BelkhirVincent CahaisEtienne LoireAurélien BernardNicolas GaltierIn animals, the population genomic literature is dominated by two taxa, namely mammals and drosophilids, in which fully sequenced, well-annotated genomes have been available for years. Data from other metazoan phyla are scarce, probably because the vast majority of living species still lack a closely related reference genome. Here we achieve de novo, reference-free population genomic analysis from wild samples in five non-model animal species, based on next-generation sequencing transcriptome data. We introduce a pipe-line for cDNA assembly, read mapping, SNP/genotype calling, and data cleaning, with specific focus on the issue of hidden paralogy detection. In two species for which a reference genome is available, similar results were obtained whether the reference was used or not, demonstrating the robustness of our de novo inferences. The population genomic profile of a hare, a turtle, an oyster, a tunicate, and a termite were found to be intermediate between those of human and Drosophila, indicating that the discordant genomic diversity patterns that have been reported between these two species do not reflect a generalized vertebrate versus invertebrate gap. The genomic average diversity was generally higher in invertebrates than in vertebrates (with the notable exception of termite), in agreement with the notion that population size tends to be larger in the former than in the latter. The non-synonymous to synonymous ratio, however, did not differ significantly between vertebrates and invertebrates, even though it was negatively correlated with genetic diversity within each of the two groups. This study opens promising perspective regarding genome-wide population analyses of non-model organisms and the influence of population size on non-synonymous versus synonymous diversity.http://europepmc.org/articles/PMC3623758?pdf=render |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Philippe Gayral José Melo-Ferreira Sylvain Glémin Nicolas Bierne Miguel Carneiro Benoit Nabholz Joao M Lourenco Paulo C Alves Marion Ballenghien Nicolas Faivre Khalid Belkhir Vincent Cahais Etienne Loire Aurélien Bernard Nicolas Galtier |
spellingShingle |
Philippe Gayral José Melo-Ferreira Sylvain Glémin Nicolas Bierne Miguel Carneiro Benoit Nabholz Joao M Lourenco Paulo C Alves Marion Ballenghien Nicolas Faivre Khalid Belkhir Vincent Cahais Etienne Loire Aurélien Bernard Nicolas Galtier Reference-free population genomics from next-generation transcriptome data and the vertebrate-invertebrate gap. PLoS Genetics |
author_facet |
Philippe Gayral José Melo-Ferreira Sylvain Glémin Nicolas Bierne Miguel Carneiro Benoit Nabholz Joao M Lourenco Paulo C Alves Marion Ballenghien Nicolas Faivre Khalid Belkhir Vincent Cahais Etienne Loire Aurélien Bernard Nicolas Galtier |
author_sort |
Philippe Gayral |
title |
Reference-free population genomics from next-generation transcriptome data and the vertebrate-invertebrate gap. |
title_short |
Reference-free population genomics from next-generation transcriptome data and the vertebrate-invertebrate gap. |
title_full |
Reference-free population genomics from next-generation transcriptome data and the vertebrate-invertebrate gap. |
title_fullStr |
Reference-free population genomics from next-generation transcriptome data and the vertebrate-invertebrate gap. |
title_full_unstemmed |
Reference-free population genomics from next-generation transcriptome data and the vertebrate-invertebrate gap. |
title_sort |
reference-free population genomics from next-generation transcriptome data and the vertebrate-invertebrate gap. |
publisher |
Public Library of Science (PLoS) |
series |
PLoS Genetics |
issn |
1553-7390 1553-7404 |
publishDate |
2013-04-01 |
description |
In animals, the population genomic literature is dominated by two taxa, namely mammals and drosophilids, in which fully sequenced, well-annotated genomes have been available for years. Data from other metazoan phyla are scarce, probably because the vast majority of living species still lack a closely related reference genome. Here we achieve de novo, reference-free population genomic analysis from wild samples in five non-model animal species, based on next-generation sequencing transcriptome data. We introduce a pipe-line for cDNA assembly, read mapping, SNP/genotype calling, and data cleaning, with specific focus on the issue of hidden paralogy detection. In two species for which a reference genome is available, similar results were obtained whether the reference was used or not, demonstrating the robustness of our de novo inferences. The population genomic profile of a hare, a turtle, an oyster, a tunicate, and a termite were found to be intermediate between those of human and Drosophila, indicating that the discordant genomic diversity patterns that have been reported between these two species do not reflect a generalized vertebrate versus invertebrate gap. The genomic average diversity was generally higher in invertebrates than in vertebrates (with the notable exception of termite), in agreement with the notion that population size tends to be larger in the former than in the latter. The non-synonymous to synonymous ratio, however, did not differ significantly between vertebrates and invertebrates, even though it was negatively correlated with genetic diversity within each of the two groups. This study opens promising perspective regarding genome-wide population analyses of non-model organisms and the influence of population size on non-synonymous versus synonymous diversity. |
url |
http://europepmc.org/articles/PMC3623758?pdf=render |
work_keys_str_mv |
AT philippegayral referencefreepopulationgenomicsfromnextgenerationtranscriptomedataandthevertebrateinvertebrategap AT josemeloferreira referencefreepopulationgenomicsfromnextgenerationtranscriptomedataandthevertebrateinvertebrategap AT sylvainglemin referencefreepopulationgenomicsfromnextgenerationtranscriptomedataandthevertebrateinvertebrategap AT nicolasbierne referencefreepopulationgenomicsfromnextgenerationtranscriptomedataandthevertebrateinvertebrategap AT miguelcarneiro referencefreepopulationgenomicsfromnextgenerationtranscriptomedataandthevertebrateinvertebrategap AT benoitnabholz referencefreepopulationgenomicsfromnextgenerationtranscriptomedataandthevertebrateinvertebrategap AT joaomlourenco referencefreepopulationgenomicsfromnextgenerationtranscriptomedataandthevertebrateinvertebrategap AT paulocalves referencefreepopulationgenomicsfromnextgenerationtranscriptomedataandthevertebrateinvertebrategap AT marionballenghien referencefreepopulationgenomicsfromnextgenerationtranscriptomedataandthevertebrateinvertebrategap AT nicolasfaivre referencefreepopulationgenomicsfromnextgenerationtranscriptomedataandthevertebrateinvertebrategap AT khalidbelkhir referencefreepopulationgenomicsfromnextgenerationtranscriptomedataandthevertebrateinvertebrategap AT vincentcahais referencefreepopulationgenomicsfromnextgenerationtranscriptomedataandthevertebrateinvertebrategap AT etienneloire referencefreepopulationgenomicsfromnextgenerationtranscriptomedataandthevertebrateinvertebrategap AT aurelienbernard referencefreepopulationgenomicsfromnextgenerationtranscriptomedataandthevertebrateinvertebrategap AT nicolasgaltier referencefreepopulationgenomicsfromnextgenerationtranscriptomedataandthevertebrateinvertebrategap |
_version_ |
1725150821585256448 |