Long-read metagenomics retrieves complete single-contig bacterial genomes from canine feces

Abstract Background Long-read sequencing in metagenomics facilitates the assembly of complete genomes out of complex microbial communities. These genomes include essential biologic information such as the ribosomal genes or the mobile genetic elements, which are usually missed with short-reads. We a...

Full description

Bibliographic Details
Main Authors: Anna Cuscó, Daniel Pérez, Joaquim Viñes, Norma Fàbregas, Olga Francino
Format: Article
Language:English
Published: BMC 2021-05-01
Series:BMC Genomics
Subjects:
Online Access:https://doi.org/10.1186/s12864-021-07607-0
id doaj-9ef34e908ddc473cbf3b6f55ed801f92
record_format Article
spelling doaj-9ef34e908ddc473cbf3b6f55ed801f922021-05-09T11:23:07ZengBMCBMC Genomics1471-21642021-05-0122111510.1186/s12864-021-07607-0Long-read metagenomics retrieves complete single-contig bacterial genomes from canine fecesAnna Cuscó0Daniel Pérez1Joaquim Viñes2Norma Fàbregas3Olga Francino4Vetgenomics, Ed Eureka, Parc de Recerca UABMolecular Genetics Veterinary Service (SVGM), Veterinary School, Universitat Autònoma de BarcelonaVetgenomics, Ed Eureka, Parc de Recerca UABVetgenomics, Ed Eureka, Parc de Recerca UABMolecular Genetics Veterinary Service (SVGM), Veterinary School, Universitat Autònoma de BarcelonaAbstract Background Long-read sequencing in metagenomics facilitates the assembly of complete genomes out of complex microbial communities. These genomes include essential biologic information such as the ribosomal genes or the mobile genetic elements, which are usually missed with short-reads. We applied long-read metagenomics with Nanopore sequencing to retrieve high-quality metagenome-assembled genomes (HQ MAGs) from a dog fecal sample. Results We used nanopore long-read metagenomics and frameshift aware correction on a canine fecal sample and retrieved eight single-contig HQ MAGs, which were > 90% complete with < 5% contamination, and contained most ribosomal genes and tRNAs. At the technical level, we demonstrated that a high-molecular-weight DNA extraction improved the metagenomics assembly contiguity, the recovery of the rRNA operons, and the retrieval of longer and circular contigs that are potential HQ MAGs. These HQ MAGs corresponded to Succinivibrio, Sutterella, Prevotellamassilia, Phascolarctobacterium, Catenibacterium, Blautia, and Enterococcus genera. Linking our results to previous gastrointestinal microbiome reports (metagenome or 16S rRNA-based), we found that some bacterial species on the gastrointestinal tract seem to be more canid-specific –Succinivibrio, Prevotellamassilia, Phascolarctobacterium, Blautia_A sp900541345–, whereas others are more broadly distributed among animal and human microbiomes –Sutterella, Catenibacterium, Enterococcus, and Blautia sp003287895. Sutterella HQ MAG is potentially the first reported genome assembly for Sutterella stercoricanis, as assigned by 16S rRNA gene similarity. Moreover, we show that long reads are essential to detect mobilome functions, usually missed in short-read MAGs. Conclusions We recovered eight single-contig HQ MAGs from canine feces of a healthy dog with nanopore long-reads. We also retrieved relevant biological insights from these specific bacterial species previously missed in public databases, such as complete ribosomal operons and mobilome functions. The high-molecular-weight DNA extraction improved the assembly’s contiguity, whereas the high-accuracy basecalling, the raw read error correction, the assembly polishing, and the frameshift correction reduced the insertion and deletion errors. Both experimental and analytical steps ensured the retrieval of complete bacterial genomes.https://doi.org/10.1186/s12864-021-07607-0Long-read metagenomicsGastrointestinal microbiomeFecal microbiomeLong-readsNanoporeCanine microbiome
collection DOAJ
language English
format Article
sources DOAJ
author Anna Cuscó
Daniel Pérez
Joaquim Viñes
Norma Fàbregas
Olga Francino
spellingShingle Anna Cuscó
Daniel Pérez
Joaquim Viñes
Norma Fàbregas
Olga Francino
Long-read metagenomics retrieves complete single-contig bacterial genomes from canine feces
BMC Genomics
Long-read metagenomics
Gastrointestinal microbiome
Fecal microbiome
Long-reads
Nanopore
Canine microbiome
author_facet Anna Cuscó
Daniel Pérez
Joaquim Viñes
Norma Fàbregas
Olga Francino
author_sort Anna Cuscó
title Long-read metagenomics retrieves complete single-contig bacterial genomes from canine feces
title_short Long-read metagenomics retrieves complete single-contig bacterial genomes from canine feces
title_full Long-read metagenomics retrieves complete single-contig bacterial genomes from canine feces
title_fullStr Long-read metagenomics retrieves complete single-contig bacterial genomes from canine feces
title_full_unstemmed Long-read metagenomics retrieves complete single-contig bacterial genomes from canine feces
title_sort long-read metagenomics retrieves complete single-contig bacterial genomes from canine feces
publisher BMC
series BMC Genomics
issn 1471-2164
publishDate 2021-05-01
description Abstract Background Long-read sequencing in metagenomics facilitates the assembly of complete genomes out of complex microbial communities. These genomes include essential biologic information such as the ribosomal genes or the mobile genetic elements, which are usually missed with short-reads. We applied long-read metagenomics with Nanopore sequencing to retrieve high-quality metagenome-assembled genomes (HQ MAGs) from a dog fecal sample. Results We used nanopore long-read metagenomics and frameshift aware correction on a canine fecal sample and retrieved eight single-contig HQ MAGs, which were > 90% complete with < 5% contamination, and contained most ribosomal genes and tRNAs. At the technical level, we demonstrated that a high-molecular-weight DNA extraction improved the metagenomics assembly contiguity, the recovery of the rRNA operons, and the retrieval of longer and circular contigs that are potential HQ MAGs. These HQ MAGs corresponded to Succinivibrio, Sutterella, Prevotellamassilia, Phascolarctobacterium, Catenibacterium, Blautia, and Enterococcus genera. Linking our results to previous gastrointestinal microbiome reports (metagenome or 16S rRNA-based), we found that some bacterial species on the gastrointestinal tract seem to be more canid-specific –Succinivibrio, Prevotellamassilia, Phascolarctobacterium, Blautia_A sp900541345–, whereas others are more broadly distributed among animal and human microbiomes –Sutterella, Catenibacterium, Enterococcus, and Blautia sp003287895. Sutterella HQ MAG is potentially the first reported genome assembly for Sutterella stercoricanis, as assigned by 16S rRNA gene similarity. Moreover, we show that long reads are essential to detect mobilome functions, usually missed in short-read MAGs. Conclusions We recovered eight single-contig HQ MAGs from canine feces of a healthy dog with nanopore long-reads. We also retrieved relevant biological insights from these specific bacterial species previously missed in public databases, such as complete ribosomal operons and mobilome functions. The high-molecular-weight DNA extraction improved the assembly’s contiguity, whereas the high-accuracy basecalling, the raw read error correction, the assembly polishing, and the frameshift correction reduced the insertion and deletion errors. Both experimental and analytical steps ensured the retrieval of complete bacterial genomes.
topic Long-read metagenomics
Gastrointestinal microbiome
Fecal microbiome
Long-reads
Nanopore
Canine microbiome
url https://doi.org/10.1186/s12864-021-07607-0
work_keys_str_mv AT annacusco longreadmetagenomicsretrievescompletesinglecontigbacterialgenomesfromcaninefeces
AT danielperez longreadmetagenomicsretrievescompletesinglecontigbacterialgenomesfromcaninefeces
AT joaquimvines longreadmetagenomicsretrievescompletesinglecontigbacterialgenomesfromcaninefeces
AT normafabregas longreadmetagenomicsretrievescompletesinglecontigbacterialgenomesfromcaninefeces
AT olgafrancino longreadmetagenomicsretrievescompletesinglecontigbacterialgenomesfromcaninefeces
_version_ 1721454404083646464