Repeatability and reproducibility assessment in a large-scale population-based microbiota study: case study on human milk microbiota

Abstract Background Quality control including assessment of batch variabilities and confirmation of repeatability and reproducibility are integral component of high throughput omics studies including microbiome research. Batch effects can mask true biological results and/or result in irreproducible...

Full description

Bibliographic Details
Main Authors:	Shirin Moossavi, Kelsey Fehr, Ehsan Khafipour, Meghan B. Azad
Format:	Article
Language:	English
Published:	BMC 2021-02-01
Series:	Microbiome
Subjects:	Decontam Reagent contaminant Batch variation Repeatability Reproducibility Microbiome
Online Access:	https://doi.org/10.1186/s40168-020-00998-4

id	doaj-d3d82c5ba3554f35b2fe1368613a992b
record_format	Article
spelling	doaj-d3d82c5ba3554f35b2fe1368613a992b2021-02-14T12:46:50ZengBMCMicrobiome2049-26182021-02-019111010.1186/s40168-020-00998-4Repeatability and reproducibility assessment in a large-scale population-based microbiota study: case study on human milk microbiotaShirin Moossavi0Kelsey Fehr1Ehsan Khafipour2Meghan B. Azad3Department of Medical Microbiology and Infectious Diseases, University of ManitobaChildren’s Hospital Research Institute of ManitobaDepartment of Animal Science, University of ManitobaChildren’s Hospital Research Institute of ManitobaAbstract Background Quality control including assessment of batch variabilities and confirmation of repeatability and reproducibility are integral component of high throughput omics studies including microbiome research. Batch effects can mask true biological results and/or result in irreproducible conclusions and interpretations. Low biomass samples in microbiome research are prone to reagent contamination; yet, quality control procedures for low biomass samples in large-scale microbiome studies are not well established. Results In this study, we have proposed a framework for an in-depth step-by-step approach to address this gap. The framework consists of three independent stages: (1) verification of sequencing accuracy by assessing technical repeatability and reproducibility of the results using mock communities and biological controls; (2) contaminant removal and batch variability correction by applying a two-tier strategy using statistical algorithms (e.g. decontam) followed by comparison of the data structure between batches; and (3) corroborating the repeatability and reproducibility of microbiome composition and downstream statistical analysis. Using this approach on the milk microbiota data from the CHILD Cohort generated in two batches (extracted and sequenced in 2016 and 2019), we were able to identify potential reagent contaminants that were missed with standard algorithms and substantially reduce contaminant-induced batch variability. Additionally, we confirmed the repeatability and reproducibility of our results in each batch before merging them for downstream analysis. Conclusion This study provides important insight to advance quality control efforts in low biomass microbiome research. Within-study quality control that takes advantage of the data structure (i.e. differential prevalence of contaminants between batches) would enhance the overall reliability and reproducibility of research in this field. Video abstracthttps://doi.org/10.1186/s40168-020-00998-4DecontamReagent contaminantBatch variationRepeatabilityReproducibilityMicrobiome
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Shirin Moossavi Kelsey Fehr Ehsan Khafipour Meghan B. Azad
spellingShingle	Shirin Moossavi Kelsey Fehr Ehsan Khafipour Meghan B. Azad Repeatability and reproducibility assessment in a large-scale population-based microbiota study: case study on human milk microbiota Microbiome Decontam Reagent contaminant Batch variation Repeatability Reproducibility Microbiome
author_facet	Shirin Moossavi Kelsey Fehr Ehsan Khafipour Meghan B. Azad
author_sort	Shirin Moossavi
title	Repeatability and reproducibility assessment in a large-scale population-based microbiota study: case study on human milk microbiota
title_short	Repeatability and reproducibility assessment in a large-scale population-based microbiota study: case study on human milk microbiota
title_full	Repeatability and reproducibility assessment in a large-scale population-based microbiota study: case study on human milk microbiota
title_fullStr	Repeatability and reproducibility assessment in a large-scale population-based microbiota study: case study on human milk microbiota
title_full_unstemmed	Repeatability and reproducibility assessment in a large-scale population-based microbiota study: case study on human milk microbiota
title_sort	repeatability and reproducibility assessment in a large-scale population-based microbiota study: case study on human milk microbiota
publisher	BMC
series	Microbiome
issn	2049-2618
publishDate	2021-02-01
description	Abstract Background Quality control including assessment of batch variabilities and confirmation of repeatability and reproducibility are integral component of high throughput omics studies including microbiome research. Batch effects can mask true biological results and/or result in irreproducible conclusions and interpretations. Low biomass samples in microbiome research are prone to reagent contamination; yet, quality control procedures for low biomass samples in large-scale microbiome studies are not well established. Results In this study, we have proposed a framework for an in-depth step-by-step approach to address this gap. The framework consists of three independent stages: (1) verification of sequencing accuracy by assessing technical repeatability and reproducibility of the results using mock communities and biological controls; (2) contaminant removal and batch variability correction by applying a two-tier strategy using statistical algorithms (e.g. decontam) followed by comparison of the data structure between batches; and (3) corroborating the repeatability and reproducibility of microbiome composition and downstream statistical analysis. Using this approach on the milk microbiota data from the CHILD Cohort generated in two batches (extracted and sequenced in 2016 and 2019), we were able to identify potential reagent contaminants that were missed with standard algorithms and substantially reduce contaminant-induced batch variability. Additionally, we confirmed the repeatability and reproducibility of our results in each batch before merging them for downstream analysis. Conclusion This study provides important insight to advance quality control efforts in low biomass microbiome research. Within-study quality control that takes advantage of the data structure (i.e. differential prevalence of contaminants between batches) would enhance the overall reliability and reproducibility of research in this field. Video abstract
topic	Decontam Reagent contaminant Batch variation Repeatability Reproducibility Microbiome
url	https://doi.org/10.1186/s40168-020-00998-4
work_keys_str_mv	AT shirinmoossavi repeatabilityandreproducibilityassessmentinalargescalepopulationbasedmicrobiotastudycasestudyonhumanmilkmicrobiota AT kelseyfehr repeatabilityandreproducibilityassessmentinalargescalepopulationbasedmicrobiotastudycasestudyonhumanmilkmicrobiota AT ehsankhafipour repeatabilityandreproducibilityassessmentinalargescalepopulationbasedmicrobiotastudycasestudyonhumanmilkmicrobiota AT meghanbazad repeatabilityandreproducibilityassessmentinalargescalepopulationbasedmicrobiotastudycasestudyonhumanmilkmicrobiota
_version_	1724270106242973696

Repeatability and reproducibility assessment in a large-scale population-based microbiota study: case study on human milk microbiota

Similar Items