UltraStrain: An NGS-Based Ultra Sensitive Strain Typing Method for Salmonella enterica

In the last few years, advances in next-generation sequencing (NGS) technology for whole genome sequencing (WGS) of foodborne pathogens have provided drastic improvements in food pathogen outbreak surveillance. WGS of foodborne pathogen enables identification of pathogens from food or environmental...

Full description

Bibliographic Details
Main Authors: Wenxian Yang, Lihong Huang, Chong Shi, Liansheng Wang, Rongshan Yu
Format: Article
Language:English
Published: Frontiers Media S.A. 2019-04-01
Series:Frontiers in Genetics
Subjects:
Online Access:https://www.frontiersin.org/article/10.3389/fgene.2019.00276/full
id doaj-dcaf9f7db8904d3a9f8330affb2c54df
record_format Article
spelling doaj-dcaf9f7db8904d3a9f8330affb2c54df2020-11-25T01:02:09ZengFrontiers Media S.A.Frontiers in Genetics1664-80212019-04-011010.3389/fgene.2019.00276444845UltraStrain: An NGS-Based Ultra Sensitive Strain Typing Method for Salmonella entericaWenxian Yang0Lihong Huang1Chong Shi2Liansheng Wang3Rongshan Yu4Rongshan Yu5Aginome-XMU Joint Lab, Xiamen University, Xiamen, ChinaSchool of Information Science and Engineering, Xiamen University, Xiamen, ChinaSchool of Information Science and Engineering, Xiamen University, Xiamen, ChinaSchool of Information Science and Engineering, Xiamen University, Xiamen, ChinaAginome-XMU Joint Lab, Xiamen University, Xiamen, ChinaSchool of Information Science and Engineering, Xiamen University, Xiamen, ChinaIn the last few years, advances in next-generation sequencing (NGS) technology for whole genome sequencing (WGS) of foodborne pathogens have provided drastic improvements in food pathogen outbreak surveillance. WGS of foodborne pathogen enables identification of pathogens from food or environmental samples, including difficult-to-detect pathogens in culture-negative infections. Compared to traditional low-resolution methods such as the pulsed-field gel electrophoresis (PFGE), WGS provides advantages to differentiate even closely related strains of the same species, thus enables rapid identification of food-source associated with pathogen outbreak events for a fast mitigation plan. In this paper, we present UltraStrain, which is a fast and ultra sensitive pathogen detection and strain typing method for Salmonella enterica (S. enterica) based on WGS data analysis. In the proposed method, a noise filtering step is first performed where the raw sequencing data are mapped to a synthetic species-specific reference genome generated from S. enterica specific marker sequences to avoid potential interference from closely related species for low spike samples. After that, a statistical learning based method is used to identify candidate strains, from a database of known S. enterica strains, that best explain the retained S. enterica specific reads.Finally, a refinement step is further performed by mapping all the reads before filtering onto the identified top candidate strains, and recalculating the probability of presence for each candidate strain. Experiment results using both synthetic and real sequencing data show that the proposed method is able to identify the correct S. enterica strains from low-spike samples, and outperforms several existing strain-typing methods in terms of sensitivity and accuracy.https://www.frontiersin.org/article/10.3389/fgene.2019.00276/fullmetagenomesnext-generation sequencing (NGS)whole genome sequencing (WGS)Salmonella entericastrain typing
collection DOAJ
language English
format Article
sources DOAJ
author Wenxian Yang
Lihong Huang
Chong Shi
Liansheng Wang
Rongshan Yu
Rongshan Yu
spellingShingle Wenxian Yang
Lihong Huang
Chong Shi
Liansheng Wang
Rongshan Yu
Rongshan Yu
UltraStrain: An NGS-Based Ultra Sensitive Strain Typing Method for Salmonella enterica
Frontiers in Genetics
metagenomes
next-generation sequencing (NGS)
whole genome sequencing (WGS)
Salmonella enterica
strain typing
author_facet Wenxian Yang
Lihong Huang
Chong Shi
Liansheng Wang
Rongshan Yu
Rongshan Yu
author_sort Wenxian Yang
title UltraStrain: An NGS-Based Ultra Sensitive Strain Typing Method for Salmonella enterica
title_short UltraStrain: An NGS-Based Ultra Sensitive Strain Typing Method for Salmonella enterica
title_full UltraStrain: An NGS-Based Ultra Sensitive Strain Typing Method for Salmonella enterica
title_fullStr UltraStrain: An NGS-Based Ultra Sensitive Strain Typing Method for Salmonella enterica
title_full_unstemmed UltraStrain: An NGS-Based Ultra Sensitive Strain Typing Method for Salmonella enterica
title_sort ultrastrain: an ngs-based ultra sensitive strain typing method for salmonella enterica
publisher Frontiers Media S.A.
series Frontiers in Genetics
issn 1664-8021
publishDate 2019-04-01
description In the last few years, advances in next-generation sequencing (NGS) technology for whole genome sequencing (WGS) of foodborne pathogens have provided drastic improvements in food pathogen outbreak surveillance. WGS of foodborne pathogen enables identification of pathogens from food or environmental samples, including difficult-to-detect pathogens in culture-negative infections. Compared to traditional low-resolution methods such as the pulsed-field gel electrophoresis (PFGE), WGS provides advantages to differentiate even closely related strains of the same species, thus enables rapid identification of food-source associated with pathogen outbreak events for a fast mitigation plan. In this paper, we present UltraStrain, which is a fast and ultra sensitive pathogen detection and strain typing method for Salmonella enterica (S. enterica) based on WGS data analysis. In the proposed method, a noise filtering step is first performed where the raw sequencing data are mapped to a synthetic species-specific reference genome generated from S. enterica specific marker sequences to avoid potential interference from closely related species for low spike samples. After that, a statistical learning based method is used to identify candidate strains, from a database of known S. enterica strains, that best explain the retained S. enterica specific reads.Finally, a refinement step is further performed by mapping all the reads before filtering onto the identified top candidate strains, and recalculating the probability of presence for each candidate strain. Experiment results using both synthetic and real sequencing data show that the proposed method is able to identify the correct S. enterica strains from low-spike samples, and outperforms several existing strain-typing methods in terms of sensitivity and accuracy.
topic metagenomes
next-generation sequencing (NGS)
whole genome sequencing (WGS)
Salmonella enterica
strain typing
url https://www.frontiersin.org/article/10.3389/fgene.2019.00276/full
work_keys_str_mv AT wenxianyang ultrastrainanngsbasedultrasensitivestraintypingmethodforsalmonellaenterica
AT lihonghuang ultrastrainanngsbasedultrasensitivestraintypingmethodforsalmonellaenterica
AT chongshi ultrastrainanngsbasedultrasensitivestraintypingmethodforsalmonellaenterica
AT lianshengwang ultrastrainanngsbasedultrasensitivestraintypingmethodforsalmonellaenterica
AT rongshanyu ultrastrainanngsbasedultrasensitivestraintypingmethodforsalmonellaenterica
AT rongshanyu ultrastrainanngsbasedultrasensitivestraintypingmethodforsalmonellaenterica
_version_ 1725206396871376896