BAMClipper: removing primers from alignments to minimize false-negative mutations in amplicon next-generation sequencing

Abstract Amplicon-based next-generation sequencing (NGS) has been widely adopted for genetic variation detection in human and other organisms. Conventional data analysis paradigm includes primer trimming before read mapping. Here we introduce BAMClipper that removes primer sequences after mapping or...

Full description

Bibliographic Details
Main Authors: Chun Hang Au, Dona N. Ho, Ava Kwong, Tsun Leung Chan, Edmond S. K. Ma
Format: Article
Language:English
Published: Nature Publishing Group 2017-05-01
Series:Scientific Reports
Online Access:https://doi.org/10.1038/s41598-017-01703-6
id doaj-6e4c0ae171ba4d6baeb272e2a5123b23
record_format Article
spelling doaj-6e4c0ae171ba4d6baeb272e2a5123b232020-12-08T02:58:31ZengNature Publishing GroupScientific Reports2045-23222017-05-01711710.1038/s41598-017-01703-6BAMClipper: removing primers from alignments to minimize false-negative mutations in amplicon next-generation sequencingChun Hang Au0Dona N. Ho1Ava Kwong2Tsun Leung Chan3Edmond S. K. Ma4Division of Molecular Pathology, Department of Pathology, Hong Kong Sanatorium & HospitalDivision of Molecular Pathology, Department of Pathology, Hong Kong Sanatorium & HospitalDepartment of Surgery, The University of Hong KongDivision of Molecular Pathology, Department of Pathology, Hong Kong Sanatorium & HospitalDivision of Molecular Pathology, Department of Pathology, Hong Kong Sanatorium & HospitalAbstract Amplicon-based next-generation sequencing (NGS) has been widely adopted for genetic variation detection in human and other organisms. Conventional data analysis paradigm includes primer trimming before read mapping. Here we introduce BAMClipper that removes primer sequences after mapping original sequencing reads by soft-clipping SAM/BAM alignments. Mutation detection accuracy was affected by the choice of primer handling approach based on real NGS datasets of 7 human peripheral blood or breast cancer tissue samples with known BRCA1/BRCA2 mutations and >130000 simulated NGS datasets with unique mutations. BAMClipper approach detected a BRCA1 deletion (c.1620_1636del) that was otherwise missed due to edge effect. Simulation showed high false-negative rate when primers were perfectly trimmed as in conventional practice. Among the other 6 samples, variant allele frequencies of 5 BRCA1/BRCA2 mutations (indel or single-nucleotide variants) were diluted by apparently wild-type primer sequences from an overlapping amplicon (17 to 82% under-estimation). BAMClipper was robust in both situations and all 7 mutations were detected. When compared with Cutadapt, BAMClipper was faster and maintained equally high primer removal effectiveness. BAMClipper is implemented in Perl and is available under an open source MIT license at https://github.com/tommyau/bamclipper.https://doi.org/10.1038/s41598-017-01703-6
collection DOAJ
language English
format Article
sources DOAJ
author Chun Hang Au
Dona N. Ho
Ava Kwong
Tsun Leung Chan
Edmond S. K. Ma
spellingShingle Chun Hang Au
Dona N. Ho
Ava Kwong
Tsun Leung Chan
Edmond S. K. Ma
BAMClipper: removing primers from alignments to minimize false-negative mutations in amplicon next-generation sequencing
Scientific Reports
author_facet Chun Hang Au
Dona N. Ho
Ava Kwong
Tsun Leung Chan
Edmond S. K. Ma
author_sort Chun Hang Au
title BAMClipper: removing primers from alignments to minimize false-negative mutations in amplicon next-generation sequencing
title_short BAMClipper: removing primers from alignments to minimize false-negative mutations in amplicon next-generation sequencing
title_full BAMClipper: removing primers from alignments to minimize false-negative mutations in amplicon next-generation sequencing
title_fullStr BAMClipper: removing primers from alignments to minimize false-negative mutations in amplicon next-generation sequencing
title_full_unstemmed BAMClipper: removing primers from alignments to minimize false-negative mutations in amplicon next-generation sequencing
title_sort bamclipper: removing primers from alignments to minimize false-negative mutations in amplicon next-generation sequencing
publisher Nature Publishing Group
series Scientific Reports
issn 2045-2322
publishDate 2017-05-01
description Abstract Amplicon-based next-generation sequencing (NGS) has been widely adopted for genetic variation detection in human and other organisms. Conventional data analysis paradigm includes primer trimming before read mapping. Here we introduce BAMClipper that removes primer sequences after mapping original sequencing reads by soft-clipping SAM/BAM alignments. Mutation detection accuracy was affected by the choice of primer handling approach based on real NGS datasets of 7 human peripheral blood or breast cancer tissue samples with known BRCA1/BRCA2 mutations and >130000 simulated NGS datasets with unique mutations. BAMClipper approach detected a BRCA1 deletion (c.1620_1636del) that was otherwise missed due to edge effect. Simulation showed high false-negative rate when primers were perfectly trimmed as in conventional practice. Among the other 6 samples, variant allele frequencies of 5 BRCA1/BRCA2 mutations (indel or single-nucleotide variants) were diluted by apparently wild-type primer sequences from an overlapping amplicon (17 to 82% under-estimation). BAMClipper was robust in both situations and all 7 mutations were detected. When compared with Cutadapt, BAMClipper was faster and maintained equally high primer removal effectiveness. BAMClipper is implemented in Perl and is available under an open source MIT license at https://github.com/tommyau/bamclipper.
url https://doi.org/10.1038/s41598-017-01703-6
work_keys_str_mv AT chunhangau bamclipperremovingprimersfromalignmentstominimizefalsenegativemutationsinampliconnextgenerationsequencing
AT donanho bamclipperremovingprimersfromalignmentstominimizefalsenegativemutationsinampliconnextgenerationsequencing
AT avakwong bamclipperremovingprimersfromalignmentstominimizefalsenegativemutationsinampliconnextgenerationsequencing
AT tsunleungchan bamclipperremovingprimersfromalignmentstominimizefalsenegativemutationsinampliconnextgenerationsequencing
AT edmondskma bamclipperremovingprimersfromalignmentstominimizefalsenegativemutationsinampliconnextgenerationsequencing
_version_ 1724393087532269568