BAMClipper: removing primers from alignments to minimize false-negative mutations in amplicon next-generation sequencing
Abstract Amplicon-based next-generation sequencing (NGS) has been widely adopted for genetic variation detection in human and other organisms. Conventional data analysis paradigm includes primer trimming before read mapping. Here we introduce BAMClipper that removes primer sequences after mapping or...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Nature Publishing Group
2017-05-01
|
Series: | Scientific Reports |
Online Access: | https://doi.org/10.1038/s41598-017-01703-6 |
id |
doaj-6e4c0ae171ba4d6baeb272e2a5123b23 |
---|---|
record_format |
Article |
spelling |
doaj-6e4c0ae171ba4d6baeb272e2a5123b232020-12-08T02:58:31ZengNature Publishing GroupScientific Reports2045-23222017-05-01711710.1038/s41598-017-01703-6BAMClipper: removing primers from alignments to minimize false-negative mutations in amplicon next-generation sequencingChun Hang Au0Dona N. Ho1Ava Kwong2Tsun Leung Chan3Edmond S. K. Ma4Division of Molecular Pathology, Department of Pathology, Hong Kong Sanatorium & HospitalDivision of Molecular Pathology, Department of Pathology, Hong Kong Sanatorium & HospitalDepartment of Surgery, The University of Hong KongDivision of Molecular Pathology, Department of Pathology, Hong Kong Sanatorium & HospitalDivision of Molecular Pathology, Department of Pathology, Hong Kong Sanatorium & HospitalAbstract Amplicon-based next-generation sequencing (NGS) has been widely adopted for genetic variation detection in human and other organisms. Conventional data analysis paradigm includes primer trimming before read mapping. Here we introduce BAMClipper that removes primer sequences after mapping original sequencing reads by soft-clipping SAM/BAM alignments. Mutation detection accuracy was affected by the choice of primer handling approach based on real NGS datasets of 7 human peripheral blood or breast cancer tissue samples with known BRCA1/BRCA2 mutations and >130000 simulated NGS datasets with unique mutations. BAMClipper approach detected a BRCA1 deletion (c.1620_1636del) that was otherwise missed due to edge effect. Simulation showed high false-negative rate when primers were perfectly trimmed as in conventional practice. Among the other 6 samples, variant allele frequencies of 5 BRCA1/BRCA2 mutations (indel or single-nucleotide variants) were diluted by apparently wild-type primer sequences from an overlapping amplicon (17 to 82% under-estimation). BAMClipper was robust in both situations and all 7 mutations were detected. When compared with Cutadapt, BAMClipper was faster and maintained equally high primer removal effectiveness. BAMClipper is implemented in Perl and is available under an open source MIT license at https://github.com/tommyau/bamclipper.https://doi.org/10.1038/s41598-017-01703-6 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Chun Hang Au Dona N. Ho Ava Kwong Tsun Leung Chan Edmond S. K. Ma |
spellingShingle |
Chun Hang Au Dona N. Ho Ava Kwong Tsun Leung Chan Edmond S. K. Ma BAMClipper: removing primers from alignments to minimize false-negative mutations in amplicon next-generation sequencing Scientific Reports |
author_facet |
Chun Hang Au Dona N. Ho Ava Kwong Tsun Leung Chan Edmond S. K. Ma |
author_sort |
Chun Hang Au |
title |
BAMClipper: removing primers from alignments to minimize false-negative mutations in amplicon next-generation sequencing |
title_short |
BAMClipper: removing primers from alignments to minimize false-negative mutations in amplicon next-generation sequencing |
title_full |
BAMClipper: removing primers from alignments to minimize false-negative mutations in amplicon next-generation sequencing |
title_fullStr |
BAMClipper: removing primers from alignments to minimize false-negative mutations in amplicon next-generation sequencing |
title_full_unstemmed |
BAMClipper: removing primers from alignments to minimize false-negative mutations in amplicon next-generation sequencing |
title_sort |
bamclipper: removing primers from alignments to minimize false-negative mutations in amplicon next-generation sequencing |
publisher |
Nature Publishing Group |
series |
Scientific Reports |
issn |
2045-2322 |
publishDate |
2017-05-01 |
description |
Abstract Amplicon-based next-generation sequencing (NGS) has been widely adopted for genetic variation detection in human and other organisms. Conventional data analysis paradigm includes primer trimming before read mapping. Here we introduce BAMClipper that removes primer sequences after mapping original sequencing reads by soft-clipping SAM/BAM alignments. Mutation detection accuracy was affected by the choice of primer handling approach based on real NGS datasets of 7 human peripheral blood or breast cancer tissue samples with known BRCA1/BRCA2 mutations and >130000 simulated NGS datasets with unique mutations. BAMClipper approach detected a BRCA1 deletion (c.1620_1636del) that was otherwise missed due to edge effect. Simulation showed high false-negative rate when primers were perfectly trimmed as in conventional practice. Among the other 6 samples, variant allele frequencies of 5 BRCA1/BRCA2 mutations (indel or single-nucleotide variants) were diluted by apparently wild-type primer sequences from an overlapping amplicon (17 to 82% under-estimation). BAMClipper was robust in both situations and all 7 mutations were detected. When compared with Cutadapt, BAMClipper was faster and maintained equally high primer removal effectiveness. BAMClipper is implemented in Perl and is available under an open source MIT license at https://github.com/tommyau/bamclipper. |
url |
https://doi.org/10.1038/s41598-017-01703-6 |
work_keys_str_mv |
AT chunhangau bamclipperremovingprimersfromalignmentstominimizefalsenegativemutationsinampliconnextgenerationsequencing AT donanho bamclipperremovingprimersfromalignmentstominimizefalsenegativemutationsinampliconnextgenerationsequencing AT avakwong bamclipperremovingprimersfromalignmentstominimizefalsenegativemutationsinampliconnextgenerationsequencing AT tsunleungchan bamclipperremovingprimersfromalignmentstominimizefalsenegativemutationsinampliconnextgenerationsequencing AT edmondskma bamclipperremovingprimersfromalignmentstominimizefalsenegativemutationsinampliconnextgenerationsequencing |
_version_ |
1724393087532269568 |