Discovery and characterization of artifactual mutations in deep coverage targeted capture sequencing data due to oxidative DNA damage during sample preparation

As researchers begin probing deep coverage sequencing data for increasingly rare mutations and subclonal events, the fidelity of next generation sequencing (NGS) laboratory methods will become increasingly critical. Although error rates for sequencing and polymerase chain reaction (PCR) are well doc...

Full description

Bibliographic Details
Main Authors: Costello, Maura (Author), Pugh, Trevor J. (Author), Fennell, Timothy (Author), Stewart, Chip (Author), Lichtenstein, Lee (Author), Meldrim, James C. (Author), Fostel, Jennifer L. (Author), Friedrich, Dennis C. (Author), Perrin, Danielle (Author), Dionne, Danielle (Author), Kim, Sharon (Author), Gabriel, Stacey B. (Author), Fisher, Sheila (Author), Getz, Gad (Author), Lander, Eric Steven (Author)
Other Authors: Massachusetts Institute of Technology. Department of Biology (Contributor), Lander, Eric S. (Contributor)
Format: Article
Language:English
Published: Oxford University Press, 2014-07-01T14:02:06Z.
Subjects:
Online Access:Get fulltext
LEADER 02778 am a22003493u 4500
001 88159
042 |a dc 
100 1 0 |a Costello, Maura  |e author 
100 1 0 |a Massachusetts Institute of Technology. Department of Biology  |e contributor 
100 1 0 |a Lander, Eric S.  |e contributor 
700 1 0 |a Pugh, Trevor J.  |e author 
700 1 0 |a Fennell, Timothy  |e author 
700 1 0 |a Stewart, Chip  |e author 
700 1 0 |a Lichtenstein, Lee  |e author 
700 1 0 |a Meldrim, James C.  |e author 
700 1 0 |a Fostel, Jennifer L.  |e author 
700 1 0 |a Friedrich, Dennis C.  |e author 
700 1 0 |a Perrin, Danielle  |e author 
700 1 0 |a Dionne, Danielle  |e author 
700 1 0 |a Kim, Sharon  |e author 
700 1 0 |a Gabriel, Stacey B.  |e author 
700 1 0 |a Fisher, Sheila  |e author 
700 1 0 |a Getz, Gad  |e author 
700 1 0 |a Lander, Eric Steven  |e author 
245 0 0 |a Discovery and characterization of artifactual mutations in deep coverage targeted capture sequencing data due to oxidative DNA damage during sample preparation 
260 |b Oxford University Press,   |c 2014-07-01T14:02:06Z. 
856 |z Get fulltext  |u http://hdl.handle.net/1721.1/88159 
520 |a As researchers begin probing deep coverage sequencing data for increasingly rare mutations and subclonal events, the fidelity of next generation sequencing (NGS) laboratory methods will become increasingly critical. Although error rates for sequencing and polymerase chain reaction (PCR) are well documented, the effects that DNA extraction and other library preparation steps could have on downstream sequence integrity have not been thoroughly evaluated. Here, we describe the discovery of novel C > A/G > T transversion artifacts found at low allelic fractions in targeted capture data. Characteristics such as sequencer read orientation and presence in both tumor and normal samples strongly indicated a non-biological mechanism. We identified the source as oxidation of DNA during acoustic shearing in samples containing reactive contaminants from the extraction process. We show generation of 8-oxoguanine (8-oxoG) lesions during DNA shearing, present analysis tools to detect oxidation in sequencing data and suggest methods to reduce DNA oxidation through the introduction of antioxidants. Further, informatics methods are presented to confidently filter these artifacts from sequencing data sets. Though only seen in a low percentage of reads in affected samples, such artifacts could have profoundly deleterious effects on the ability to confidently call rare mutations, and eliminating other possible sources of artifacts should become a priority for the research community. 
520 |a National Human Genome Research Institute (U.S.) (HG03067-05) 
546 |a en_US 
655 7 |a Article 
773 |t Nucleic Acids Research