QPLOT: A Quality Assessment Tool for Next Generation Sequencing Data

Background. Next generation sequencing (NGS) is being widely used to identify genetic variants associated with human disease. Although the approach is cost effective, the underlying data is susceptible to many types of error. Importantly, since NGS technologies and protocols are rapidly evolving, wi...

Full description

Bibliographic Details
Main Authors: Bingshan Li, Xiaowei Zhan, Mary-Kate Wing, Paul Anderson, Hyun Min Kang, Goncalo R. Abecasis
Format: Article
Language:English
Published: Hindawi Limited 2013-01-01
Series:BioMed Research International
Online Access:http://dx.doi.org/10.1155/2013/865181
id doaj-2ab6814dd00043d38866e3692168581a
record_format Article
spelling doaj-2ab6814dd00043d38866e3692168581a2020-11-24T20:53:55ZengHindawi LimitedBioMed Research International2314-61332314-61412013-01-01201310.1155/2013/865181865181QPLOT: A Quality Assessment Tool for Next Generation Sequencing DataBingshan Li0Xiaowei Zhan1Mary-Kate Wing2Paul Anderson3Hyun Min Kang4Goncalo R. Abecasis5Department of Physiology and Biophysics, Center for Human Genetics Research, Vanderbilt University, Nashville, TN 37232, USADepartment of Biostatistics, Center for Statistical Genetics, University of Michigan, Ann Arbor, MI 48109, USADepartment of Biostatistics, Center for Statistical Genetics, University of Michigan, Ann Arbor, MI 48109, USADepartment of Biostatistics, Center for Statistical Genetics, University of Michigan, Ann Arbor, MI 48109, USADepartment of Biostatistics, Center for Statistical Genetics, University of Michigan, Ann Arbor, MI 48109, USADepartment of Biostatistics, Center for Statistical Genetics, University of Michigan, Ann Arbor, MI 48109, USABackground. Next generation sequencing (NGS) is being widely used to identify genetic variants associated with human disease. Although the approach is cost effective, the underlying data is susceptible to many types of error. Importantly, since NGS technologies and protocols are rapidly evolving, with constantly changing steps ranging from sample preparation to data processing software updates, it is important to enable researchers to routinely assess the quality of sequencing and alignment data prior to downstream analyses. Results. Here we describe QPLOT, an automated tool that can facilitate the quality assessment of sequencing run performance. Taking standard sequence alignments as input, QPLOT generates a series of diagnostic metrics summarizing run quality and produces convenient graphical summaries for these metrics. QPLOT is computationally efficient, generates webpages for interactive exploration of detailed results, and can handle the joint output of many sequencing runs. Conclusion. QPLOT is an automated tool that facilitates assessment of sequence run quality. We routinely apply QPLOT to ensure quick detection of diagnostic of sequencing run problems. We hope that QPLOT will be useful to the community as well.http://dx.doi.org/10.1155/2013/865181
collection DOAJ
language English
format Article
sources DOAJ
author Bingshan Li
Xiaowei Zhan
Mary-Kate Wing
Paul Anderson
Hyun Min Kang
Goncalo R. Abecasis
spellingShingle Bingshan Li
Xiaowei Zhan
Mary-Kate Wing
Paul Anderson
Hyun Min Kang
Goncalo R. Abecasis
QPLOT: A Quality Assessment Tool for Next Generation Sequencing Data
BioMed Research International
author_facet Bingshan Li
Xiaowei Zhan
Mary-Kate Wing
Paul Anderson
Hyun Min Kang
Goncalo R. Abecasis
author_sort Bingshan Li
title QPLOT: A Quality Assessment Tool for Next Generation Sequencing Data
title_short QPLOT: A Quality Assessment Tool for Next Generation Sequencing Data
title_full QPLOT: A Quality Assessment Tool for Next Generation Sequencing Data
title_fullStr QPLOT: A Quality Assessment Tool for Next Generation Sequencing Data
title_full_unstemmed QPLOT: A Quality Assessment Tool for Next Generation Sequencing Data
title_sort qplot: a quality assessment tool for next generation sequencing data
publisher Hindawi Limited
series BioMed Research International
issn 2314-6133
2314-6141
publishDate 2013-01-01
description Background. Next generation sequencing (NGS) is being widely used to identify genetic variants associated with human disease. Although the approach is cost effective, the underlying data is susceptible to many types of error. Importantly, since NGS technologies and protocols are rapidly evolving, with constantly changing steps ranging from sample preparation to data processing software updates, it is important to enable researchers to routinely assess the quality of sequencing and alignment data prior to downstream analyses. Results. Here we describe QPLOT, an automated tool that can facilitate the quality assessment of sequencing run performance. Taking standard sequence alignments as input, QPLOT generates a series of diagnostic metrics summarizing run quality and produces convenient graphical summaries for these metrics. QPLOT is computationally efficient, generates webpages for interactive exploration of detailed results, and can handle the joint output of many sequencing runs. Conclusion. QPLOT is an automated tool that facilitates assessment of sequence run quality. We routinely apply QPLOT to ensure quick detection of diagnostic of sequencing run problems. We hope that QPLOT will be useful to the community as well.
url http://dx.doi.org/10.1155/2013/865181
work_keys_str_mv AT bingshanli qplotaqualityassessmenttoolfornextgenerationsequencingdata
AT xiaoweizhan qplotaqualityassessmenttoolfornextgenerationsequencingdata
AT marykatewing qplotaqualityassessmenttoolfornextgenerationsequencingdata
AT paulanderson qplotaqualityassessmenttoolfornextgenerationsequencingdata
AT hyunminkang qplotaqualityassessmenttoolfornextgenerationsequencingdata
AT goncalorabecasis qplotaqualityassessmenttoolfornextgenerationsequencingdata
_version_ 1716795764526546944