ExScalibur: A High-Performance Cloud-Enabled Suite for Whole Exome Germline and Somatic Mutation Identification.

Whole exome sequencing has facilitated the discovery of causal genetic variants associated with human diseases at deep coverage and low cost. In particular, the detection of somatic mutations from tumor/normal pairs has provided insights into the cancer genome. Although there is an abundance of publ...

Full description

Bibliographic Details
Main Authors: Riyue Bao, Kyle Hernandez, Lei Huang, Wenjun Kang, Elizabeth Bartom, Kenan Onel, Samuel Volchenboum, Jorge Andrade
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2015-01-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC4535852?pdf=render
id doaj-d7f0061cf18f44488b65f03386c5bf23
record_format Article
spelling doaj-d7f0061cf18f44488b65f03386c5bf232020-11-25T01:21:22ZengPublic Library of Science (PLoS)PLoS ONE1932-62032015-01-01108e013580010.1371/journal.pone.0135800ExScalibur: A High-Performance Cloud-Enabled Suite for Whole Exome Germline and Somatic Mutation Identification.Riyue BaoKyle HernandezLei HuangWenjun KangElizabeth BartomKenan OnelSamuel VolchenboumJorge AndradeWhole exome sequencing has facilitated the discovery of causal genetic variants associated with human diseases at deep coverage and low cost. In particular, the detection of somatic mutations from tumor/normal pairs has provided insights into the cancer genome. Although there is an abundance of publicly-available software for the detection of germline and somatic variants, concordance is generally limited among variant callers and alignment algorithms. Successful integration of variants detected by multiple methods requires in-depth knowledge of the software, access to high-performance computing resources, and advanced programming techniques. We present ExScalibur, a set of fully automated, highly scalable and modulated pipelines for whole exome data analysis. The suite integrates multiple alignment and variant calling algorithms for the accurate detection of germline and somatic mutations with close to 99% sensitivity and specificity. ExScalibur implements streamlined execution of analytical modules, real-time monitoring of pipeline progress, robust handling of errors and intuitive documentation that allows for increased reproducibility and sharing of results and workflows. It runs on local computers, high-performance computing clusters and cloud environments. In addition, we provide a data analysis report utility to facilitate visualization of the results that offers interactive exploration of quality control files, read alignment and variant calls, assisting downstream customization of potential disease-causing mutations. ExScalibur is open-source and is also available as a public image on Amazon cloud.http://europepmc.org/articles/PMC4535852?pdf=render
collection DOAJ
language English
format Article
sources DOAJ
author Riyue Bao
Kyle Hernandez
Lei Huang
Wenjun Kang
Elizabeth Bartom
Kenan Onel
Samuel Volchenboum
Jorge Andrade
spellingShingle Riyue Bao
Kyle Hernandez
Lei Huang
Wenjun Kang
Elizabeth Bartom
Kenan Onel
Samuel Volchenboum
Jorge Andrade
ExScalibur: A High-Performance Cloud-Enabled Suite for Whole Exome Germline and Somatic Mutation Identification.
PLoS ONE
author_facet Riyue Bao
Kyle Hernandez
Lei Huang
Wenjun Kang
Elizabeth Bartom
Kenan Onel
Samuel Volchenboum
Jorge Andrade
author_sort Riyue Bao
title ExScalibur: A High-Performance Cloud-Enabled Suite for Whole Exome Germline and Somatic Mutation Identification.
title_short ExScalibur: A High-Performance Cloud-Enabled Suite for Whole Exome Germline and Somatic Mutation Identification.
title_full ExScalibur: A High-Performance Cloud-Enabled Suite for Whole Exome Germline and Somatic Mutation Identification.
title_fullStr ExScalibur: A High-Performance Cloud-Enabled Suite for Whole Exome Germline and Somatic Mutation Identification.
title_full_unstemmed ExScalibur: A High-Performance Cloud-Enabled Suite for Whole Exome Germline and Somatic Mutation Identification.
title_sort exscalibur: a high-performance cloud-enabled suite for whole exome germline and somatic mutation identification.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2015-01-01
description Whole exome sequencing has facilitated the discovery of causal genetic variants associated with human diseases at deep coverage and low cost. In particular, the detection of somatic mutations from tumor/normal pairs has provided insights into the cancer genome. Although there is an abundance of publicly-available software for the detection of germline and somatic variants, concordance is generally limited among variant callers and alignment algorithms. Successful integration of variants detected by multiple methods requires in-depth knowledge of the software, access to high-performance computing resources, and advanced programming techniques. We present ExScalibur, a set of fully automated, highly scalable and modulated pipelines for whole exome data analysis. The suite integrates multiple alignment and variant calling algorithms for the accurate detection of germline and somatic mutations with close to 99% sensitivity and specificity. ExScalibur implements streamlined execution of analytical modules, real-time monitoring of pipeline progress, robust handling of errors and intuitive documentation that allows for increased reproducibility and sharing of results and workflows. It runs on local computers, high-performance computing clusters and cloud environments. In addition, we provide a data analysis report utility to facilitate visualization of the results that offers interactive exploration of quality control files, read alignment and variant calls, assisting downstream customization of potential disease-causing mutations. ExScalibur is open-source and is also available as a public image on Amazon cloud.
url http://europepmc.org/articles/PMC4535852?pdf=render
work_keys_str_mv AT riyuebao exscaliburahighperformancecloudenabledsuiteforwholeexomegermlineandsomaticmutationidentification
AT kylehernandez exscaliburahighperformancecloudenabledsuiteforwholeexomegermlineandsomaticmutationidentification
AT leihuang exscaliburahighperformancecloudenabledsuiteforwholeexomegermlineandsomaticmutationidentification
AT wenjunkang exscaliburahighperformancecloudenabledsuiteforwholeexomegermlineandsomaticmutationidentification
AT elizabethbartom exscaliburahighperformancecloudenabledsuiteforwholeexomegermlineandsomaticmutationidentification
AT kenanonel exscaliburahighperformancecloudenabledsuiteforwholeexomegermlineandsomaticmutationidentification
AT samuelvolchenboum exscaliburahighperformancecloudenabledsuiteforwholeexomegermlineandsomaticmutationidentification
AT jorgeandrade exscaliburahighperformancecloudenabledsuiteforwholeexomegermlineandsomaticmutationidentification
_version_ 1725130686102241280