RefCNV: Identification of Gene-Based Copy Number Variants Using Whole Exome Sequencing

With rapid advances in DNA sequencing technologies, whole exome sequencing (WES) has become a popular approach for detecting somatic mutations in oncology studies. The initial intent of WES was to characterize single nucleotide variants, but it was observed that the number of sequencing reads that m...

Full description

Bibliographic Details
Main Authors: Lun-Ching Chang, Biswajit Das, Chih-Jian Lih, Han Si, Corinne E. Camalier, Paul M. McGregor, Eric Polley
Format: Article
Language:English
Published: SAGE Publishing 2016-01-01
Series:Cancer Informatics
Online Access:https://doi.org/10.4137/CIN.S36612
id doaj-b9275063eada4994baf1a3c9055a31d2
record_format Article
spelling doaj-b9275063eada4994baf1a3c9055a31d22020-11-25T03:24:38ZengSAGE PublishingCancer Informatics1176-93512016-01-011510.4137/CIN.S36612RefCNV: Identification of Gene-Based Copy Number Variants Using Whole Exome SequencingLun-Ching Chang0Biswajit Das1Chih-Jian Lih2Han Si3Corinne E. Camalier4Paul M. McGregor5Eric Polley6Division of Cancer Treatment and Diagnosis, National Cancer Institute, Bethesda, MD, USA.Molecular Characterization and Clinical Assay Development Laboratory, Leidos Biomedical Research Inc., Frederick National Laboratory for Cancer Research, Frederick, MD, USA.Molecular Characterization and Clinical Assay Development Laboratory, Leidos Biomedical Research Inc., Frederick National Laboratory for Cancer Research, Frederick, MD, USA.Molecular Characterization and Clinical Assay Development Laboratory, Leidos Biomedical Research Inc., Frederick National Laboratory for Cancer Research, Frederick, MD, USA.Molecular Characterization and Clinical Assay Development Laboratory, Leidos Biomedical Research Inc., Frederick National Laboratory for Cancer Research, Frederick, MD, USA.Molecular Characterization and Clinical Assay Development Laboratory, Leidos Biomedical Research Inc., Frederick National Laboratory for Cancer Research, Frederick, MD, USA.Division of Cancer Treatment and Diagnosis, National Cancer Institute, Bethesda, MD, USA.With rapid advances in DNA sequencing technologies, whole exome sequencing (WES) has become a popular approach for detecting somatic mutations in oncology studies. The initial intent of WES was to characterize single nucleotide variants, but it was observed that the number of sequencing reads that mapped to a genomic region correlated with the DNA copy number variants (CNVs). We propose a method RefCNV that uses a reference set to estimate the distribution of the coverage for each exon. The construction of the reference set includes an evaluation of the sources of variability in the coverage distribution. We observed that the processing steps had an impact on the coverage distribution. For each exon, we compared the observed coverage with the expected normal coverage. Thresholds for determining CNVs were selected to control the false-positive error rate. RefCNV prediction correlated significantly ( r = 0.96–0.86) with CNV measured by digital polymerase chain reaction for MET (7q31), EGFR (7p12), or ERBB2 (17q12) in 13 tumor cell lines. The genome-wide CNV analysis showed a good overall correlation (Spearman's coefficient = 0.82) between RefCNV estimation and publicly available CNV data in Cancer Cell Line Encyclopedia. RefCNV also showed better performance than three other CNV estimation methods in genome-wide CNV analysis.https://doi.org/10.4137/CIN.S36612
collection DOAJ
language English
format Article
sources DOAJ
author Lun-Ching Chang
Biswajit Das
Chih-Jian Lih
Han Si
Corinne E. Camalier
Paul M. McGregor
Eric Polley
spellingShingle Lun-Ching Chang
Biswajit Das
Chih-Jian Lih
Han Si
Corinne E. Camalier
Paul M. McGregor
Eric Polley
RefCNV: Identification of Gene-Based Copy Number Variants Using Whole Exome Sequencing
Cancer Informatics
author_facet Lun-Ching Chang
Biswajit Das
Chih-Jian Lih
Han Si
Corinne E. Camalier
Paul M. McGregor
Eric Polley
author_sort Lun-Ching Chang
title RefCNV: Identification of Gene-Based Copy Number Variants Using Whole Exome Sequencing
title_short RefCNV: Identification of Gene-Based Copy Number Variants Using Whole Exome Sequencing
title_full RefCNV: Identification of Gene-Based Copy Number Variants Using Whole Exome Sequencing
title_fullStr RefCNV: Identification of Gene-Based Copy Number Variants Using Whole Exome Sequencing
title_full_unstemmed RefCNV: Identification of Gene-Based Copy Number Variants Using Whole Exome Sequencing
title_sort refcnv: identification of gene-based copy number variants using whole exome sequencing
publisher SAGE Publishing
series Cancer Informatics
issn 1176-9351
publishDate 2016-01-01
description With rapid advances in DNA sequencing technologies, whole exome sequencing (WES) has become a popular approach for detecting somatic mutations in oncology studies. The initial intent of WES was to characterize single nucleotide variants, but it was observed that the number of sequencing reads that mapped to a genomic region correlated with the DNA copy number variants (CNVs). We propose a method RefCNV that uses a reference set to estimate the distribution of the coverage for each exon. The construction of the reference set includes an evaluation of the sources of variability in the coverage distribution. We observed that the processing steps had an impact on the coverage distribution. For each exon, we compared the observed coverage with the expected normal coverage. Thresholds for determining CNVs were selected to control the false-positive error rate. RefCNV prediction correlated significantly ( r = 0.96–0.86) with CNV measured by digital polymerase chain reaction for MET (7q31), EGFR (7p12), or ERBB2 (17q12) in 13 tumor cell lines. The genome-wide CNV analysis showed a good overall correlation (Spearman's coefficient = 0.82) between RefCNV estimation and publicly available CNV data in Cancer Cell Line Encyclopedia. RefCNV also showed better performance than three other CNV estimation methods in genome-wide CNV analysis.
url https://doi.org/10.4137/CIN.S36612
work_keys_str_mv AT lunchingchang refcnvidentificationofgenebasedcopynumbervariantsusingwholeexomesequencing
AT biswajitdas refcnvidentificationofgenebasedcopynumbervariantsusingwholeexomesequencing
AT chihjianlih refcnvidentificationofgenebasedcopynumbervariantsusingwholeexomesequencing
AT hansi refcnvidentificationofgenebasedcopynumbervariantsusingwholeexomesequencing
AT corinneecamalier refcnvidentificationofgenebasedcopynumbervariantsusingwholeexomesequencing
AT paulmmcgregor refcnvidentificationofgenebasedcopynumbervariantsusingwholeexomesequencing
AT ericpolley refcnvidentificationofgenebasedcopynumbervariantsusingwholeexomesequencing
_version_ 1724600893263839232