Assessing the performance of methods for copy number aberration detection from single-cell DNA sequencing data.

Single-cell DNA sequencing technologies are enabling the study of mutations and their evolutionary trajectories in cancer. Somatic copy number aberrations (CNAs) have been implicated in the development and progression of various types of cancer. A wide array of methods for CNA detection has been eit...

Full description

Bibliographic Details
Main Authors: Xian F Mallory, Mohammadamin Edrisi, Nicholas Navin, Luay Nakhleh
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2020-07-01
Series:PLoS Computational Biology
Online Access:https://doi.org/10.1371/journal.pcbi.1008012
id doaj-c8aef7fb36c24e35bea5c0f4fa8a126a
record_format Article
spelling doaj-c8aef7fb36c24e35bea5c0f4fa8a126a2021-04-21T15:16:36ZengPublic Library of Science (PLoS)PLoS Computational Biology1553-734X1553-73582020-07-01167e100801210.1371/journal.pcbi.1008012Assessing the performance of methods for copy number aberration detection from single-cell DNA sequencing data.Xian F MalloryMohammadamin EdrisiNicholas NavinLuay NakhlehSingle-cell DNA sequencing technologies are enabling the study of mutations and their evolutionary trajectories in cancer. Somatic copy number aberrations (CNAs) have been implicated in the development and progression of various types of cancer. A wide array of methods for CNA detection has been either developed specifically for or adapted to single-cell DNA sequencing data. Understanding the strengths and limitations that are unique to each of these methods is very important for obtaining accurate copy number profiles from single-cell DNA sequencing data. We benchmarked three widely used methods-Ginkgo, HMMcopy, and CopyNumber-on simulated as well as real datasets. To facilitate this, we developed a novel simulator of single-cell genome evolution in the presence of CNAs. Furthermore, to assess performance on empirical data where the ground truth is unknown, we introduce a phylogeny-based measure for identifying potentially erroneous inferences. While single-cell DNA sequencing is very promising for elucidating and understanding CNAs, our findings show that even the best existing method does not exceed 80% accuracy. New methods that significantly improve upon the accuracy of these three methods are needed. Furthermore, with the large datasets being generated, the methods must be computationally efficient.https://doi.org/10.1371/journal.pcbi.1008012
collection DOAJ
language English
format Article
sources DOAJ
author Xian F Mallory
Mohammadamin Edrisi
Nicholas Navin
Luay Nakhleh
spellingShingle Xian F Mallory
Mohammadamin Edrisi
Nicholas Navin
Luay Nakhleh
Assessing the performance of methods for copy number aberration detection from single-cell DNA sequencing data.
PLoS Computational Biology
author_facet Xian F Mallory
Mohammadamin Edrisi
Nicholas Navin
Luay Nakhleh
author_sort Xian F Mallory
title Assessing the performance of methods for copy number aberration detection from single-cell DNA sequencing data.
title_short Assessing the performance of methods for copy number aberration detection from single-cell DNA sequencing data.
title_full Assessing the performance of methods for copy number aberration detection from single-cell DNA sequencing data.
title_fullStr Assessing the performance of methods for copy number aberration detection from single-cell DNA sequencing data.
title_full_unstemmed Assessing the performance of methods for copy number aberration detection from single-cell DNA sequencing data.
title_sort assessing the performance of methods for copy number aberration detection from single-cell dna sequencing data.
publisher Public Library of Science (PLoS)
series PLoS Computational Biology
issn 1553-734X
1553-7358
publishDate 2020-07-01
description Single-cell DNA sequencing technologies are enabling the study of mutations and their evolutionary trajectories in cancer. Somatic copy number aberrations (CNAs) have been implicated in the development and progression of various types of cancer. A wide array of methods for CNA detection has been either developed specifically for or adapted to single-cell DNA sequencing data. Understanding the strengths and limitations that are unique to each of these methods is very important for obtaining accurate copy number profiles from single-cell DNA sequencing data. We benchmarked three widely used methods-Ginkgo, HMMcopy, and CopyNumber-on simulated as well as real datasets. To facilitate this, we developed a novel simulator of single-cell genome evolution in the presence of CNAs. Furthermore, to assess performance on empirical data where the ground truth is unknown, we introduce a phylogeny-based measure for identifying potentially erroneous inferences. While single-cell DNA sequencing is very promising for elucidating and understanding CNAs, our findings show that even the best existing method does not exceed 80% accuracy. New methods that significantly improve upon the accuracy of these three methods are needed. Furthermore, with the large datasets being generated, the methods must be computationally efficient.
url https://doi.org/10.1371/journal.pcbi.1008012
work_keys_str_mv AT xianfmallory assessingtheperformanceofmethodsforcopynumberaberrationdetectionfromsinglecelldnasequencingdata
AT mohammadaminedrisi assessingtheperformanceofmethodsforcopynumberaberrationdetectionfromsinglecelldnasequencingdata
AT nicholasnavin assessingtheperformanceofmethodsforcopynumberaberrationdetectionfromsinglecelldnasequencingdata
AT luaynakhleh assessingtheperformanceofmethodsforcopynumberaberrationdetectionfromsinglecelldnasequencingdata
_version_ 1714667567513075712