Comparing copy-number profiles under multi-copy amplifications and deletions

Abstract Background During cancer progression, malignant cells accumulate somatic mutations that can lead to genetic aberrations. In particular, evolutionary events akin to segmental duplications or deletions can alter the copy-number profile (CNP) of a set of genes in a genome. Our aim is to comput...

Full description

Bibliographic Details
Main Authors: Garance Cordonnier, Manuel Lafond
Format: Article
Language:English
Published: BMC 2020-04-01
Series:BMC Genomics
Subjects:
Online Access:http://link.springer.com/article/10.1186/s12864-020-6611-3
id doaj-3d60504fe3d34884ab4161d05780012d
record_format Article
spelling doaj-3d60504fe3d34884ab4161d05780012d2020-11-25T02:07:51ZengBMCBMC Genomics1471-21642020-04-0121S211210.1186/s12864-020-6611-3Comparing copy-number profiles under multi-copy amplifications and deletionsGarance Cordonnier0Manuel Lafond1Department of Computer Science, École polytechniqueDepartment of Computer Science, Université de SherbrookeAbstract Background During cancer progression, malignant cells accumulate somatic mutations that can lead to genetic aberrations. In particular, evolutionary events akin to segmental duplications or deletions can alter the copy-number profile (CNP) of a set of genes in a genome. Our aim is to compute the evolutionary distance between two cells for which only CNPs are known. This asks for the minimum number of segmental amplifications and deletions to turn one CNP into another. This was recently formalized into a model where each event is assumed to alter a copy-number by 1 or −1, even though these events can affect large portions of a chromosome. Results We propose a general cost framework where an event can modify the copy-number of a gene by larger amounts. We show that any cost scheme that allows segmental deletions of arbitrary length makes computing the distance strongly NP-hard. We then devise a factor 2 approximation algorithm for the problem when copy-numbers are non-zero and provide an implementation called cnp2cnp. We evaluate our approach experimentally by reconstructing simulated cancer phylogenies from the pairwise distances inferred by cnp2cnp and compare it against two other alternatives, namely the MEDICC distance and the Euclidean distance. Conclusions The experimental results show that our distance yields more accurate phylogenies on average than these alternatives if the given CNPs are error-free, but that the MEDICC distance is slightly more robust against error in the data. In all cases, our experiments show that either our approach or the MEDICC approach should preferred over the Euclidean distance.http://link.springer.com/article/10.1186/s12864-020-6611-3Copy-number evolutionAlgorithmsCancer phylogeniesNP-hardness
collection DOAJ
language English
format Article
sources DOAJ
author Garance Cordonnier
Manuel Lafond
spellingShingle Garance Cordonnier
Manuel Lafond
Comparing copy-number profiles under multi-copy amplifications and deletions
BMC Genomics
Copy-number evolution
Algorithms
Cancer phylogenies
NP-hardness
author_facet Garance Cordonnier
Manuel Lafond
author_sort Garance Cordonnier
title Comparing copy-number profiles under multi-copy amplifications and deletions
title_short Comparing copy-number profiles under multi-copy amplifications and deletions
title_full Comparing copy-number profiles under multi-copy amplifications and deletions
title_fullStr Comparing copy-number profiles under multi-copy amplifications and deletions
title_full_unstemmed Comparing copy-number profiles under multi-copy amplifications and deletions
title_sort comparing copy-number profiles under multi-copy amplifications and deletions
publisher BMC
series BMC Genomics
issn 1471-2164
publishDate 2020-04-01
description Abstract Background During cancer progression, malignant cells accumulate somatic mutations that can lead to genetic aberrations. In particular, evolutionary events akin to segmental duplications or deletions can alter the copy-number profile (CNP) of a set of genes in a genome. Our aim is to compute the evolutionary distance between two cells for which only CNPs are known. This asks for the minimum number of segmental amplifications and deletions to turn one CNP into another. This was recently formalized into a model where each event is assumed to alter a copy-number by 1 or −1, even though these events can affect large portions of a chromosome. Results We propose a general cost framework where an event can modify the copy-number of a gene by larger amounts. We show that any cost scheme that allows segmental deletions of arbitrary length makes computing the distance strongly NP-hard. We then devise a factor 2 approximation algorithm for the problem when copy-numbers are non-zero and provide an implementation called cnp2cnp. We evaluate our approach experimentally by reconstructing simulated cancer phylogenies from the pairwise distances inferred by cnp2cnp and compare it against two other alternatives, namely the MEDICC distance and the Euclidean distance. Conclusions The experimental results show that our distance yields more accurate phylogenies on average than these alternatives if the given CNPs are error-free, but that the MEDICC distance is slightly more robust against error in the data. In all cases, our experiments show that either our approach or the MEDICC approach should preferred over the Euclidean distance.
topic Copy-number evolution
Algorithms
Cancer phylogenies
NP-hardness
url http://link.springer.com/article/10.1186/s12864-020-6611-3
work_keys_str_mv AT garancecordonnier comparingcopynumberprofilesundermulticopyamplificationsanddeletions
AT manuellafond comparingcopynumberprofilesundermulticopyamplificationsanddeletions
_version_ 1724929286497894400