An EM algorithm to improve the estimation of the probability of clonal relatedness of pairs of tumors in cancer patients
Abstract Background We previously introduced a random-effects model to analyze a set of patients, each of which has two distinct tumors. The goal is to estimate the proportion of patients for which one of the tumors is a metastasis of the other, i.e. where the tumors are clonally related. Matches of...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
BMC
2019-11-01
|
Series: | BMC Bioinformatics |
Subjects: | |
Online Access: | http://link.springer.com/article/10.1186/s12859-019-3148-z |
id |
doaj-28636b07808e425089dd12c9228c0c8b |
---|---|
record_format |
Article |
spelling |
doaj-28636b07808e425089dd12c9228c0c8b2020-11-25T04:09:58ZengBMCBMC Bioinformatics1471-21052019-11-012011810.1186/s12859-019-3148-zAn EM algorithm to improve the estimation of the probability of clonal relatedness of pairs of tumors in cancer patientsAudrey Mauguen0Venkatraman E. Seshan1Irina Ostrovnaya2Colin B. Begg3Department of Epidemiology and Biostatistics, Memorial Sloan Kettering Cancer CenterDepartment of Epidemiology and Biostatistics, Memorial Sloan Kettering Cancer CenterDepartment of Epidemiology and Biostatistics, Memorial Sloan Kettering Cancer CenterDepartment of Epidemiology and Biostatistics, Memorial Sloan Kettering Cancer CenterAbstract Background We previously introduced a random-effects model to analyze a set of patients, each of which has two distinct tumors. The goal is to estimate the proportion of patients for which one of the tumors is a metastasis of the other, i.e. where the tumors are clonally related. Matches of mutations within a tumor pair provide the evidence for clonal relatedness. In this article, using simulations, we compare two estimation approaches that we considered for our model: use of a constrained quasi-Newton algorithm to maximize the likelihood conditional on the random effect, and an Expectation-Maximization algorithm where we further condition the random-effect distribution on the data. Results In some specific settings, especially with sparse information, the estimation of the parameter of interest is at the boundary a non-negligible number of times using the first approach, while the EM algorithm gives more satisfactory estimates. This is of considerable importance for our application, since an estimate of either 0 or 1 for the proportion of cases that are clonal leads to individual probabilities being 0 or 1 in settings where the evidence is clearly not sufficient for such definitive probability estimates. Conclusions The EM algorithm is a preferable approach for our clonality random-effect model. It is now the method implemented in our R package Clonality, making available an easy and fast way to estimate this model on a range of applications.http://link.springer.com/article/10.1186/s12859-019-3148-zCancerClonalityEM algorithmTumor mutationParameter estimationRandom effect model |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Audrey Mauguen Venkatraman E. Seshan Irina Ostrovnaya Colin B. Begg |
spellingShingle |
Audrey Mauguen Venkatraman E. Seshan Irina Ostrovnaya Colin B. Begg An EM algorithm to improve the estimation of the probability of clonal relatedness of pairs of tumors in cancer patients BMC Bioinformatics Cancer Clonality EM algorithm Tumor mutation Parameter estimation Random effect model |
author_facet |
Audrey Mauguen Venkatraman E. Seshan Irina Ostrovnaya Colin B. Begg |
author_sort |
Audrey Mauguen |
title |
An EM algorithm to improve the estimation of the probability of clonal relatedness of pairs of tumors in cancer patients |
title_short |
An EM algorithm to improve the estimation of the probability of clonal relatedness of pairs of tumors in cancer patients |
title_full |
An EM algorithm to improve the estimation of the probability of clonal relatedness of pairs of tumors in cancer patients |
title_fullStr |
An EM algorithm to improve the estimation of the probability of clonal relatedness of pairs of tumors in cancer patients |
title_full_unstemmed |
An EM algorithm to improve the estimation of the probability of clonal relatedness of pairs of tumors in cancer patients |
title_sort |
em algorithm to improve the estimation of the probability of clonal relatedness of pairs of tumors in cancer patients |
publisher |
BMC |
series |
BMC Bioinformatics |
issn |
1471-2105 |
publishDate |
2019-11-01 |
description |
Abstract Background We previously introduced a random-effects model to analyze a set of patients, each of which has two distinct tumors. The goal is to estimate the proportion of patients for which one of the tumors is a metastasis of the other, i.e. where the tumors are clonally related. Matches of mutations within a tumor pair provide the evidence for clonal relatedness. In this article, using simulations, we compare two estimation approaches that we considered for our model: use of a constrained quasi-Newton algorithm to maximize the likelihood conditional on the random effect, and an Expectation-Maximization algorithm where we further condition the random-effect distribution on the data. Results In some specific settings, especially with sparse information, the estimation of the parameter of interest is at the boundary a non-negligible number of times using the first approach, while the EM algorithm gives more satisfactory estimates. This is of considerable importance for our application, since an estimate of either 0 or 1 for the proportion of cases that are clonal leads to individual probabilities being 0 or 1 in settings where the evidence is clearly not sufficient for such definitive probability estimates. Conclusions The EM algorithm is a preferable approach for our clonality random-effect model. It is now the method implemented in our R package Clonality, making available an easy and fast way to estimate this model on a range of applications. |
topic |
Cancer Clonality EM algorithm Tumor mutation Parameter estimation Random effect model |
url |
http://link.springer.com/article/10.1186/s12859-019-3148-z |
work_keys_str_mv |
AT audreymauguen anemalgorithmtoimprovetheestimationoftheprobabilityofclonalrelatednessofpairsoftumorsincancerpatients AT venkatramaneseshan anemalgorithmtoimprovetheestimationoftheprobabilityofclonalrelatednessofpairsoftumorsincancerpatients AT irinaostrovnaya anemalgorithmtoimprovetheestimationoftheprobabilityofclonalrelatednessofpairsoftumorsincancerpatients AT colinbbegg anemalgorithmtoimprovetheestimationoftheprobabilityofclonalrelatednessofpairsoftumorsincancerpatients AT audreymauguen emalgorithmtoimprovetheestimationoftheprobabilityofclonalrelatednessofpairsoftumorsincancerpatients AT venkatramaneseshan emalgorithmtoimprovetheestimationoftheprobabilityofclonalrelatednessofpairsoftumorsincancerpatients AT irinaostrovnaya emalgorithmtoimprovetheestimationoftheprobabilityofclonalrelatednessofpairsoftumorsincancerpatients AT colinbbegg emalgorithmtoimprovetheestimationoftheprobabilityofclonalrelatednessofpairsoftumorsincancerpatients |
_version_ |
1724421038888976384 |