An EM algorithm to improve the estimation of the probability of clonal relatedness of pairs of tumors in cancer patients

Abstract Background We previously introduced a random-effects model to analyze a set of patients, each of which has two distinct tumors. The goal is to estimate the proportion of patients for which one of the tumors is a metastasis of the other, i.e. where the tumors are clonally related. Matches of...

Full description

Bibliographic Details
Main Authors: Audrey Mauguen, Venkatraman E. Seshan, Irina Ostrovnaya, Colin B. Begg
Format: Article
Language:English
Published: BMC 2019-11-01
Series:BMC Bioinformatics
Subjects:
Online Access:http://link.springer.com/article/10.1186/s12859-019-3148-z
id doaj-28636b07808e425089dd12c9228c0c8b
record_format Article
spelling doaj-28636b07808e425089dd12c9228c0c8b2020-11-25T04:09:58ZengBMCBMC Bioinformatics1471-21052019-11-012011810.1186/s12859-019-3148-zAn EM algorithm to improve the estimation of the probability of clonal relatedness of pairs of tumors in cancer patientsAudrey Mauguen0Venkatraman E. Seshan1Irina Ostrovnaya2Colin B. Begg3Department of Epidemiology and Biostatistics, Memorial Sloan Kettering Cancer CenterDepartment of Epidemiology and Biostatistics, Memorial Sloan Kettering Cancer CenterDepartment of Epidemiology and Biostatistics, Memorial Sloan Kettering Cancer CenterDepartment of Epidemiology and Biostatistics, Memorial Sloan Kettering Cancer CenterAbstract Background We previously introduced a random-effects model to analyze a set of patients, each of which has two distinct tumors. The goal is to estimate the proportion of patients for which one of the tumors is a metastasis of the other, i.e. where the tumors are clonally related. Matches of mutations within a tumor pair provide the evidence for clonal relatedness. In this article, using simulations, we compare two estimation approaches that we considered for our model: use of a constrained quasi-Newton algorithm to maximize the likelihood conditional on the random effect, and an Expectation-Maximization algorithm where we further condition the random-effect distribution on the data. Results In some specific settings, especially with sparse information, the estimation of the parameter of interest is at the boundary a non-negligible number of times using the first approach, while the EM algorithm gives more satisfactory estimates. This is of considerable importance for our application, since an estimate of either 0 or 1 for the proportion of cases that are clonal leads to individual probabilities being 0 or 1 in settings where the evidence is clearly not sufficient for such definitive probability estimates. Conclusions The EM algorithm is a preferable approach for our clonality random-effect model. It is now the method implemented in our R package Clonality, making available an easy and fast way to estimate this model on a range of applications.http://link.springer.com/article/10.1186/s12859-019-3148-zCancerClonalityEM algorithmTumor mutationParameter estimationRandom effect model
collection DOAJ
language English
format Article
sources DOAJ
author Audrey Mauguen
Venkatraman E. Seshan
Irina Ostrovnaya
Colin B. Begg
spellingShingle Audrey Mauguen
Venkatraman E. Seshan
Irina Ostrovnaya
Colin B. Begg
An EM algorithm to improve the estimation of the probability of clonal relatedness of pairs of tumors in cancer patients
BMC Bioinformatics
Cancer
Clonality
EM algorithm
Tumor mutation
Parameter estimation
Random effect model
author_facet Audrey Mauguen
Venkatraman E. Seshan
Irina Ostrovnaya
Colin B. Begg
author_sort Audrey Mauguen
title An EM algorithm to improve the estimation of the probability of clonal relatedness of pairs of tumors in cancer patients
title_short An EM algorithm to improve the estimation of the probability of clonal relatedness of pairs of tumors in cancer patients
title_full An EM algorithm to improve the estimation of the probability of clonal relatedness of pairs of tumors in cancer patients
title_fullStr An EM algorithm to improve the estimation of the probability of clonal relatedness of pairs of tumors in cancer patients
title_full_unstemmed An EM algorithm to improve the estimation of the probability of clonal relatedness of pairs of tumors in cancer patients
title_sort em algorithm to improve the estimation of the probability of clonal relatedness of pairs of tumors in cancer patients
publisher BMC
series BMC Bioinformatics
issn 1471-2105
publishDate 2019-11-01
description Abstract Background We previously introduced a random-effects model to analyze a set of patients, each of which has two distinct tumors. The goal is to estimate the proportion of patients for which one of the tumors is a metastasis of the other, i.e. where the tumors are clonally related. Matches of mutations within a tumor pair provide the evidence for clonal relatedness. In this article, using simulations, we compare two estimation approaches that we considered for our model: use of a constrained quasi-Newton algorithm to maximize the likelihood conditional on the random effect, and an Expectation-Maximization algorithm where we further condition the random-effect distribution on the data. Results In some specific settings, especially with sparse information, the estimation of the parameter of interest is at the boundary a non-negligible number of times using the first approach, while the EM algorithm gives more satisfactory estimates. This is of considerable importance for our application, since an estimate of either 0 or 1 for the proportion of cases that are clonal leads to individual probabilities being 0 or 1 in settings where the evidence is clearly not sufficient for such definitive probability estimates. Conclusions The EM algorithm is a preferable approach for our clonality random-effect model. It is now the method implemented in our R package Clonality, making available an easy and fast way to estimate this model on a range of applications.
topic Cancer
Clonality
EM algorithm
Tumor mutation
Parameter estimation
Random effect model
url http://link.springer.com/article/10.1186/s12859-019-3148-z
work_keys_str_mv AT audreymauguen anemalgorithmtoimprovetheestimationoftheprobabilityofclonalrelatednessofpairsoftumorsincancerpatients
AT venkatramaneseshan anemalgorithmtoimprovetheestimationoftheprobabilityofclonalrelatednessofpairsoftumorsincancerpatients
AT irinaostrovnaya anemalgorithmtoimprovetheestimationoftheprobabilityofclonalrelatednessofpairsoftumorsincancerpatients
AT colinbbegg anemalgorithmtoimprovetheestimationoftheprobabilityofclonalrelatednessofpairsoftumorsincancerpatients
AT audreymauguen emalgorithmtoimprovetheestimationoftheprobabilityofclonalrelatednessofpairsoftumorsincancerpatients
AT venkatramaneseshan emalgorithmtoimprovetheestimationoftheprobabilityofclonalrelatednessofpairsoftumorsincancerpatients
AT irinaostrovnaya emalgorithmtoimprovetheestimationoftheprobabilityofclonalrelatednessofpairsoftumorsincancerpatients
AT colinbbegg emalgorithmtoimprovetheestimationoftheprobabilityofclonalrelatednessofpairsoftumorsincancerpatients
_version_ 1724421038888976384