Sampling genotypes in large pedigrees with loops

<p>Abstract</p> <p>Markov chain Monte Carlo (MCMC) methods have been proposed to overcome computational problems in linkage and segregation analyses. This approach involves sampling genotypes at the marker and trait loci. Scalar-Gibbs is easy to implement, and it is widely used in...

Full description

Bibliographic Details
Main Authors: Guldbrandtsen Bernt, Fernando Rohan L, Fernández Soledad A, Totir Liviu R, Carriquiry Alicia L
Format: Article
Language:deu
Published: BMC 2001-07-01
Series:Genetics Selection Evolution
Subjects:
Online Access:http://www.gsejournal.org/content/33/4/337
id doaj-758fc130829d4c2e97bfce8c5086dd53
record_format Article
spelling doaj-758fc130829d4c2e97bfce8c5086dd532020-11-24T22:12:42ZdeuBMCGenetics Selection Evolution0999-193X1297-96862001-07-0133433736710.1186/1297-9686-33-4-337Sampling genotypes in large pedigrees with loopsGuldbrandtsen BerntFernando Rohan LFernández Soledad ATotir Liviu RCarriquiry Alicia L<p>Abstract</p> <p>Markov chain Monte Carlo (MCMC) methods have been proposed to overcome computational problems in linkage and segregation analyses. This approach involves sampling genotypes at the marker and trait loci. Scalar-Gibbs is easy to implement, and it is widely used in genetics. However, the Markov chain that corresponds to scalar-Gibbs may not be irreducible when the marker locus has more than two alleles, and even when the chain is irreducible, mixing has been observed to be slow. These problems do not arise if the genotypes are sampled jointly from the entire pedigree. This paper proposes a method to jointly sample genotypes. The method combines the Elston-Stewart algorithm and iterative peeling, and is called the ESIP sampler. For a hypothetical pedigree, genotype probabilities are estimated from samples obtained using ESIP and also scalar-Gibbs. Approximate probabilities were also obtained by iterative peeling. Comparisons of these with exact genotypic probabilities obtained by the Elston-Stewart algorithm showed that ESIP and iterative peeling yielded genotypic probabilities that were very close to the exact values. Nevertheless, estimated probabilities from scalar-Gibbs with a chain of length 235 000, including a burn-in of 200 000 steps, were less accurate than probabilities estimated using ESIP with a chain of length 10 000, with a burn-in of 5 000 steps. The effective chain size (ECS) was estimated from the last 25 000 elements of the chain of length 125 000. For one of the ESIP samplers, the ECS ranged from 21 579 to 22 741, while for the scalar-Gibbs sampler, the ECS ranged from 64 to 671. Genotype probabilities were also estimated for a large real pedigree consisting of 3 223 individuals. For this pedigree, it is not feasible to obtain exact genotype probabilities by the Elston-Stewart algorithm. ESIP and iterative peeling yielded very similar results. However, results from scalar-Gibbs were less accurate.</p> http://www.gsejournal.org/content/33/4/337genotype samplerMarkov chain Monte Carlopeeling
collection DOAJ
language deu
format Article
sources DOAJ
author Guldbrandtsen Bernt
Fernando Rohan L
Fernández Soledad A
Totir Liviu R
Carriquiry Alicia L
spellingShingle Guldbrandtsen Bernt
Fernando Rohan L
Fernández Soledad A
Totir Liviu R
Carriquiry Alicia L
Sampling genotypes in large pedigrees with loops
Genetics Selection Evolution
genotype sampler
Markov chain Monte Carlo
peeling
author_facet Guldbrandtsen Bernt
Fernando Rohan L
Fernández Soledad A
Totir Liviu R
Carriquiry Alicia L
author_sort Guldbrandtsen Bernt
title Sampling genotypes in large pedigrees with loops
title_short Sampling genotypes in large pedigrees with loops
title_full Sampling genotypes in large pedigrees with loops
title_fullStr Sampling genotypes in large pedigrees with loops
title_full_unstemmed Sampling genotypes in large pedigrees with loops
title_sort sampling genotypes in large pedigrees with loops
publisher BMC
series Genetics Selection Evolution
issn 0999-193X
1297-9686
publishDate 2001-07-01
description <p>Abstract</p> <p>Markov chain Monte Carlo (MCMC) methods have been proposed to overcome computational problems in linkage and segregation analyses. This approach involves sampling genotypes at the marker and trait loci. Scalar-Gibbs is easy to implement, and it is widely used in genetics. However, the Markov chain that corresponds to scalar-Gibbs may not be irreducible when the marker locus has more than two alleles, and even when the chain is irreducible, mixing has been observed to be slow. These problems do not arise if the genotypes are sampled jointly from the entire pedigree. This paper proposes a method to jointly sample genotypes. The method combines the Elston-Stewart algorithm and iterative peeling, and is called the ESIP sampler. For a hypothetical pedigree, genotype probabilities are estimated from samples obtained using ESIP and also scalar-Gibbs. Approximate probabilities were also obtained by iterative peeling. Comparisons of these with exact genotypic probabilities obtained by the Elston-Stewart algorithm showed that ESIP and iterative peeling yielded genotypic probabilities that were very close to the exact values. Nevertheless, estimated probabilities from scalar-Gibbs with a chain of length 235 000, including a burn-in of 200 000 steps, were less accurate than probabilities estimated using ESIP with a chain of length 10 000, with a burn-in of 5 000 steps. The effective chain size (ECS) was estimated from the last 25 000 elements of the chain of length 125 000. For one of the ESIP samplers, the ECS ranged from 21 579 to 22 741, while for the scalar-Gibbs sampler, the ECS ranged from 64 to 671. Genotype probabilities were also estimated for a large real pedigree consisting of 3 223 individuals. For this pedigree, it is not feasible to obtain exact genotype probabilities by the Elston-Stewart algorithm. ESIP and iterative peeling yielded very similar results. However, results from scalar-Gibbs were less accurate.</p>
topic genotype sampler
Markov chain Monte Carlo
peeling
url http://www.gsejournal.org/content/33/4/337
work_keys_str_mv AT guldbrandtsenbernt samplinggenotypesinlargepedigreeswithloops
AT fernandorohanl samplinggenotypesinlargepedigreeswithloops
AT fernandezsoledada samplinggenotypesinlargepedigreeswithloops
AT totirliviur samplinggenotypesinlargepedigreeswithloops
AT carriquiryalicial samplinggenotypesinlargepedigreeswithloops
_version_ 1725802740736589824