Enhancing genome-enabled prediction by bagging genomic BLUP.

We examined whether or not the predictive ability of genomic best linear unbiased prediction (GBLUP) could be improved via a resampling method used in machine learning: bootstrap aggregating sampling ("bagging"). In theory, bagging can be useful when the predictor has large variance or whe...

Full description

Bibliographic Details
Main Authors:	Daniel Gianola, Kent A Weigel, Nicole Krämer, Alessandra Stella, Chris-Carolin Schön
Format:	Article
Language:	English
Published:	Public Library of Science (PLoS) 2014-01-01
Series:	PLoS ONE
Online Access:	http://europepmc.org/articles/PMC3982963?pdf=render

id	doaj-7d17af68c2414d3da7928fa40deccb2d
record_format	Article
spelling	doaj-7d17af68c2414d3da7928fa40deccb2d2020-11-25T01:56:02ZengPublic Library of Science (PLoS)PLoS ONE1932-62032014-01-0194e9169310.1371/journal.pone.0091693Enhancing genome-enabled prediction by bagging genomic BLUP.Daniel GianolaKent A WeigelNicole KrämerAlessandra StellaChris-Carolin SchönWe examined whether or not the predictive ability of genomic best linear unbiased prediction (GBLUP) could be improved via a resampling method used in machine learning: bootstrap aggregating sampling ("bagging"). In theory, bagging can be useful when the predictor has large variance or when the number of markers is much larger than sample size, preventing effective regularization. After presenting a brief review of GBLUP, bagging was adapted to the context of GBLUP, both at the level of the genetic signal and of marker effects. The performance of bagging was evaluated with four simulated case studies including known or unknown quantitative trait loci, and an application was made to real data on grain yield in wheat planted in four environments. A metric aimed to quantify candidate-specific cross-validation uncertainty was proposed and assessed; as expected, model derived theoretical reliabilities bore no relationship with cross-validation accuracy. It was found that bagging can ameliorate predictive performance of GBLUP and make it more robust against over-fitting. Seemingly, 25-50 bootstrap samples was enough to attain reasonable predictions as well as stable measures of individual predictive mean squared errors.http://europepmc.org/articles/PMC3982963?pdf=render
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Daniel Gianola Kent A Weigel Nicole Krämer Alessandra Stella Chris-Carolin Schön
spellingShingle	Daniel Gianola Kent A Weigel Nicole Krämer Alessandra Stella Chris-Carolin Schön Enhancing genome-enabled prediction by bagging genomic BLUP. PLoS ONE
author_facet	Daniel Gianola Kent A Weigel Nicole Krämer Alessandra Stella Chris-Carolin Schön
author_sort	Daniel Gianola
title	Enhancing genome-enabled prediction by bagging genomic BLUP.
title_short	Enhancing genome-enabled prediction by bagging genomic BLUP.
title_full	Enhancing genome-enabled prediction by bagging genomic BLUP.
title_fullStr	Enhancing genome-enabled prediction by bagging genomic BLUP.
title_full_unstemmed	Enhancing genome-enabled prediction by bagging genomic BLUP.
title_sort	enhancing genome-enabled prediction by bagging genomic blup.
publisher	Public Library of Science (PLoS)
series	PLoS ONE
issn	1932-6203
publishDate	2014-01-01
description	We examined whether or not the predictive ability of genomic best linear unbiased prediction (GBLUP) could be improved via a resampling method used in machine learning: bootstrap aggregating sampling ("bagging"). In theory, bagging can be useful when the predictor has large variance or when the number of markers is much larger than sample size, preventing effective regularization. After presenting a brief review of GBLUP, bagging was adapted to the context of GBLUP, both at the level of the genetic signal and of marker effects. The performance of bagging was evaluated with four simulated case studies including known or unknown quantitative trait loci, and an application was made to real data on grain yield in wheat planted in four environments. A metric aimed to quantify candidate-specific cross-validation uncertainty was proposed and assessed; as expected, model derived theoretical reliabilities bore no relationship with cross-validation accuracy. It was found that bagging can ameliorate predictive performance of GBLUP and make it more robust against over-fitting. Seemingly, 25-50 bootstrap samples was enough to attain reasonable predictions as well as stable measures of individual predictive mean squared errors.
url	http://europepmc.org/articles/PMC3982963?pdf=render
work_keys_str_mv	AT danielgianola enhancinggenomeenabledpredictionbybagginggenomicblup AT kentaweigel enhancinggenomeenabledpredictionbybagginggenomicblup AT nicolekramer enhancinggenomeenabledpredictionbybagginggenomicblup AT alessandrastella enhancinggenomeenabledpredictionbybagginggenomicblup AT chriscarolinschon enhancinggenomeenabledpredictionbybagginggenomicblup
_version_	1724982084252991488

Enhancing genome-enabled prediction by bagging genomic BLUP.

Similar Items