Probabilistic forecasting of replication studies.

Throughout the last decade, the so-called replication crisis has stimulated many researchers to conduct large-scale replication projects. With data from four of these projects, we computed probabilistic forecasts of the replication outcomes, which we then evaluated regarding discrimination, calibrat...

Full description

Bibliographic Details
Main Authors:	Samuel Pawel, Leonhard Held
Format:	Article
Language:	English
Published:	Public Library of Science (PLoS) 2020-01-01
Series:	PLoS ONE
Online Access:	https://doi.org/10.1371/journal.pone.0231416

id	doaj-b2fc1f4e8913469f888b74d49f3dc5c8
record_format	Article
spelling	doaj-b2fc1f4e8913469f888b74d49f3dc5c82021-03-03T21:39:59ZengPublic Library of Science (PLoS)PLoS ONE1932-62032020-01-01154e023141610.1371/journal.pone.0231416Probabilistic forecasting of replication studies.Samuel PawelLeonhard HeldThroughout the last decade, the so-called replication crisis has stimulated many researchers to conduct large-scale replication projects. With data from four of these projects, we computed probabilistic forecasts of the replication outcomes, which we then evaluated regarding discrimination, calibration and sharpness. A novel model, which can take into account both inflation and heterogeneity of effects, was used and predicted the effect estimate of the replication study with good performance in two of the four data sets. In the other two data sets, predictive performance was still substantially improved compared to the naive model which does not consider inflation and heterogeneity of effects. The results suggest that many of the estimates from the original studies were inflated, possibly caused by publication bias or questionable research practices, and also that some degree of heterogeneity between original and replication effects should be expected. Moreover, the results indicate that the use of statistical significance as the only criterion for replication success may be questionable, since from a predictive viewpoint, non-significant replication results are often compatible with significant results from the original study. The developed statistical methods as well as the data sets are available in the R package ReplicationSuccess.https://doi.org/10.1371/journal.pone.0231416
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Samuel Pawel Leonhard Held
spellingShingle	Samuel Pawel Leonhard Held Probabilistic forecasting of replication studies. PLoS ONE
author_facet	Samuel Pawel Leonhard Held
author_sort	Samuel Pawel
title	Probabilistic forecasting of replication studies.
title_short	Probabilistic forecasting of replication studies.
title_full	Probabilistic forecasting of replication studies.
title_fullStr	Probabilistic forecasting of replication studies.
title_full_unstemmed	Probabilistic forecasting of replication studies.
title_sort	probabilistic forecasting of replication studies.
publisher	Public Library of Science (PLoS)
series	PLoS ONE
issn	1932-6203
publishDate	2020-01-01
description	Throughout the last decade, the so-called replication crisis has stimulated many researchers to conduct large-scale replication projects. With data from four of these projects, we computed probabilistic forecasts of the replication outcomes, which we then evaluated regarding discrimination, calibration and sharpness. A novel model, which can take into account both inflation and heterogeneity of effects, was used and predicted the effect estimate of the replication study with good performance in two of the four data sets. In the other two data sets, predictive performance was still substantially improved compared to the naive model which does not consider inflation and heterogeneity of effects. The results suggest that many of the estimates from the original studies were inflated, possibly caused by publication bias or questionable research practices, and also that some degree of heterogeneity between original and replication effects should be expected. Moreover, the results indicate that the use of statistical significance as the only criterion for replication success may be questionable, since from a predictive viewpoint, non-significant replication results are often compatible with significant results from the original study. The developed statistical methods as well as the data sets are available in the R package ReplicationSuccess.
url	https://doi.org/10.1371/journal.pone.0231416
work_keys_str_mv	AT samuelpawel probabilisticforecastingofreplicationstudies AT leonhardheld probabilisticforecastingofreplicationstudies
_version_	1714815804173713408

Probabilistic forecasting of replication studies.

Similar Items