Distributions associated with simultaneous multiple hypothesis testing

Abstract We develop the distribution for the number of hypotheses found to be statistically significant using the rule from Simes (Biometrika 73: 751–754, 1986) for controlling the family-wise error rate (FWER). We find the distribution of the number of statistically significant p-values under the n...

Full description

Bibliographic Details
Main Authors: Chang Yu, Daniel Zelterman
Format: Article
Language:English
Published: SpringerOpen 2020-10-01
Series:Journal of Statistical Distributions and Applications
Subjects:
Online Access:http://link.springer.com/article/10.1186/s40488-020-00109-6
id doaj-4d71b2e98a3e4a2e9ed63a3567a87691
record_format Article
spelling doaj-4d71b2e98a3e4a2e9ed63a3567a876912020-11-25T03:09:36ZengSpringerOpenJournal of Statistical Distributions and Applications2195-58322020-10-017111710.1186/s40488-020-00109-6Distributions associated with simultaneous multiple hypothesis testingChang Yu0Daniel Zelterman1Department of Biostatistics, Vanderbilt University Medical CenterDepartment of Biostatistics, Yale UniversityAbstract We develop the distribution for the number of hypotheses found to be statistically significant using the rule from Simes (Biometrika 73: 751–754, 1986) for controlling the family-wise error rate (FWER). We find the distribution of the number of statistically significant p-values under the null hypothesis and show this follows a normal distribution under the alternative. We propose a parametric distribution Ψ I (·) to model the marginal distribution of p-values sampled from a mixture of null uniform and non-uniform distributions under different alternative hypotheses. The Ψ I distribution is useful when there are many different alternative hypotheses and these are not individually well understood. We fit Ψ I to data from three cancer studies and use it to illustrate the distribution of the number of notable hypotheses observed in these examples. We model dependence in sampled p-values using a latent variable. These methods can be combined to illustrate a power analysis in planning a larger study on the basis of a smaller pilot experiment.http://link.springer.com/article/10.1186/s40488-020-00109-6Bonferroni correctionSimes criteriaFalse discovery ratep-values
collection DOAJ
language English
format Article
sources DOAJ
author Chang Yu
Daniel Zelterman
spellingShingle Chang Yu
Daniel Zelterman
Distributions associated with simultaneous multiple hypothesis testing
Journal of Statistical Distributions and Applications
Bonferroni correction
Simes criteria
False discovery rate
p-values
author_facet Chang Yu
Daniel Zelterman
author_sort Chang Yu
title Distributions associated with simultaneous multiple hypothesis testing
title_short Distributions associated with simultaneous multiple hypothesis testing
title_full Distributions associated with simultaneous multiple hypothesis testing
title_fullStr Distributions associated with simultaneous multiple hypothesis testing
title_full_unstemmed Distributions associated with simultaneous multiple hypothesis testing
title_sort distributions associated with simultaneous multiple hypothesis testing
publisher SpringerOpen
series Journal of Statistical Distributions and Applications
issn 2195-5832
publishDate 2020-10-01
description Abstract We develop the distribution for the number of hypotheses found to be statistically significant using the rule from Simes (Biometrika 73: 751–754, 1986) for controlling the family-wise error rate (FWER). We find the distribution of the number of statistically significant p-values under the null hypothesis and show this follows a normal distribution under the alternative. We propose a parametric distribution Ψ I (·) to model the marginal distribution of p-values sampled from a mixture of null uniform and non-uniform distributions under different alternative hypotheses. The Ψ I distribution is useful when there are many different alternative hypotheses and these are not individually well understood. We fit Ψ I to data from three cancer studies and use it to illustrate the distribution of the number of notable hypotheses observed in these examples. We model dependence in sampled p-values using a latent variable. These methods can be combined to illustrate a power analysis in planning a larger study on the basis of a smaller pilot experiment.
topic Bonferroni correction
Simes criteria
False discovery rate
p-values
url http://link.springer.com/article/10.1186/s40488-020-00109-6
work_keys_str_mv AT changyu distributionsassociatedwithsimultaneousmultiplehypothesistesting
AT danielzelterman distributionsassociatedwithsimultaneousmultiplehypothesistesting
_version_ 1724661650811781120