Distributions associated with simultaneous multiple hypothesis testing
Abstract We develop the distribution for the number of hypotheses found to be statistically significant using the rule from Simes (Biometrika 73: 751–754, 1986) for controlling the family-wise error rate (FWER). We find the distribution of the number of statistically significant p-values under the n...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
SpringerOpen
2020-10-01
|
Series: | Journal of Statistical Distributions and Applications |
Subjects: | |
Online Access: | http://link.springer.com/article/10.1186/s40488-020-00109-6 |
id |
doaj-4d71b2e98a3e4a2e9ed63a3567a87691 |
---|---|
record_format |
Article |
spelling |
doaj-4d71b2e98a3e4a2e9ed63a3567a876912020-11-25T03:09:36ZengSpringerOpenJournal of Statistical Distributions and Applications2195-58322020-10-017111710.1186/s40488-020-00109-6Distributions associated with simultaneous multiple hypothesis testingChang Yu0Daniel Zelterman1Department of Biostatistics, Vanderbilt University Medical CenterDepartment of Biostatistics, Yale UniversityAbstract We develop the distribution for the number of hypotheses found to be statistically significant using the rule from Simes (Biometrika 73: 751–754, 1986) for controlling the family-wise error rate (FWER). We find the distribution of the number of statistically significant p-values under the null hypothesis and show this follows a normal distribution under the alternative. We propose a parametric distribution Ψ I (·) to model the marginal distribution of p-values sampled from a mixture of null uniform and non-uniform distributions under different alternative hypotheses. The Ψ I distribution is useful when there are many different alternative hypotheses and these are not individually well understood. We fit Ψ I to data from three cancer studies and use it to illustrate the distribution of the number of notable hypotheses observed in these examples. We model dependence in sampled p-values using a latent variable. These methods can be combined to illustrate a power analysis in planning a larger study on the basis of a smaller pilot experiment.http://link.springer.com/article/10.1186/s40488-020-00109-6Bonferroni correctionSimes criteriaFalse discovery ratep-values |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Chang Yu Daniel Zelterman |
spellingShingle |
Chang Yu Daniel Zelterman Distributions associated with simultaneous multiple hypothesis testing Journal of Statistical Distributions and Applications Bonferroni correction Simes criteria False discovery rate p-values |
author_facet |
Chang Yu Daniel Zelterman |
author_sort |
Chang Yu |
title |
Distributions associated with simultaneous multiple hypothesis testing |
title_short |
Distributions associated with simultaneous multiple hypothesis testing |
title_full |
Distributions associated with simultaneous multiple hypothesis testing |
title_fullStr |
Distributions associated with simultaneous multiple hypothesis testing |
title_full_unstemmed |
Distributions associated with simultaneous multiple hypothesis testing |
title_sort |
distributions associated with simultaneous multiple hypothesis testing |
publisher |
SpringerOpen |
series |
Journal of Statistical Distributions and Applications |
issn |
2195-5832 |
publishDate |
2020-10-01 |
description |
Abstract We develop the distribution for the number of hypotheses found to be statistically significant using the rule from Simes (Biometrika 73: 751–754, 1986) for controlling the family-wise error rate (FWER). We find the distribution of the number of statistically significant p-values under the null hypothesis and show this follows a normal distribution under the alternative. We propose a parametric distribution Ψ I (·) to model the marginal distribution of p-values sampled from a mixture of null uniform and non-uniform distributions under different alternative hypotheses. The Ψ I distribution is useful when there are many different alternative hypotheses and these are not individually well understood. We fit Ψ I to data from three cancer studies and use it to illustrate the distribution of the number of notable hypotheses observed in these examples. We model dependence in sampled p-values using a latent variable. These methods can be combined to illustrate a power analysis in planning a larger study on the basis of a smaller pilot experiment. |
topic |
Bonferroni correction Simes criteria False discovery rate p-values |
url |
http://link.springer.com/article/10.1186/s40488-020-00109-6 |
work_keys_str_mv |
AT changyu distributionsassociatedwithsimultaneousmultiplehypothesistesting AT danielzelterman distributionsassociatedwithsimultaneousmultiplehypothesistesting |
_version_ |
1724661650811781120 |