Citation:
Cabras, S. (2016). A Markov chain representation of the multiple testing problem. Statistical Methods in Medical Research, 27(2), pp. 364–383.
xmlui.dri2xhtml.METS-1.0.item-contributor-funder:
Ministerio de Economía y Competitividad (España)
Sponsor:
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: The author has been funded by Ministerio de Ciencia e Innovación grant MTM2013-42323, ECO2012-38442, RYC-2012-11455, and ACOMP/2015/202, by Ministero dell'Istruzione, dell'Univesità e della Ricerca of Italy, and Regione Autonoma della Sardegna CRP-59903.
Project:
Gobierno de España. ECO2012-38442 Gobierno de España. RYC-2012-11455 Gobierno de España. MTM2013-42323
The problem of multiple hypothesis testing can be represented as a Markov process where a new alternative hypothesis is accepted in accordance with its relative evidence to the currently accepted one. This virtual and not formally observed process provides theThe problem of multiple hypothesis testing can be represented as a Markov process where a new alternative hypothesis is accepted in accordance with its relative evidence to the currently accepted one. This virtual and not formally observed process provides the most probable set of non null hypotheses given the data; it plays the same role as Markov Chain Monte Carlo in approximating a posterior distribution. To apply this representation and obtain the posterior probabilities over all alternative hypotheses, it is enough to have, for each test, barely defined Bayes Factors, e.g. Bayes Factors obtained up to an unknown constant. Such Bayes Factors may either arise from using default and improper priors or from calibrating p-values with respect to their corresponding Bayes Factor lower bound. Both sources of evidence are used to form a Markov transition kernel on the space of hypotheses. The approach leads to easy interpretable results and involves very simple formulas suitable to analyze large datasets as those arising from gene expression data (microarray or RNA-seq experiments).[+][-]