Permutation tests

Published:

A permutation test (also called a randomization test, re-randomization test, or an exact test) is a type of statistical significance test in which the distribution of the test statistic under the null hypothesis is obtained by calculating all possible values of the test statistic under rearrangements of the labels on the observed data points. In other words, the method by which treatments are allocated to subjects in an experimental design is mirrored in the analysis of that design. If the labels are exchangeable under the null hypothesis, then the resulting tests yield exact significance levels; see also exchangeability. Confidence intervals can then be derived from the tests. The theory has evolved from the works of Ronald Fisher and E. J. G. Pitman in the 1930s.

The process goes as follows:

  • Computing the observed value of the test statistic using the whole samples.
  • The points are permuted and reasigned computing the value of the test statistic using the transformed samples. The set of these calculated test statistic is the exact distribution of possible differences under the null hypothesis that group label does not matter.

Permutation tests are a subset of non-parametric statistics.

The permutation tests assume you compute the null hypothesis by computing all the possible permutations. For big datasets that’s become impossible. In that cases we use Montecarlo for computing an asymptotically equivalent permutation test.

See also

Resampling methods

Papers

Books