# Exact test

> Mediated Wiki article. Canonical URL: https://mediated.wiki/source/Exact_test
> Markdown URL: https://mediated.wiki/source/Exact_test.md
> Source: https://en.wikipedia.org/wiki/Exact_test
> Source revision: 1253013585
> License: Creative Commons Attribution-ShareAlike 4.0 International (https://creativecommons.org/licenses/by-sa/4.0/)

{{Multiple issues|
{{Expert needed|statistics|date=November 2008|reason=Needs discussion in body of exact tests in general}}
{{No footnotes|date=October 2021}}
}}
An '''exact (significance) test''' is a [statistical test](/source/statistical_test) such that if the [null hypothesis](/source/null_hypothesis) is true, then all [assumptions](/source/Statistical_assumption) made during the derivation of the distribution of the [test statistic](/source/test_statistic) are met. Using an exact test provides a [significance test](/source/significance_test) that maintains the [type I error rate](/source/Type_I_and_type_II_errors) of the test (<math>\alpha</math>) at the desired significance level of the test. For example, an exact test at a [significance level](/source/significance_level) of <math>\alpha = 5\%</math>, when repeated over many samples where the [null hypothesis](/source/null_hypothesis) is true, will reject at most <math>5\%</math> of the time. This is in contrast to an ''approximate test'' in which the desired type I error rate is only [approximately](/source/Approximation) maintained (i.e.: the test might reject  > 5% of the time), while this approximation may be made as close to <math>\alpha</math> as desired by making the sample size sufficiently large.

Exact tests that are based on discrete [test statistic](/source/test_statistic)s may be conservative, indicating that the actual rejection rate lies below the nominal significance level <math>\alpha</math>. As an example, this is the case for [Fisher's exact test](/source/Fisher's_exact_test) and its more powerful alternative, [Boschloo's test](/source/Boschloo's_test). If the test statistic is continuous, it will reach the significance level exactly.{{citation_needed|date=July 2019}}

[Parametric tests](/source/Parametric_statistics), such as those used in [exact statistics](/source/exact_statistics), are exact tests when the parametric assumptions are fully met, but in practice, the use of the term ''exact'' (significance) ''test'' is reserved for non-parametric tests, i.e., tests that do not rest on parametric assumptions{{citation_needed|date=July 2019}}. However, in practice, most implementations of non-parametric test software use asymptotical algorithms to obtain the significance value, which renders the test non-exact.

Hence, when a result of statistical analysis is termed an “exact test” or specifies an “exact [p-value](/source/p-value)”, this implies that the test is defined without parametric assumptions and is evaluated without making use of approximate algorithms.  In principle, however, this could also signify that a parametric test has been employed in a situation where all parametric assumptions are fully met, but it is in most cases impossible to prove this completely in a real-world situation. Exceptions in which it is certain that parametric tests are exact include tests based on the binomial or Poisson distributions. The term [permutation test](/source/permutation_test) is sometimes used as a synonym for exact test, but it should be kept in mind that all permutation tests are exact tests, but not all exact tests are permutation tests.

==Formulation==

The basic equation underlying exact tests is

:<math>\Pr(\text{exact})=\sum_{\mathbf{y}\,:\,T(\mathbf{y})\ge T(\mathbf{x)}} \Pr(\mathbf{y})</math>

where:
:*'''x''' is the actual observed outcome,
:*Pr('''y''') is the probability under the null hypothesis of a potentially observed outcome '''y''',
:*''T''('''y''') is the value of the test statistic for an outcome '''y''', with larger values of ''T'' representing cases which notionally represent greater departures from the null hypothesis,
and where the sum ranges over all outcomes '''y''' (including the observed one) that have the same value of the test statistic obtained for the observed sample '''x''', or a larger one.

== Example: Pearson's chi-squared test versus an exact test ==

{{Main|Pearson's chi-squared test}}

A simple example of this concept involves the observation that [Pearson's chi-squared test](/source/Pearson's_chi-squared_test) is an approximate test.  Suppose Pearson's chi-squared test is used to ascertain whether a six-sided die is "fair", indicating that it renders each of the six possible outcomes equally often.  If the die is thrown ''n'' times, then one ["expects"](/source/expected_value) to see each outcome ''n''/6 times.  The test statistic is

: <math> \sum \frac{(\text{observed}-\text{expected})^2}{\text{expected}}
= \sum_{k=1}^6 \frac{(X_k - n/6)^2}{n/6}, </math>

where ''X''<sub>''k''</sub> is the number of times outcome ''k'' is observed.  If the null hypothesis of "fairness" is true, then the [probability distribution](/source/probability_distribution) of the test statistic can be made as close as desired to the chi-squared distribution with <nowiki>5</nowiki> degrees of freedom by making the sample size ''n'' sufficiently large.  On the other hand, if ''n'' is small, then the probabilities based on chi-squared distributions may not be sufficiently close approximations.  Finding the exact probability that this test statistic exceeds a certain value would then require a [combinatorial enumeration](/source/combinatorics) of all outcomes of the experiment that gives rise to such a large value of the test statistic.  It is then questionable whether the same test statistic ought to be used.  A [likelihood-ratio test](/source/likelihood-ratio_test) might be preferred, and the test statistic might not be a monotone function of the one above.

== Example: Fisher's exact test ==

{{Main|Fisher's exact test}}

[Fisher's exact test](/source/Fisher's_exact_test), based on the work of [Ronald Fisher](/source/Ronald_Fisher) and [E. J. G. Pitman](/source/E._J._G._Pitman) in the 1930s, is exact because the sampling distribution (conditional on the marginals) is known exactly.  This should be compared with [Pearson's chi-squared test](/source/Pearson's_chi-squared_test), which (although it tests the same null) is not exact because the distribution of the test statistic is only asymptotically correct.

==See also==
* [Exact statistics](/source/Exact_statistics)
* [Optimal discriminant analysis](/source/Optimal_discriminant_analysis)

== References ==

* [Ronald Fisher](/source/Ronald_Fisher) (1954) ''[Statistical Methods for Research Workers](/source/Statistical_Methods_for_Research_Workers)''. Oliver and Boyd.
* Mehta, C.R.; Patel, N.R. (1998). "Exact Inference for Categorical Data". In P. Armitage and T. Colton, eds., ''Encyclopedia of Biostatistics'', Chichester: John Wiley, pp.&nbsp;1411–1422. [https://resources.cytel.com/sites/default/files/resources/exact-inference-for-categorical-data.pdf unpublished preprint]
* {{Cite book | last1 = Corcoran | first1 = C. D. | last2 = Senchaudhuri | first2 = P. | last3 = Mehta | first3 = C. R. | last4 = Patel | first4 = N. R. | chapter = Exact Inference for Categorical Data | doi = 10.1002/0470011815.b2a10019 | title = Encyclopedia of Biostatistics | year = 2005 | isbn = 047084907X }}

{{statistics|inference|collapsed}}

^

[de:Statistischer Test#Exakter Test](/source/de%3AStatistischer_Test)

---
Adapted from the Wikipedia article [Exact test](https://en.wikipedia.org/wiki/Exact_test) by Wikipedia contributors ([contributor history](https://en.wikipedia.org/wiki/Exact_test?action=history)). Available under [Creative Commons Attribution-ShareAlike 4.0 International](https://creativecommons.org/licenses/by-sa/4.0/). Changes may have been made.