p-Value Calculator
Find the p-value for any hypothesis test - Z, t, F, or chi-square - in one click.
📖 What is a p-Value?
The p-value is the probability of obtaining a test statistic as extreme as (or more extreme than) the one observed, assuming the null hypothesis (H₀) is true. It quantifies the evidence against the null hypothesis: a small p-value means the observed data would be unlikely if H₀ were true, providing evidence to reject it.
The p-value was introduced by Karl Pearson and popularised by Ronald Fisher in the 1920s as a measure of evidence against the null hypothesis. It has become the most widely used - and most often misunderstood - concept in statistics. Crucially, the p-value is not the probability that the null hypothesis is true. It is a conditional probability: P(data as extreme as observed | H₀ true).
In practice, p-values are used in medicine (clinical trials), psychology (experimental studies), economics (econometric analysis), quality control (process testing), and machine learning (feature selection). The conventional threshold of 0.05 was proposed by Fisher as a rule of thumb, but many fields now require stricter thresholds (0.01 or even 0.001) to reduce false positives.
This calculator computes p-values from four types of test statistics - Z, t, F, and chi-square - the four most common in introductory and intermediate statistics.
📐 Formulas
One-tailed right Z: p = 1 − Φ(Z)
One-tailed left Z: p = Φ(Z)
t-test (df): p = 2 × P(T > |t|) using t-distribution with df degrees of freedom
F-test (df₁, df₂): p = P(F > f) using F-distribution (always right-tailed)
Chi-square (df): p = P(χ² > χ²_obs) using chi-square distribution (always right-tailed)
Φ = standard normal CDF (area to the left of Z)
α = significance level. If p ≤ α: reject H₀. If p > α: fail to reject H₀.
📖 How to Use This Calculator
📝 Example Calculations
Example 1 - Z-test, Two-tailed
A Z-statistic of 2.31 is calculated from a large sample test. Tail: two-tailed. α = 0.05.
p = 2 × (1 − Φ(2.31)) = 2 × (1 − 0.9896) = 2 × 0.0104 = 0.0209
p (0.021) < α (0.05): Reject H₀ - statistically significant at 5% level.
Example 2 - t-test, Two-tailed, df = 15
t-statistic = 2.10, df = 15, two-tailed, α = 0.05.
From t-distribution with 15 df, p ≈ 0.053
p (0.053) > α (0.05): Fail to reject H₀ - not significant at 5% level (borderline).
Example 3 - Chi-square, df = 3
χ² = 9.21, df = 3 (always right-tailed), α = 0.05.
p ≈ 0.027 - less than 0.05: Reject H₀. The categorical distribution differs from expected.
Example 4 - F-test ANOVA, df₁ = 3, df₂ = 36
F = 4.28, df₁ = 3, df₂ = 36, α = 0.05.
p ≈ 0.010 - less than 0.05: Reject H₀. At least one group mean differs significantly.
Example 5 - One-tailed t-test (right), df = 29
t = 1.70, df = 29, right-tailed (H₁: μ > μ₀), α = 0.05.
p ≈ 0.050 - exactly at the threshold. Borderline significance - report both the p-value and effect size.