Statistics 2nd ed

ordinal-data

Lesson 11 — Non-parametric Tests

mann whitney
wilcoxon test
kruskal wallis test
friedman test

Most tests so far (t-tests, ANOVA, regression) are parametric.
They assume:

  • Interval/ratio data
  • Approximately normal distribution
  • Homogeneity of variance

But what if these assumptions are not met?
Or if data are ranks or categories?

Then we use non-parametric tests.
They make fewer assumptions and are based on ranks, not raw scores.


Mann–Whitney U Test

When to Use:

  • Compare two independent groups, ordinal or non-normal data.
  • Non-parametric alternative to independent t-test.

Formula:
$$U = n_1 n_2 + \frac{n_1(n_1 + 1)}{2} - R_1$$

Where $$R_1$$ = sum of ranks for group 1.

Example:
Two groups (n = 5 each) ranked by performance. Compute rank sums, plug into U formula.


Wilcoxon Signed-Rank Test

When to Use:

  • Compare the same group measured twice.
  • Ordinal or non-normal data.
  • Non-parametric alternative to paired t-test.

Procedure:

  1. Compute differences (After – Before).
  2. Rank absolute differences.
  3. Add signs.
  4. Test statistic = smaller signed sum.

Example:
5 students tested before/after training → positive ranks dominate → training helps.


Kruskal–Wallis Test

When to Use:

  • Compare 3+ independent groups.
  • Ordinal or non-normal data.
  • Non-parametric alternative to one-way ANOVA.

Formula:
$$H = \frac{12}{N(N+1)} \sum \frac{R_j^2}{n_j} - 3(N+1)$$

Where:

  • $$R_j$$ = sum of ranks in group j
  • $$n_j$$ = group size
  • $$N$$ = total number of cases

Example:
Three therapy groups (n = 6 each). Rank improvement scores → compare H to χ² distribution.


Friedman Test

When to Use:

  • Compare 3+ related groups (repeated measures).
  • Ordinal or non-normal data.
  • Non-parametric alternative to repeated-measures ANOVA.

Formula:
$$Q = \frac{12}{nk(k+1)} \sum R_j^2 - 3n(k+1)$$

Where:

  • $$R_j$$ = rank sum for each condition
  • $$n$$ = number of subjects
  • $$k$$ = number of conditions

Example:
10 participants ranked across 3 learning tasks. Compare Q to χ² distribution.


Definition

  • Non-parametric tests: statistical tests based on ranks, not raw scores
  • Used when parametric assumptions fail or data are ordinal

Visuals

Figure 11.1 — Mann–Whitney U test: two groups compared by rank distributions.

Figure 11.2 — Wilcoxon signed-rank: before/after ranks with arrows.

Figure 11.3 — Kruskal–Wallis layout: three groups compared by median ranks.

Figure 11.4 — Friedman layout: subjects compared across repeated conditions.


Why This Matters

Non-parametric tests give us flexibility.
They extend statistical reasoning to real-world data that are messy, skewed, or categorical.
They are essential tools for psychology, education, and biology.

Practice self-test quiz

In the space below, please find practice problems and self-test quizzes. For full access, please signup free.

Lesson 8 — Post Hoc Tests

tukey hsd
bonferroni

When ANOVA finds a significant F, we know that not all group means are equal.
But ANOVA does not tell us which groups differ.

For that, we need post hoc tests (Latin: after this).
They compare pairs of group means while controlling for the increased chance of error.


Tukey’s Honestly Significant Difference (HSD)

When to Use:

  • Equal group sizes
  • Pairwise comparisons after one-way ANOVA

Formula:
$$\text{HSD} = q \sqrt{\frac{MS_{\text{within}}}{n}}$$

In words:
$$\text{HSD} = \text{Studentized range statistic } q \times \sqrt{\frac{\text{mean square within groups}}{\text{sample size per group}}}$$

If the difference between two means ≥ HSD, they are significantly different.

Example:
3 groups, n = 10 each, $$MS_{\text{within}} = 16.7$$, critical $$q = 3.5$$.

$$\text{HSD} = 3.5 \times \sqrt{\tfrac{16.7}{10}} = 3.5 \times 1.29 = 4.52$$

So any pair of means that differ by 4.52 or more is significant.


Bonferroni Correction

When to Use:

  • Simple and conservative
  • Divide significance level by number of comparisons

Formula:
$$\alpha' = \frac{\alpha}{m}$$

In words:
$$\text{adjusted significance level} = \frac{\text{original significance level}}{\text{number of comparisons}}$$

Example: If α = 0.05 and 10 comparisons, α′ = 0.005 per test.


Scheffé Test

When to Use:

  • Unequal sample sizes
  • Most conservative post hoc test

Formula (summary):
Scheffé’s critical F = (k – 1) × F(critical, df_between, df_within).


Definition

  • Post hoc test: statistical test used after ANOVA to identify which means differ
  • Tukey HSD: balanced groups, pairwise
  • Bonferroni: adjusts α for multiple comparisons
  • Scheffé: conservative, flexible for unequal n

Visual Placeholders

Figure 8.1 — Tukey HSD example: three group means with horizontal bars showing which pairs differ.

Figure 8.2 — Bonferroni correction illustration: α = 0.05 split into smaller pieces.


Why This Matters

ANOVA tells us there is a difference somewhere.
Post hoc tests tell us where.
They protect against false positives while allowing multiple group comparisons.

Practice self-test quiz

In the space below, please find practice problems and self-test quizzes. For full access, please signup free.