Title	Link
My science and philosophy books	Open
My theology books	Open
My books on Classics	Open
My literary work	Open

Appendix 8 — Glossary of Key Terms

Mean (average)
Sum of all scores divided by number of scores.
Example: (6 + 8 + 10) / 3 = 8.

Median
Middle score when data are ordered.
Example: For [5, 7, 8], median = 7.

Mode
Most frequent score.
Example: For [2, 3, 3, 5], mode = 3.

Variance (s²)
Average squared deviation from the mean.

Standard Deviation (s)
Square root of variance. Spread of scores around the mean.

Standard Error of the Mean (SEM)
How much sample means vary.
Formula: $$SEM = \frac{s}{\sqrt{n}}$$

t-test
Compares two means.

ANOVA (F-test)
Compares three or more means.

Post Hoc Test
Used after ANOVA to find which groups differ.

Correlation (r)
Strength and direction of a linear relationship. Range: –1 to +1.

Regression
Equation that predicts Y from X.
Example: $$\hat{Y} = a + bX$$

Chi-square (χ²)
Test for categorical data (counts).

Degrees of Freedom (df)
Independent pieces of information in a test.

p-value
Probability of getting the observed result (or more extreme) if the null hypothesis is true.

📱 QR: Interactive glossary (search symbols, formulas, definitions)

Practice self-test quiz

In the space below, please find practice problems and self-test quizzes. For full access, please signup free.

Appendix 6 — Data Sets for Practice

```html

Appendix 6 — Data Sets for Practice

Working with real numbers is the best way to learn statistics. This appendix provides small “mini datasets” you can analyze by hand (or with a calculator), plus larger files for practice with spreadsheets.

Dataset Provenance (Read This First)

Pedagogical = small, simplified numbers chosen to make learning and checking easier.
Simulated = computer-generated numbers designed to resemble real data (not collected from real people).
Empirical = collected from real observations (only used if explicitly stated).

Note: Unless a dataset is explicitly labeled Empirical, you should treat it as Pedagogical or Simulated practice data.

Mini Datasets (In-Page)

1) Quiz Scores

Provenance: Pedagogical
n: 10
Scale: Ratio (points)
Data: 6, 7, 8, 9, 10, 7, 8, 6, 9, 10

Suggested Lessons:
- Lesson 2 — The Averages: mean, median, mode
- Lesson 3 — Variance & Standard Deviation: variance, SD, z-scores
- Lesson 4 — The Standard Normal Curve: interpret z-scores (as a bridge)
Check values (optional): Mean = 8.0; SD ≈ 1.41

2) Reaction Times (ms)

Provenance: Pedagogical (human-like values)
n: 8
Scale: Ratio (milliseconds)
Units: ms
Data: 220, 250, 270, 230, 260, 280, 240, 300

Suggested Lessons:
- Lesson 3 — Variance & Standard Deviation: spread, outliers, SD
- Lesson 6 — The t-test: use as a template dataset (e.g., compare two conditions by splitting into two groups)
- Lesson 7 — ANOVA: extend to 3+ groups by creating conditions
Instructor tip: reaction time data often show mild skew in real life. If you want skew, see the larger practice files below.

3) Stress Reduction Scores (Three Groups)

Provenance: Pedagogical (grouped scores)
Scale: Interval/Ratio (score units; treat as interval for ANOVA practice)
Groups:

Meditation (n = 3): 65, 70, 72
Exercise (n = 3): 68, 71, 75
Music (n = 3): 75, 78, 82
Suggested Lessons:
- Lesson 7 — ANOVA: one-way ANOVA (three independent groups)
- Lesson 8 — Post Hoc Tests: follow-up comparisons after ANOVA (conceptual)
- Lesson 13 — Degrees of Freedom Cookbook: df for one-way ANOVA
Important note: The sample sizes are intentionally small for learning mechanics. In real studies, groups are usually larger.

Larger Practice Datasets (Download Files)

These datasets are designed for spreadsheet work, graphing, and full problem sets.

Exam Scores (n = 100)
Provenance: Simulated
Suggested Lessons: Lesson 4 (normal curve), Lesson 5 (SEM), Lesson 6 (t-test foundations)
Survey Data (preferences by gender/age)
Provenance: Simulated (categorical practice)
Suggested Lessons: Lesson 12 (chi-square), Lesson 1 (why statistics matters in decisions)
Simulated Medical Trial (treatment vs. control, repeated measures)
Provenance: Simulated (instructional “trial-style” dataset; not clinical research)
Suggested Lessons: Lesson 6 (t-test concepts), Lesson 7 (variance partitioning concepts), and for advanced learners: repeated-measures ideas (optional)

Downloads: CSV and Excel files are provided via the QR code(s) on this page (and/or direct links, if enabled on your device).

Reproducibility note (simulated files): If you revise these datasets in future editions, consider generating them with a fixed random seed so instructors and students can reproduce results across versions.

Trusted External Sources (Optional)

If you want additional datasets beyond the practice files above, the following repositories are widely used for learning and benchmarking:

NIST Statistical Reference Datasets (SRD)
High-quality benchmark datasets for practice and verification (excellent for checking calculations and software).
UCI Machine Learning Repository
Larger, more complex datasets. Recommended only for advanced students or enrichment projects.

Visual Reference

Figure F.1 — Example spreadsheet view of a dataset (columns such as ID, Score, Group). Use this as a template for organizing your own data before running calculations.

Self-Test Quiz Access

Practice problems and self-test quizzes may appear below. If full access is restricted, please sign up (free) to unlock the quiz section.

```

Appendix 1 — Symbols and Notation (Cheat Sheet)

A quick reference to the symbols used in this book.

Symbol	Meaning	Example
$$\Sigma$$	Summation (add them up)	$$\Sigma X = 2+4+6=12$$
$$\bar{X}$$	Sample mean	$$\bar{X} = \tfrac{12}{3} = 4$$
$$\mu$$	Population mean	“The true average of all scores”
$$s$$	Sample standard deviation	Spread of quiz scores
$$\sigma$$	Population standard deviation	Spread of SAT scores
$$df$$	Degrees of freedom	$$df = n-1 = 29$$ if $$n=30$$
$$t$$	t-test statistic	Compare two group means
$$F$$	ANOVA statistic	Compare 3+ group means
$$r$$	Pearson correlation	Strength of linear relationship
$$R^2$$	Coefficient of determination	Proportion of variance explained
$$\chi^2$$	Chi-square statistic	Compare observed vs. expected counts
$$p$$	Probability value	“p < 0.05” → significant result

Practice self-test quiz

In the space below, please find practice problems and self-test quizzes. For full access, please signup free.

chi-square

Practice self-test quiz

Appendix 6 — Data Sets for Practice

Dataset Provenance (Read This First)

Mini Datasets (In-Page)

1) Quiz Scores

2) Reaction Times (ms)

3) Stress Reduction Scores (Three Groups)

Larger Practice Datasets (Download Files)

Trusted External Sources (Optional)

Visual Reference

Self-Test Quiz Access

Practice self-test quiz