Task	Formula	Example
Mean	`=AVERAGE(A1:A10)`	Mean of scores in A1–A10
Standard Deviation	`=STDEV.S(A1:A10)`	Spread of scores
t-test	`=T.TEST(A1:A10,B1:B10,2,2)`	Compare two groups

R (RStudio or RStudio Cloud)

Task	Command	Example
Mean	`mean(x)`	`mean(c(6,8,10)) = 8`
SD	`sd(x)`	`sd(c(6,8,10)) = 2`
t-test	`t.test(x,y)`	Compare two groups

Python (NumPy / SciPy / Pandas)

Task	Command	Example
Mean	`np.mean(x)`	`np.mean([6,8,10]) = 8`
SD	`np.std(x, ddof=1)`	`np.std([6,8,10],ddof=1) = 2`
t-test	`stats.ttest_ind(x,y)`	Compare two groups

iPhone Calculator

Rotate sideways → scientific mode
Use √ for square root
Parentheses matter: type numerator, then divide by denominator
Fine for small problems, but not for full datasets

Summary

For quick homework: iPhone calculator
For assignments: Excel / Google Sheets
For coding: Python (Colab) or R (RStudio Cloud)

📱 QR: Open sample data in Google Sheets (ready to practice mean, SD, t-test)

Visuals

Figure E.1 — Screenshots of the same mean calculation in Sheets, R, and Python side by side.

Practice self-test quiz

In the space below, please find practice problems and self-test quizzes. For full access, please signup free.

Tags

scientific-calculator

spreadsheet-formulas

coding-for-statistics

educational-statistics

online-textbook

self-test-quiz

Appendix 4 — Using the z-table

The z-table gives areas (probabilities) under the standard normal curve (mean $$\mu=0$$, SD $$\sigma=1$$).
Use it after you standardize a score:

Standardization (z-score):
$$z=\frac{x-\mu}{\sigma}$$
In words: $$z=\frac{\text{score} - \text{mean}}{\text{standard deviation}}$$

What the z-table shows

Most tables list the area to the left of a z value (cumulative probability).

Left area at $$z=0$$ is 0.5000 (half the curve).
Far left (negative big z) approaches 0; far right (positive big z) approaches 1.

Quick recipes

1) Probability below a score (left tail)
Example: $$z=1.00$$ → table gives 0.8413.
Interpretation: $$P(Z \le 1.00)=0.8413$$ (84.13% below).

2) Probability above a score (right tail)
Use complement: $$P(Z \ge z)=1-\text{left area}$$.
Example: $$z=1.00 \Rightarrow P(Z \ge 1.00)=1-0.8413=0.1587.$$

3) Probability between two scores
Subtract left areas.
Example: between $$z= -0.50$$ (left area 0.3085) and $$z=1.20$$ (0.8849):
$$P(-0.50 \le Z \le 1.20)=0.8849-0.3085=0.5764.$$

4) From a raw score to probability
Test scores: $$\mu=100, \ \sigma=15$$. What % are below 115?
Standardize: $$z=\frac{115-100}{15}=1.00 \Rightarrow 0.8413 \ (\text{84.13%}).$$

5) From probability to raw score (percentile)
What score is the 90th percentile?
Find z with left area ≈ 0.9000 → $$z \approx 1.2816$$.
Convert back: $$x=\mu+z\sigma=100+(1.2816)(15)=119.22.$$

Tips

For negative z, use the table’s symmetry: left area at $$-z$$ equals 1 − left area at $$+z$$.
Rounding: two decimals is common (e.g., 1.23).
Modern tools (calculator/Sheets/Python) can give exact p-values directly.

Visuals

Figure D.1 — Normal curve with area left of z = 1.00 shaded (0.8413).
Figure D.2 — Two-z shaded band for “between” probability.

📱 QR: Online z-calculator (type z or x, get areas instantly)

Practice self-test quiz

In the space below, please find practice problems and self-test quizzes. For full access, please signup free.

Tags

z-table

standard-normal-distribution

educational-statistics

self-test-quiz

Appendix 2 — Math Review for Statistics

Algebra refresher video (scan for a quick math warm-up)

A quick refresher on the math you’ll need in this book.

Order of Operations (PEMDAS)

Parentheses → Exponents → Multiplication/Division → Addition/Subtraction
Example:
$$3 + 2 \times (4^2) = 3 + 2 \times 16 = 35$$

Fractions and Division

Example:
$$\frac{24}{6} = 4$$

Square Roots

Example:
$$\sqrt{9} = 3$$
Example:
$$\sqrt{\frac{16}{4}} = \sqrt{4} = 2$$

Summation Notation (Σ)

Means “add them up.”
Example:
$$\Sigma X = 2+5+7 = 14$$
Example:
$$\Sigma (X-\bar{X})^2 = (2-4)^2 + (5-4)^2 + (7-4)^2 = 4+1+9 = 14$$

Exponents and Squares

$$x^2 = x \times x$$
Example:
$$5^2 = 25$$

Mini Example: Variance and Standard Deviation

Data: 6, 8, 10

Mean:
$$\bar{X} = \frac{6+8+10}{3} = 8$$
Deviations: –2, 0, +2
Squared deviations: 4, 0, 4
Variance:
$$\frac{8}{2} = 4$$
Standard deviation:
$$\sqrt{4} = 2$$

📱 QR: Algebra refresher video (scan for a quick math warm-up)

Practice self-test quiz

In the space below, please find practice problems and self-test quizzes. For full access, please signup free.

Tags

quantitative-reasoning

educational-statistics

applied-math

online-textbook

self-test-quiz

Appendix 1 — Symbols and Notation (Cheat Sheet)

A quick reference to the symbols used in this book.

Symbol	Meaning	Example
$$\Sigma$$	Summation (add them up)	$$\Sigma X = 2+4+6=12$$
$$\bar{X}$$	Sample mean	$$\bar{X} = \tfrac{12}{3} = 4$$
$$\mu$$	Population mean	“The true average of all scores”
$$s$$	Sample standard deviation	Spread of quiz scores
$$\sigma$$	Population standard deviation	Spread of SAT scores
$$df$$	Degrees of freedom	$$df = n-1 = 29$$ if $$n=30$$
$$t$$	t-test statistic	Compare two group means
$$F$$	ANOVA statistic	Compare 3+ group means
$$r$$	Pearson correlation	Strength of linear relationship
$$R^2$$	Coefficient of determination	Proportion of variance explained
$$\chi^2$$	Chi-square statistic	Compare observed vs. expected counts
$$p$$	Probability value	“p < 0.05” → significant result

Practice self-test quiz

In the space below, please find practice problems and self-test quizzes. For full access, please signup free.

Tags

educational-statistics

self-test-quiz

Lecture 3 — Variance & Standard Deviation

The mean tells us the “typical” score. But how tightly do scores cluster around the mean? Do they spread widely, or are they close together?

To answer, we measure variability. Two key measures are the variance and the standard deviation.

Variance

Variance is the average squared distance of scores from the mean.

Symbolic formula:
$$s^2 = \frac{\sum (X - \bar{X})^2}{n - 1}$$

Formula in words:
$$\text{Variance} = \frac{\text{sum of squared deviations from the mean}}{\text{number of scores} - 1}$$

Where:

$$s^2$$ = variance
$$X$$ = each score
$$\bar{X}$$ = mean
$$n$$ = number of scores

Standard Deviation

The standard deviation is the square root of the variance. It puts variability back into the same units as the data.

Symbolic formula:
$$s = \sqrt{\frac{\sum (X - \bar{X})^2}{n - 1}}$$

Formula in words:
$$\text{Standard deviation} = \sqrt{\frac{\text{sum of squared deviations from the mean}}{\text{number of scores} - 1}}$$

Example

Data: 6, 8, 10

Mean = 8
Deviations: –2, 0, 2
Squared deviations: 4, 0, 4
Sum = 8

Variance:
$$s^2 = \frac{8}{3-1} = 4$$

Standard deviation:
$$s = \sqrt{4} = 2$$

So, on average, scores are 2 units away from the mean.

Definition

Variance: average squared distance from the mean.
Standard Deviation: square root of variance; typical distance from the mean.

Visuals

Figure L3.1 — Variability Around the Mean. Dot plot of scores with the mean marked, vertical lines for deviations, and shaded boxes for squared deviations.

Why This Matters

Two sets of scores can have the same mean but very different spreads.
Variance and standard deviation give us the language to describe spread, and they are the building blocks for t-tests, ANOVA, and all inferential statistics.

Practice self-test quiz

In the space below, please find practice problems and self-test quizzes. For full access, please signup free.

Tags

descriptive-statistics

deviation-from-mean

sample-variance

population-variance

root-mean-square-deviation

data-dispersion

inferential-statistics

educational-statistics

online-textbook

self-test-quiz

Lecture 2 — The Goddess Normal Curve

The normal curve (bell curve) is one of the most important concepts in statistics.
It is elegant, symmetrical, and central to probability and inference.
It appears whenever many small, independent factors combine: height, exam scores, measurement errors.

Properties of the Normal Curve

Symmetrical around the mean
One peak (unimodal)
Mean = Median = Mode
Total area under the curve = 1 (100%)

Formula for the Normal Distribution

Symbolic formula:
$$f(x) = \frac{1}{\sigma \sqrt{2\pi}} e^{-\frac{(x - \mu)^2}{2\sigma^2}}$$

Formula in words:
$$\text{Probability density} = \frac{1}{\text{standard deviation} \times \sqrt{2\pi}} \times e^{-\frac{(\text{score} - \text{mean})^2}{2 \times (\text{standard deviation})^2}}$$

Where:

$$\mu$$ = mean
$$\sigma$$ = standard deviation
$$x$$ = a score

Standardization (z-scores)

Symbolic formula:
$$z = \frac{x - \mu}{\sigma}$$

Formula in words:
$$z = \frac{\text{score} - \text{mean}}{\text{standard deviation}}$$

A z-score tells us how many standard deviations a score is above or below the mean.

Key Percentages

Under the normal curve:

About 68% of scores are within 1 standard deviation of the mean
About 95% are within 2 standard deviations
About 99.7% are within 3 standard deviations

This is called the 68–95–99.7 rule.

Drama Box — “The Goddess Normal Curve”

Imagine a temple where a perfect curve stands tall — balanced and symmetrical.

At the center is the mean, the balance point.
Half of the people (data) stand on each side.
As you move further away, fewer remain.
The Goddess teaches fairness: most scores are near the center, extreme scores are rare.

This image helps students remember the normal curve not as a dry formula, but as a principle of balance and probability.