Statistics 2nd ed

Lesson 18 — AI and Neural Networks (Intro)

Artificial Intelligence (AI) aims to build systems that can learn, adapt, and make decisions.
One powerful tool is the neural network, inspired by the brain.

From Statistics to AI

Regression predicts a numeric outcome Y from predictors X
Logistic regression predicts a probability (between 0 and 1)
Neural networks generalize this idea: many inputs, many layers, nonlinear patterns

The Structure of a Neural Network

Input layer — variables (X₁, X₂, …)
Hidden layers — units that transform the input
Output layer — prediction or classification

Each connection has a weight (like a slope in regression).

Formula for a Neuron

A single unit in the network:

$$z = \sum_{i} w_i X_i + b$$

$$y = f(z)$$

Where:

$$w_i$$ = weights
$$X_i$$ = inputs
$$b$$ = bias (like an intercept)
$$f(z)$$ = activation function (e.g., logistic, ReLU)
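
The two formulas above can be sketched in a few lines of Python. This is a minimal illustration rather than a full network: the function names (`sigmoid`, `relu`, `neuron`) and the input, weight, and bias values are assumptions chosen so the arithmetic is easy to check by hand.

```python
import math

def sigmoid(z):
    """Logistic activation: maps any real z to a value between 0 and 1."""
    return 1.0 / (1.0 + math.exp(-z))

def relu(z):
    """ReLU activation: passes positive z through, returns 0 otherwise."""
    return max(0.0, z)

def neuron(inputs, weights, bias, activation):
    """Compute y = f(z), where z = sum(w_i * X_i) + b."""
    z = sum(w * x for w, x in zip(weights, inputs)) + bias
    return activation(z)

# Hypothetical values, chosen so that z = 0.5*1 + (-0.25)*2 + 0 = 0
x = [1.0, 2.0]
w = [0.5, -0.25]
b = 0.0

print(neuron(x, w, b, sigmoid))  # 0.5, because the logistic function of 0 is 0.5
print(neuron(x, w, b, relu))     # 0.0
```

Swapping the activation function changes the output while the weighted sum z stays the same, which is why the choice of f matters.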

Learning in a Network

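The network predicts outputs and compares them with the true answers.
The error is passed backward through the network, layer by layer, to work out how much each weight contributed to it, and each weight is then adjusted to reduce the error.
This is called backpropagation; the weight updates are usually made by gradient descent.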
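
As a rough sketch of this learning loop, the code below trains a single logistic neuron, which is the simplest case: with only one unit there is just one step to propagate the error back through. The toy data (the logical OR of two inputs), the learning rate, and the number of passes are assumptions for illustration. It also assumes the log-loss error measure, for which the error signal at the neuron is simply prediction minus true answer.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def predict(x, w, b):
    """Forward pass: weighted sum followed by the logistic activation."""
    return sigmoid(sum(wi * xi for wi, xi in zip(w, x)) + b)

def train(data, labels, lr=0.5, epochs=2000):
    """Fit weights and bias by repeated forward and backward passes."""
    w = [0.0] * len(data[0])
    b = 0.0
    for _ in range(epochs):
        for x, y in zip(data, labels):
            y_hat = predict(x, w, b)   # forward: make a prediction
            error = y_hat - y          # compare with the true answer
            # backward: each weight moves against its share of the error
            w = [wi - lr * error * xi for wi, xi in zip(w, x)]
            b -= lr * error
    return w, b

# Toy data (assumed): output is 1 if either input is 1
data = [[0.0, 0.0], [0.0, 1.0], [1.0, 0.0], [1.0, 1.0]]
labels = [0, 1, 1, 1]

w, b = train(data, labels)
predictions = [1 if predict(x, w, b) >= 0.5 else 0 for x in data]
print(predictions)
```

In a network with hidden layers the same idea applies, but the error signal must be passed back through each layer in turn using the chain rule.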

Example

Predicting if a student will pass or fail based on:

Study hours
Attendance
Practice problems completed

Inputs → combined with weights → logistic activation → output: probability of passing.
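
The pipeline above might look like this in code. The weights and bias here are hypothetical numbers picked for illustration, not values fitted to real student data, and `attendance_rate` is assumed to be a fraction between 0 and 1.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def pass_probability(study_hours, attendance_rate, problems_done):
    """Combine the three inputs with weights, then apply the logistic activation."""
    # Hypothetical weights and bias (assumed for illustration, not estimated from data)
    w_hours, w_attendance, w_problems = 0.2, 2.0, 0.05
    bias = -4.0
    z = (w_hours * study_hours
         + w_attendance * attendance_rate
         + w_problems * problems_done
         + bias)
    return sigmoid(z)

# A student with 10 study hours, 90% attendance, and 20 problems completed:
# z = 2.0 + 1.8 + 1.0 - 4.0 = 0.8
p = pass_probability(study_hours=10, attendance_rate=0.9, problems_done=20)
print(f"Estimated probability of passing: {p:.2f}")  # 0.69
```

In practice the weights would be learned from past students' records rather than set by hand.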

Visuals

Figure 18.1 — Simple Neural Network (Inputs → Hidden → Output)

Figure 18.2 — Activation Functions

Why This Matters

Neural networks extend regression and logistic regression.
They allow learning from large, complex datasets (images, speech, language).
Modern AI (translation, recognition, chatbots) is powered by these models.

Lesson 15 — Resampling and Simulation

Classical statistics uses formulas and tables.
Modern computing gives us another way: resampling and simulation.

Instead of relying only on theory, we let the computer generate thousands of samples and see what happens.

Bootstrapping

Bootstrapping means resampling with replacement from the original data.

Steps:

Draw a resample of size $$n$$ (the size of the original data) from the data, with replacement.
Compute the statistic (mean, median, correlation).
Repeat thousands of times.
Use the distribution of resampled statistics to estimate confidence intervals.

Example:
Data = [5, 6, 7, 9].
Resample 1000 times, computing the mean each time.
The spread of these 1000 means estimates how much the sample mean would vary from sample to sample.
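
A minimal sketch of this example, using only the Python standard library. The function name `bootstrap_means`, the fixed seed, and the simple percentile method for the 95% interval are assumptions; other interval methods exist.

```python
import random
import statistics

def bootstrap_means(data, n_resamples=1000, seed=0):
    """Resample the data with replacement and return the mean of each resample."""
    rng = random.Random(seed)  # fixed seed so the run is repeatable
    n = len(data)
    return [statistics.mean(rng.choices(data, k=n)) for _ in range(n_resamples)]

data = [5, 6, 7, 9]
means = sorted(bootstrap_means(data))

# Percentile interval: cut off the lowest and highest 2.5% of resampled means
lower = means[round(0.025 * (len(means) - 1))]
upper = means[round(0.975 * (len(means) - 1))]

print("Sample mean:", statistics.mean(data))
print("Bootstrap standard error:", statistics.stdev(means))
print("Approximate 95% interval:", lower, "to", upper)
```

With only four data points the interval is rough; the method is more useful with larger samples.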

Randomization (Permutation) Tests

Randomization tests check a hypothesis by shuffling the group labels.

Steps:

Combine all data from both groups.
Randomly reassign the values to groups of the original sizes.
Compute the difference in means.
Repeat thousands of times.
Compare the observed difference to this distribution.

This shows how often a difference at least as large as the observed one would arise by chance alone.
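
The steps above can be sketched as follows. The two groups of scores are made-up numbers for illustration, and the function name `permutation_test`, the seed, and the two-sided comparison (using absolute differences) are assumptions.

```python
import random
import statistics

def permutation_test(group_a, group_b, n_permutations=10000, seed=0):
    """Return the observed difference in means and a two-sided p-value estimate."""
    rng = random.Random(seed)
    observed = statistics.mean(group_a) - statistics.mean(group_b)
    combined = list(group_a) + list(group_b)   # step 1: combine all data
    n_a = len(group_a)
    extreme = 0
    for _ in range(n_permutations):
        rng.shuffle(combined)                  # step 2: randomly reassign to groups
        diff = statistics.mean(combined[:n_a]) - statistics.mean(combined[n_a:])
        if abs(diff) >= abs(observed):         # at least as extreme as observed
            extreme += 1
    return observed, extreme / n_permutations

# Hypothetical scores for two groups
group_a = [12, 15, 14, 16, 13]
group_b = [10, 11, 9, 12, 10]

observed, p_value = permutation_test(group_a, group_b)
print("Observed difference in means:", observed)
print("Estimated p-value:", p_value)
```

A small p-value means that few random shuffles produced a difference as large as the observed one, so chance alone is an unlikely explanation.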

Monte Carlo Simulation

Monte Carlo methods use random numbers to model complex processes.

Example: Estimating $$\pi$$.

Randomly throw points into a unit square.
Count how many fall inside the quarter circle of radius 1 centered at one corner.
$$\pi \approx 4 \times \tfrac{\text{inside circle}}{\text{total points}}$$.
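
A short sketch of this estimate in Python. The function name `estimate_pi`, the number of points, and the fixed seed are assumptions; the quarter circle is taken to have radius 1 and be centered at the corner (0, 0) of the unit square.

```python
import random

def estimate_pi(n_points=100_000, seed=0):
    """Estimate pi from the share of random points that land in the quarter circle."""
    rng = random.Random(seed)  # fixed seed so the run is repeatable
    inside = 0
    for _ in range(n_points):
        x, y = rng.random(), rng.random()  # a random point in the unit square
        if x * x + y * y <= 1.0:           # within distance 1 of the corner (0, 0)
            inside += 1
    return 4 * inside / n_points

print(estimate_pi())
```

The quarter circle covers π/4 of the square's area, which is where the factor of 4 comes from. The estimate improves slowly as more points are used.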