


























⭐ Central Limit Theorem (CLT)
The Central Limit Theorem says something surprisingly powerful:
If you take many random samples and compute their means,
the distribution of those sample means will be approximately normal,
even if the original population is not normal —
as long as the sample size is large enough.
📌 What the CLT guarantees
- The shape of the sampling distribution of the mean becomes normal.
- The center of that distribution is the true population mean
.
- The spread of that distribution is
where = population standard deviation
= sample size
This spread is called the standard error.
🎯 Why it matters
The CLT is the reason we can:
- use normal distributions to make inferences
- build confidence intervals
- run hypothesis tests
- trust sample means as good estimators
Without the CLT, modern statistics basically collapses.
⭐ Confidence Intervals (CI)
A confidence interval gives a range of plausible values for a population parameter (usually the mean).
The general form is:
For a mean, the formula is:
Where:
= sample mean
= sample standard deviation
= sample size
= critical value (e.g., 1.96 for 95% confidence)
📌 Interpretation
A 95% confidence interval means:
If we repeated the sampling process many times,
about 95% of the intervals we build would contain the true population mean.
It does not mean “there is a 95% chance the true mean is in this specific interval.”
The true mean is fixed; the interval is what varies.
⭐ How CLT and CI connect
Confidence intervals rely on the idea that:
- The sampling distribution of the mean is normal
- The standard error tells us how much sample means vary
Both of these come directly from the Central Limit Theorem.
So the logic is:
- CLT → sample means follow a normal distribution
- Normal distribution → we can use z‑scores
- z‑scores → we can build confidence intervals
This is why the CLT is the backbone of inferential statistics.
🎨 A simple analogy
Imagine you’re trying to guess the average height of all people in Norway.
- Each sample you take gives a different mean.
- Those sample means bounce around the true mean.
- The CLT says those sample means form a bell curve.
- A confidence interval is like saying:
“Given how much sample means typically bounce around, here’s a range that probably contains the true average height.”
⭐ Examples of the Central Limit Theorem (CLT)
Example 1: Candy Weights
Suppose each piece of candy from a factory has a weight that is not normally distributed — maybe it’s skewed because machines sometimes overfill.
- Population mean weight: 10 grams
- Population standard deviation: 2 grams
- Distribution: skewed right
Now take samples of size and compute the sample mean each time.
Even though the individual weights are skewed, the distribution of sample means will be:
- approximately normal
- centered at 10 grams
- with standard error
This is the CLT in action:
skewed population → normal sampling distribution (if n is large).
Example 2: Rolling Dice
A single die roll is uniform, not normal.
But if you take the average of 30 rolls, and repeat that process many times, the distribution of those averages becomes:
- bell‑shaped
- centered at 3.5
- with standard error
Again: non‑normal population → normal sample means.
⭐ Examples of Confidence Intervals (CI)
Example 1: Estimating Average Height
You measure the heights of 50 students.
- Sample mean:
cm
- Sample standard deviation:
cm
- Confidence level: 95% →
Standard error:
Margin of error:
Confidence interval:
Interpretation:
We are 95% confident the true average height of the population is between 167.8 and 172.2 cm.
Example 2: Proportion of Students Who Own a Pet
Out of 200 students, 120 own a pet.
- Sample proportion:
- Standard error:
- 95% CI:
Interpretation:
We are 95% confident that 53.2% to 66.8% of all students own a pet.
⭐ A Combined Example (CLT + CI Together)
Estimating the Average Time Spent on TikTok
A researcher wants to estimate how long teenagers spend on TikTok daily.
They take a random sample of 64 teens.
- Sample mean:
minutes
- Sample standard deviation:
minutes
Because of the Central Limit Theorem, the sampling distribution of the mean is approximately normal (n = 64 is large).
Standard error:
95% CI:
Interpretation:
We are 95% confident the true average TikTok time for teens is between 87.7 and 102.4 minutes per day.