


























⭐ Central Limit Theorem (CLT)
The Central Limit Theorem says something surprisingly powerful:
If you take many random samples and compute their means,
the distribution of those sample means will be approximately normal,
even if the original population is not normal —
as long as the sample size is large enough.
📌 What the CLT guarantees
- The shape of the sampling distribution of the mean becomes normal.
- The center of that distribution is the true population mean
.
- The spread of that distribution is
where = population standard deviation
= sample size
This spread is called the standard error.
🎯 Why it matters
The CLT is the reason we can:
- use normal distributions to make inferences
- build confidence intervals
- run hypothesis tests
- trust sample means as good estimators
Without the CLT, modern statistics basically collapses.
⭐ Confidence Intervals (CI)
A confidence interval gives a range of plausible values for a population parameter (usually the mean).
The general form is:
For a mean, the formula is:
Where:
= sample mean
= sample standard deviation
= sample size
= critical value (e.g., 1.96 for 95% confidence)
📌 Interpretation
A 95% confidence interval means:
If we repeated the sampling process many times,
about 95% of the intervals we build would contain the true population mean.
It does not mean “there is a 95% chance the true mean is in this specific interval.”
The true mean is fixed; the interval is what varies.
⭐ How CLT and CI connect
Confidence intervals rely on the idea that:
- The sampling distribution of the mean is normal
- The standard error tells us how much sample means vary
Both of these come directly from the Central Limit Theorem.
So the logic is:
- CLT → sample means follow a normal distribution
- Normal distribution → we can use z‑scores
- z‑scores → we can build confidence intervals
This is why the CLT is the backbone of inferential statistics.
🎨 A simple analogy
Imagine you’re trying to guess the average height of all people in Norway.
- Each sample you take gives a different mean.
- Those sample means bounce around the true mean.
- The CLT says those sample means form a bell curve.
- A confidence interval is like saying:
“Given how much sample means typically bounce around, here’s a range that probably contains the true average height.”
⭐ Examples of the Central Limit Theorem (CLT)
Example 1: Candy Weights
Suppose each piece of candy from a factory has a weight that is not normally distributed — maybe it’s skewed because machines sometimes overfill.
- Population mean weight: 10 grams
- Population standard deviation: 2 grams
- Distribution: skewed right
Now take samples of size and compute the sample mean each time.
Even though the individual weights are skewed, the distribution of sample means will be:
- approximately normal
- centered at 10 grams
- with standard error
This is the CLT in action:
skewed population → normal sampling distribution (if n is large).
Example 2: Rolling Dice
A single die roll is uniform, not normal.
But if you take the average of 30 rolls, and repeat that process many times, the distribution of those averages becomes:
- bell‑shaped
- centered at 3.5
- with standard error
Again: non‑normal population → normal sample means.
⭐ Examples of Confidence Intervals (CI)
Example 1: Estimating Average Height
You measure the heights of 50 students.
- Sample mean:
cm
- Sample standard deviation:
cm
- Confidence level: 95% →
Standard error:
Margin of error:
Confidence interval:
Interpretation:
We are 95% confident the true average height of the population is between 167.8 and 172.2 cm.
Example 2: Proportion of Students Who Own a Pet
Out of 200 students, 120 own a pet.
- Sample proportion:
- Standard error:
- 95% CI:
Interpretation:
We are 95% confident that 53.2% to 66.8% of all students own a pet.
⭐ A Combined Example (CLT + CI Together)
Estimating the Average Time Spent on TikTok
A researcher wants to estimate how long teenagers spend on TikTok daily.
They take a random sample of 64 teens.
- Sample mean:
minutes
- Sample standard deviation:
minutes
Because of the Central Limit Theorem, the sampling distribution of the mean is approximately normal (n = 64 is large).
Standard error:
95% CI:
Interpretation:
We are 95% confident the true average TikTok time for teens is between 87.7 and 102.4 minutes per day.
Discover more from Knowledge sparks
Subscribe to get the latest posts sent to your email.