Watch the sample mean distribution converge to normal as sample size grows. The CLT is why actuaries can model aggregate claims with the normal distribution.
Population Distribution
Sampling Parameters
n = 5
1,000 draws
Results
๐ What Is the Central Limit Theorem?
The Central Limit Theorem is one of the most important results in all of probability and statistics. It says:
If Xโ, Xโ, ..., Xn are i.i.d. with E[Xแตข] = ฮผ and Var(Xแตข) = ฯยฒ, then
as n โ โ: โn ยท (X_n โ ฮผ) / ฯ โd N(0, 1)
In plain language: the distribution of the sample mean (xฬ = (Xโ+โฆ+Xn)/n) approaches a normal distribution as the sample size grows โ no matter what the original population distribution looks like.
The Two Key Facts
E[xฬ] = ฮผ โ the sample mean is centered at the population mean
Var(xฬ) = ฯยฒ/n โ the variance shrinks as n increases
๐ The Convergence Journey โ Step by Step
n = 1 โ xฬ follows the population distribution
n = 5 โ starting to converge, but still rough
n = 30 โ good Normal approximation (common rule of thumb)
n = 100 โ almost perfectly normal, much narrower
๐ Why Does the Variance Shrink?
Each observation adds independent information. When you average n i.i.d. variables:
The standard deviation of xฬ is ฯ/โn. This means to halve the uncertainty, you need four times the sample size. Here's what it looks like:
Notice the huge drop between n=1 and n=10, then diminishing returns. That's why on the exam, you're usually told n = 30+ is "large enough" (for most distributions).
๐ฏ Why Actuaries Need the CLT
Real insurance problems don't come with "the data is normally distributed" pre-attached. The CLT gives actuaries a superpower: they can make normal approximations about aggregate claims from any underlying loss distribution.
๐ Exam P Applications
Sum of i.i.d. variables approximates normal โ even if the individual variables are skewed (like claim amounts)
Approximate probabilities for total loss: S = Xโ + ... + Xn โ N(nฮผ, nฯยฒ)
Normal approximation to the Binomial: when n is large, Bin(n,p) โ N(np, np(1-p))
Normal approximation to Poisson: Poisson(ฮป) โ N(ฮป, ฮป) for large ฮป
๐ข Real-World Use
Pricing: Total premium = E[S] + safety loading based on standard deviation
Reserving: Estimate the 95th percentile of total claims
Reinsurance: Price stop-loss treaties using normal approximations
Solvency: Calculate probability of ruin / capital adequacy
โ ๏ธ When Does the CLT Not Apply?
The CLT is powerful, but it has conditions. Here's when to be careful:
n โฅ 30
Rule of thumb for "large n" works for most distributions
Symmetric
Distributions need n โ 15-20
Heavy-tailed
May need n โซ 30 to converge
Requires i.i.d. variables โ independently drawn from the same distribution
Requires finite variance โ Cauchy distribution has no mean/variance, CLT does not apply
Note: Rules P and CAS exam questions are multiple-choice, so they often test whether you can set up the CLT approximation correctly โ not just compute the final z-score.