SAN Statistics

Central Limit Theorem (CLT)

Theorem: Let be a sequence of independent and identically distributed (i.i.d.) random variables with mean and finite variance .

Let denote the sum. Then:

where denotes convergence in distribution.

Equivalently for the sample mean :

or:

Key insight: Standardize by dividing by (or for the mean), not just .


Applications

1. Approximating distributions

  • Even if are not normally distributed, their sum/mean is approximately normal for large
  • Rule of thumb: often sufficient

2. Confidence intervals

  • Construction:

3. Hypothesis testing

  • Z-tests for means with large samples

4. Quality control

  • Process monitoring (control charts)

5. Finance

  • Portfolio returns modeling (sum of many small effects)

6. Monte Carlo simulation

  • Error estimation

Why it matters: CLT explains why the normal distribution appears so frequently in practice—any phenomenon that results from the sum of many independent factors will be approximately normally distributed.