The normal distribution

The video below shows an example of a Galton board or a Quincunx board, which demonstrates how the normal distribution arises from the sum of many independent random variables.

This video demonstrates a device known as a Galton Board (also called a quincunx or bean machine). It is a desktop probability machine that visualizes abstract statistical concepts.

Here is a breakdown of what is happening:

This machine, invented by Sir Francis Galton in the 19th century, is a perfect physical demonstration of the Central Limit Theorem. It shows how the sum of many independent random variables (the left/right bounces) tends to produce a normal distribution, regardless of the original distribution of the variables.

Simulating a Galton board

Use the interactive simulation below to experiment with the Central Limit Theorem and the concept of Drift.

  • Number of balls: Controls the sample size. This demonstrates the Law of Large Numbers: as the number of trials increases, the empirical distribution converges to the theoretical distribution (the red curve).
  • Bias: Represents the probability of a “right” move versus a “left” move at each peg.
    • 50% Bias: Represents a standard random walk where \(p=0.5\). The distribution centers in the middle.
    • > 50% Bias: Analogous to an asset with a positive expected return (positive drift). The distribution shifts right.
    • < 50% Bias: Analogous to a negative drift. The distribution shifts left.

Visualizing the normal density

Below you can find a plot of two density functions from a Normal distribution. The blue curve shows the standard Normal(0, 1), while the red curve lets you adjust the mean and standard deviation to see how the shape shifts.

The relationship between a general normal variable \(X\) and the standard normal variable \(Z\) is given by the Z-score formula: \[ Z = \frac{X - \mu}{\sigma} \] This standardization allows us to compare different normal distributions and is fundamental for calculations like Value-at-Risk (VaR).

Change those parameters using the sliders below and see how this affects the density function (area in red).

For more on the Normal distribution refer to:

The normal density for \(X \sim \mathcal{N}(\mu, \sigma^2)\) is

\[ f_X(x) = \frac{1}{\sigma \sqrt{2\pi}}\exp\left(-\frac{(x-\mu)^2}{2\sigma^2}\right). \]

Financial Context: Returns vs. Prices

In finance, we often assume that the log-returns of an asset are normally distributed: \[ \ln(S_t/S_{t-1}) \sim \mathcal{N}(\mu, \sigma^2) \] This implies that the asset prices themselves follow a Log-Normal Distribution.

  • Limited Liability: While returns can be negative, asset prices cannot be negative. This is a key reason we model prices as Log-Normal (domain \(0\) to \(\infty\)) rather than Normal (domain \(-\infty\) to \(\infty\)).

The Cumulative Distribution Function (CDF) of the standard normal distribution, often denoted as \(N(x)\) or \(\Phi(x)\), represents the probability that a variable takes a value less than or equal to \(x\) (the area under the curve to the left). This plays a critical role in the Black-Scholes-Merton option pricing model (specifically in the \(N(d_1)\) and \(N(d_2)\) terms).

Critique: Real-world financial data often deviates from normality:

  • “Fat Tails” (Excess Kurtosis): Extreme events (crashes or booms) happen more frequently than predicted.
  • Skewness: Markets often exhibit asymmetry (e.g., equity markets frequently have negative skewness, implying higher crash risk).