Matter = require("https://cdn.jsdelivr.net/npm/[email protected]/build/matter.min.js");The normal distribution
The video below shows an example of a Galton board or a Quincunx board, which demonstrates how the normal distribution arises from the sum of many independent random variables.
This video demonstrates a device known as a Galton Board (also called a quincunx or bean machine). It is a desktop probability machine that visualizes abstract statistical concepts.
Here is a breakdown of what is happening:
- The Setup: The device contains thousands of tiny steel beads. When the board is flipped, the beads flow through a funnel at the top and cascade down through rows of pegs arranged in a hexagonal pattern.
- The “Random Walk”: As each bead hits a peg, it has a roughly 50/50 chance of bouncing to the left or to the right. This happens repeatedly as the bead falls through multiple rows of pegs.
- The Result: While the path of any single bead is random and unpredictable, the result of thousands of beads falling is mathematically predictable. To end up in the bins on the far left or far right, a bead would have to bounce in the same direction almost every single time (e.g., Left-Left-Left-Left), which is statistically rare. It is much more likely for a bead to bounce left and right roughly an equal number of times, causing it to land in the center bins.
- The Bell Curve: As the beads accumulate at the bottom, they form a shape known as the Normal Distribution (or Bell Curve).
This machine, invented by Sir Francis Galton in the 19th century, is a perfect physical demonstration of the Central Limit Theorem. It shows how the sum of many independent random variables (the left/right bounces) tends to produce a normal distribution, regardless of the original distribution of the variables.
Simulating a Galton board
Use the interactive simulation below to experiment with the Central Limit Theorem and the concept of Drift.
- Number of balls: Controls the sample size. This demonstrates the Law of Large Numbers: as the number of trials increases, the empirical distribution converges to the theoretical distribution (the red curve).
- Bias: Represents the probability of a “right” move versus a “left” move at each peg.
- 50% Bias: Represents a standard random walk where \(p=0.5\). The distribution centers in the middle.
- > 50% Bias: Analogous to an asset with a positive expected return (positive drift). The distribution shifts right.
- < 50% Bias: Analogous to a negative drift. The distribution shifts left.
Visualizing the normal density
Below you can find a plot of two density functions from a Normal distribution. The blue curve shows the standard Normal(0, 1), while the red curve lets you adjust the mean and standard deviation to see how the shape shifts.
The relationship between a general normal variable \(X\) and the standard normal variable \(Z\) is given by the Z-score formula: \[ Z = \frac{X - \mu}{\sigma} \] This standardization allows us to compare different normal distributions and is fundamental for calculations like Value-at-Risk (VaR).
Change those parameters using the sliders below and see how this affects the density function (area in red).
For more on the Normal distribution refer to:
The normal density for \(X \sim \mathcal{N}(\mu, \sigma^2)\) is
\[ f_X(x) = \frac{1}{\sigma \sqrt{2\pi}}\exp\left(-\frac{(x-\mu)^2}{2\sigma^2}\right). \]
Financial Context: Returns vs. Prices
In finance, we often assume that the log-returns of an asset are normally distributed: \[ \ln(S_t/S_{t-1}) \sim \mathcal{N}(\mu, \sigma^2) \] This implies that the asset prices themselves follow a Log-Normal Distribution.
- Limited Liability: While returns can be negative, asset prices cannot be negative. This is a key reason we model prices as Log-Normal (domain \(0\) to \(\infty\)) rather than Normal (domain \(-\infty\) to \(\infty\)).
The Cumulative Distribution Function (CDF) of the standard normal distribution, often denoted as \(N(x)\) or \(\Phi(x)\), represents the probability that a variable takes a value less than or equal to \(x\) (the area under the curve to the left). This plays a critical role in the Black-Scholes-Merton option pricing model (specifically in the \(N(d_1)\) and \(N(d_2)\) terms).
Critique: Real-world financial data often deviates from normality:
- “Fat Tails” (Excess Kurtosis): Extreme events (crashes or booms) happen more frequently than predicted.
- Skewness: Markets often exhibit asymmetry (e.g., equity markets frequently have negative skewness, implying higher crash risk).