flowchart LR
B[Binomial<br/>n, p] -->|n→∞, p→0, np=λ| P[Poisson λ]
P -->|λ → ∞| N[Normal μ, σ]
B -->|n→∞, p=0.5| N
classDef default fill:#003366,color:#ffffff,stroke:#ffcc00,stroke-width:3px,rx:10px,ry:10px;
43 Probability distributions: Binomial, poisson and normal distributions
43.1 Random Variables and Distributions
A random variable is a real-valued function defined on the sample space of a random experiment. Its probability distribution describes how probabilities are assigned to its possible values. Random variables come in two flavours: discrete (countable values — number of heads, defectives, customer arrivals) and continuous (uncountable values — height, weight, time, price). This topic covers the three most important distributions in business statistics: the Binomial and Poisson (discrete) and the Normal (continuous). They are the workhorses of probability and the gateway to inferential statistics.
43.2 Binomial Distribution
The Binomial distribution describes the number of successes in n independent Bernoulli trials, each with probability of success p.
- Fixed number of trials n.
- Each trial has only two outcomes — success or failure.
- Constant probability p of success.
- Trials are independent.
43.2.1 Probability Mass Function (PMF)
\[P(X = k) = \binom{n}{k} p^k (1-p)^{n-k}, \quad k = 0, 1, \ldots, n\]
43.2.2 Mean and Variance
| Parameter | Value |
|---|---|
| Mean | np |
| Variance | npq = np(1−p) |
| Standard Deviation | √(npq) |
| Mode | ≈ np |
| Range | 0 to n |
Note: For binomial, mean > variance since q < 1.
43.2.3 Examples
- Coin tosses — number of heads in n tosses (p = 0.5).
- Quality control — number of defectives in a batch of n.
- Marketing — number of customers responding positively out of n contacted.
43.3 Poisson Distribution
The Poisson distribution describes the number of events occurring in a fixed interval of time or space, when events occur independently at a constant average rate λ. Discovered by Siméon Denis Poisson (1837).
43.3.1 Probability Mass Function
\[P(X = k) = \frac{e^{-\lambda} \lambda^k}{k!}, \quad k = 0, 1, 2, \ldots\]
43.3.2 Mean and Variance
| Parameter | Value |
|---|---|
| Mean | λ |
| Variance | λ |
| Standard Deviation | √λ |
| Range | 0 to ∞ |
Mean = Variance is the defining property of the Poisson distribution.
43.3.3 Conditions for Poisson
- Discrete events in a continuous interval.
- Independent — one event doesn’t affect another.
- Constant rate λ in non-overlapping intervals.
- Rare events — many trials, small p, finite λ = np.
43.3.4 Binomial → Poisson Approximation
When n is large (n → ∞) and p is small (p → 0) with np = λ fixed, the Binomial distribution approaches the Poisson distribution with mean λ.
43.3.5 Examples
- Number of customer arrivals per hour at a bank counter.
- Number of telephone calls in a switchboard per minute.
- Defects in a long bolt of cloth.
- Goals scored in a football match.
- Accidents at an intersection per day.
43.4 Normal (Gaussian) Distribution
The Normal distribution is the most important continuous distribution. Discovered by Abraham de Moivre (1733), developed by Carl Friedrich Gauss (1809), applied to errors by Laplace. Its bell-shape arises from the Central Limit Theorem — the sum/mean of many independent random variables tends to normal regardless of their original distributions.
43.4.1 Probability Density Function
\[f(x) = \frac{1}{\sigma \sqrt{2\pi}} e^{-(x - \mu)^2 / 2\sigma^2}, \quad -\infty < x < \infty\]
43.4.2 Properties
- Bell-shaped, symmetric about μ.
- Mean = Median = Mode = μ.
- Skewness = 0; Kurtosis β₂ = 3 (mesokurtic).
- Defined by two parameters: μ (mean) and σ (SD).
- Total area under the curve = 1.
- Asymptotic to x-axis — never touches.
-
Empirical Rule (68-95-99.7):
- ≈ 68 % within μ ± 1σ
- ≈ 95 % within μ ± 2σ
- ≈ 99.7 % within μ ± 3σ.
43.4.3 Standard Normal Distribution
The standard normal has μ = 0 and σ = 1, denoted Z. Any normal variable is converted:
\[Z = \frac{X - \mu}{\sigma}\]
Z-tables give cumulative probabilities of the standard normal.
43.4.4 Central Limit Theorem (CLT)
Central Limit Theorem: For a random sample of size n from a population with mean μ and finite variance σ², the sampling distribution of the sample mean tends to be normal with mean μ and SD σ/√n as n → ∞, regardless of the population’s distribution.
Rule of thumb: n ≥ 30 usually suffices.
| Property | Binomial | Poisson | Normal |
|---|---|---|---|
| Type | Discrete | Discrete | Continuous |
| Parameters | n, p | λ | μ, σ |
| Range | 0 to n | 0 to ∞ | −∞ to +∞ |
| Mean | np | λ | μ |
| Variance | npq | λ | σ² |
| Symmetric? | Only if p = 0.5 | No (skewed for small λ) | Always |
| Limiting form | → Poisson (np = λ, n→∞, p→0) | → Normal (large λ) | — |
PYQ trap: Poisson — Mean = Variance = λ. Binomial — Mean (np) > Variance (npq) because q < 1. Normal — Mean, Median, Mode coincide.
43.5 Other Distributions — Brief
- Bernoulli — single trial with two outcomes (special case of Binomial with n = 1).
- Geometric — number of trials until first success.
- Negative Binomial — number of trials until r-th success.
- Hypergeometric — sampling without replacement.
- Uniform (Rectangular) — equal probability over an interval.
- Exponential — time between Poisson events; memoryless.
- t-distribution — Student’s t (Gosset 1908); used when σ is unknown.
- Chi-square (χ²) — sum of squared standard normals; goodness-of-fit, independence.
- F-distribution — ratio of two chi-squares; ANOVA.
43.6 Practice Questions
Mean of a Binomial distribution is:
View solution
Mean and variance of Poisson distribution are:
View solution
The standard normal distribution has:
View solution
Approximately 95 % of observations of a normal distribution lie within:
View solution
The Central Limit Theorem says the sampling distribution of the sample mean is:
View solution
Binomial approaches Poisson when:
View solution
For Binomial with n = 10, p = 0.5: mean and SD are:
View solution
The normal distribution is associated with:
View solution
For X ~ N(50, 10²), the Z-score of X = 70 is:
View solution
Which is **best modeled** by Poisson?
View solution
Normal distribution is:
View solution
Match each distribution with its parameters:
| Distribution | Parameters | ||
| (i) | Binomial | (a) | λ |
| (ii) | Poisson | (b) | μ, σ |
| (iii) | Normal | (c) | n, p |
View solution
Rule of thumb for n in the CLT:
View solution
Standard error of the sample mean is:
View solution
The *t-distribution* was developed by:
View solution
Which is a **continuous** distribution?
View solution
In a normal distribution:
View solution
Siméon Denis Poisson discovered the distribution in:
View solution
A Bernoulli distribution is a special case of:
View solution
Time *between* successive Poisson events follows:
View solution
43.7 Quick Recall
- Random variable: discrete (Binomial, Poisson) vs continuous (Normal).
- Binomial(n, p): PMF \(\binom{n}{k} p^k q^{n-k}\); Mean = np; Variance = npq.
- Poisson(λ): PMF \(e^{-\lambda} \lambda^k / k!\); Mean = Variance = λ.
- Binomial → Poisson when n→∞, p→0, np = λ.
- Normal(μ, σ²): bell-shaped, symmetric; Mean = Median = Mode = μ. Empirical Rule 68-95-99.7.
- Standard normal Z = (X − μ)/σ ~ N(0,1).
- Central Limit Theorem — sample mean → Normal(μ, σ/√n) for n ≥ 30.
- t-distribution (Gosset 1908 “Student”); χ² (sum of squared standard normals); F (ratio of χ²s).
- Exponential — time between Poisson events; memoryless.
- Bernoulli — Binomial with n = 1.