42  Probability: Approaches to probability; Bayes’ theorem

42.1 Concept of Probability

Probability is a numerical measure of uncertainty — a number between 0 and 1 (or 0 % and 100 %) expressing the likelihood of an event. It is the mathematical language of chance and the foundation of all inferential statistics. The roots of probability theory go back to the 17th-century correspondence between Blaise Pascal and Pierre de Fermat on gambling problems (1654). It was axiomatised in modern form by A.N. Kolmogorov in 1933. Four major approaches to probability have emerged — classical, empirical (relative frequency), subjective, and axiomatic. Bayes’ theorem (Thomas Bayes, posthumously published 1763) provides the rule for updating probabilities in the light of new information — the foundation of all modern Bayesian inference.

42.2 Approaches to Probability

TipFour Approaches
Approach Working content
Classical (a priori) P(A) = favourable outcomes / total outcomes, assuming equally likely outcomes (Laplace)
Empirical (Relative Frequency) P(A) = number of times A occurs / total trials, as number of trials → ∞ (von Mises)
Subjective P(A) = degree of personal belief; basis of Bayesian school (Ramsey, de Finetti)
Axiomatic Kolmogorov’s three axioms (1933)

42.3 Kolmogorov’s Axioms

TipThree Axioms
  1. Non-negativity: P(A) ≥ 0 for every event A.
  2. Normalisation: P(S) = 1, where S is the sample space.
  3. Additivity: For mutually exclusive events A₁, A₂, …, \(P(A_1 \cup A_2 \cup \ldots) = P(A_1) + P(A_2) + \ldots\)

42.4 Basic Terms

TipBasic Probability Terminology
Term Meaning
Random experiment Process with uncertain outcome
Sample space (S) Set of all possible outcomes
Event (A) Subset of S
Mutually exclusive Events that cannot both occur
Exhaustive Events that together cover S
Independent events Occurrence of one does not affect probability of another
Complementary events A and Aᶜ; P(A) + P(Aᶜ) = 1
Conditional probability P(A

42.5 Key Theorems

42.5.1 Addition Rule (Union of Events)

TipAddition Rules
  • General: \(P(A \cup B) = P(A) + P(B) - P(A \cap B)\).
  • Mutually exclusive: \(P(A \cup B) = P(A) + P(B)\).

42.5.2 Multiplication Rule (Intersection)

TipMultiplication Rules
  • General: \(P(A \cap B) = P(A) \cdot P(B \mid A)\).
  • Independent: \(P(A \cap B) = P(A) \cdot P(B)\).

42.6 Bayes’ Theorem (1763)

For events A and B (with P(B) > 0):

\[P(A | B) = \frac{P(B | A) \cdot P(A)}{P(B)}\]

For a partition {A₁, A₂, …, Aₙ} of S:

\[P(A_i | B) = \frac{P(B | A_i) \cdot P(A_i)}{\sum_{j} P(B | A_j) \cdot P(A_j)}\]

42.6.1 Terminology

TipBayes’ Terminology
  • Prior P(A_i) — probability before seeing evidence B.
  • Likelihood P(B|A_i) — probability of evidence B given A_i.
  • Posterior P(A_i|B) — updated probability after seeing B.
  • Marginal / Evidence P(B) — total probability of B (denominator).

42.6.2 Classic Example — Diagnostic Test

A disease has prevalence 1 %; test has sensitivity 99 % (P(+|D) = 0.99) and specificity 95 % (P(−|H) = 0.95 → false positive 5 %). What is the probability a positive-tested person actually has the disease?

P(D|+) = (0.99 × 0.01) / [(0.99 × 0.01) + (0.05 × 0.99)] ≈ 0.167 or 16.7 %.

This counter-intuitive result — only ~17 % of positive tests are true positives despite a 99 %-accurate test — is the classic illustration of base-rate neglect.

flowchart LR
  PR[Prior P(A)] -->|update| PO[Posterior P(A|B)]
  L[Likelihood P(B|A)] -->|times prior| PO
  E[Evidence P(B)] -->|divides| PO
    classDef default fill:#003366,color:#ffffff,stroke:#ffcc00,stroke-width:3px,rx:10px,ry:10px;

NoteDistractor warning

PYQs often confuse mutually exclusive with independent. Mutually exclusive means cannot both happen; independent means occurrence of one doesn’t affect probability of the other. They are opposites in most practical cases — if A and B both have positive probability, mutual exclusion implies dependence (knowing A occurred means B did not).

42.7 Practice Questions

Q 01RangeEasy

Probability of an event lies between:

  • A0 and 100
  • B−1 and +1
  • C0 and 1
  • D1 and 10
View solution
Correct Option: C
0 ≤ P(A) ≤ 1.
Q 02KolmogorovMedium

The axiomatic foundation of probability was provided by:

  • APascal
  • BLaplace
  • CKolmogorov (1933)
  • DBayes
View solution
Correct Option: C
**Kolmogorov 1933** — three axioms.
Q 03ApproachesMedium

Match each approach with its proponent:

Approach Proponent
(i) Classical (a) Bayes / Ramsey / de Finetti
(ii) Relative frequency (b) Kolmogorov
(iii) Subjective (c) Laplace
(iv) Axiomatic (d) von Mises
  • A(i)-(c), (ii)-(d), (iii)-(a), (iv)-(b)
  • B(i)-(b), (ii)-(c), (iii)-(d), (iv)-(a)
  • C(i)-(a), (ii)-(d), (iii)-(c), (iv)-(b)
  • D(i)-(d), (ii)-(c), (iii)-(b), (iv)-(a)
View solution
Correct Option: A
Classical — Laplace; Relative-frequency — von Mises; Subjective — Bayes/Ramsey/de Finetti; Axiomatic — Kolmogorov.
Q 04Mutually exclMedium

For two mutually exclusive events A and B:

  • AP(A ∪ B) = P(A) + P(B) − P(A ∩ B)
  • BP(A ∪ B) = P(A) + P(B)
  • CP(A ∪ B) = P(A) × P(B)
  • DP(A ∪ B) = 1
View solution
Correct Option: B
Mutually exclusive ⇒ P(A∩B) = 0, so P(A∪B) = P(A) + P(B).
Q 05IndependentMedium

For two *independent* events:

  • AP(A ∩ B) = P(A) + P(B)
  • BP(A ∩ B) = P(A) × P(B)
  • CP(A ∩ B) = 0
  • DP(A ∩ B) = 1
View solution
Correct Option: B
Independent ⇒ product rule.
Q 06CoinEasy

Probability of getting two heads in two tosses of a fair coin:

  • A1/2
  • B1/3
  • C1/4
  • D1/8
View solution
Correct Option: C
(1/2) × (1/2) = **1/4**.
Q 07DieEasy

P(getting a 6 on a fair die) is:

  • A1/2
  • B1/3
  • C1/4
  • D1/6
View solution
Correct Option: D
One favourable out of six equally likely outcomes.
Q 08CardsMedium

From a standard 52-card deck, P(drawing a King OR a Heart) is:

  • A17/52
  • B16/52
  • C4/52
  • D13/52
View solution
Correct Option: B
P(King) + P(Heart) − P(King of Hearts) = 4/52 + 13/52 − 1/52 = **16/52**.
Q 09BayesMedium

Bayes' theorem allows us to:

  • ACompute prior from posterior
  • BUpdate prior probabilities given new evidence
  • CCompute variance
  • DTest hypotheses
View solution
Correct Option: B
Bayes — posterior = prior × likelihood / evidence.
Q 10Bayes YearHard

Thomas Bayes' "Essay" was published posthumously in:

  • A1654
  • B1763
  • C1812
  • D1933
View solution
Correct Option: B
**1763** — published by Richard Price after Bayes' death.
Q 11ConditionalMedium

P(A|B) = P(A∩B)/P(B). This is the formula for:

  • AJoint probability
  • BConditional probability
  • CMarginal probability
  • DIndependence
View solution
Correct Option: B
**Conditional probability** of A given B.
Q 12Bayes calcHard

Two urns: Urn 1 has 3 red, 2 blue; Urn 2 has 1 red, 4 blue. An urn is chosen randomly and a red ball drawn. P(it was Urn 1):

  • A3/5
  • B3/4
  • C1/2
  • D1/4
View solution
Correct Option: B
P(U1|R) = (0.5 × 0.6)/(0.5 × 0.6 + 0.5 × 0.2) = 0.3/0.4 = **3/4**.
Q 13ComplementEasy

If P(A) = 0.3, then P(Aᶜ) is:

  • A0.3
  • B0.7
  • C1.3
  • D0.0
View solution
Correct Option: B
P(A) + P(Aᶜ) = 1 → P(Aᶜ) = **0.7**.
Q 14PascalMedium

The origins of probability theory are credited to 17th-century correspondence between:

  • ANewton and Leibniz
  • BPascal and Fermat (1654)
  • CBayes and Price
  • DFisher and Pearson
View solution
Correct Option: B
**Pascal-Fermat 1654** correspondence on gambling problems.
Q 15Base rateHard

The famous *base-rate fallacy* arises in:

  • AFrequentist tests
  • BBayesian medical-test interpretation
  • CRandom sampling
  • DRegression analysis
View solution
Correct Option: B
Ignoring base rates (prevalence / prior) when interpreting test results.
Q 16ExhaustiveMedium

Two events are **exhaustive** if:

  • ATheir intersection is empty
  • BTheir union is the sample space
  • CP(A) = P(B)
  • DP(A) + P(B) = 0
View solution
Correct Option: B
**Exhaustive** ⇒ together cover S.
Q 17P oddEasy

P(getting an odd number on a fair die) is:

  • A1/6
  • B1/3
  • C1/2
  • D2/3
View solution
Correct Option: C
3 odd outcomes (1, 3, 5) out of 6 → **1/2**.
Q 18PriorMedium

In Bayes' framework, *prior* means:

  • AProbability after seeing data
  • BProbability before observing the data
  • CSample mean
  • DCoefficient of variation
View solution
Correct Option: B
**Prior** = initial belief; **posterior** = updated belief.
Q 19Two headsMedium

In tossing two dice, P(sum = 7) is:

  • A1/6
  • B1/12
  • C7/36
  • D1/36
View solution
Correct Option: A
6 favourable (1+6, 2+5, 3+4, 4+3, 5+2, 6+1) of 36 → **6/36 = 1/6**.
Q 20Mutually excl vs IndepHard

Two events with positive probability that are *mutually exclusive* must be:

  • AIndependent
  • BDependent
  • CEqual in probability
  • DAlways sum to 1
View solution
Correct Option: B
If both positive and mutually exclusive, knowing one occurred means the other did not — *dependent*.

42.8 Quick Recall

ImportantQuick recall
  • Probability ∈ [0, 1]; language of uncertainty. Origin: Pascal-Fermat 1654.
  • Four approaches: Classical (Laplace), Empirical (von Mises), Subjective (Bayes/Ramsey/de Finetti), Axiomatic (Kolmogorov 1933).
  • Three axioms: non-negativity, normalisation (P(S) = 1), additivity for mutually exclusive events.
  • Addition rule: P(A∪B) = P(A) + P(B) − P(A∩B); mutually exclusive ⇒ no overlap term.
  • Multiplication rule: P(A∩B) = P(A) × P(B|A); independent ⇒ P(A) × P(B).
  • Complement: P(A) + P(Aᶜ) = 1.
  • Bayes (1763): P(A|B) = P(B|A) × P(A) / P(B). Prior × Likelihood / Evidence = Posterior.
  • Base-rate fallacy — ignoring prior probability when interpreting test results.
  • Mutually exclusive ≠ Independent (with positive probabilities, M.E. implies dependence).