38  Measures of central tendency

38.1 Concept of Central Tendency

A measure of central tendency is “a single value that attempts to describe a set of data by identifying the central position within that data” (Croxton & Cowden). It is the typical or representative value around which observations cluster. Three classical measures are mathematicalArithmetic Mean, Geometric Mean, Harmonic Mean — and two are positionalMedian and Mode. Each measure has its specific assumptions, properties, and use-cases. A good measure should be: easy to understand, rigidly defined, based on all observations, capable of further algebraic treatment, not unduly affected by extreme values, and possible to determine even for open-end classes.

38.2 Arithmetic Mean (AM)

The AM is the most commonly used measure: sum of observations divided by the number of observations.

TipArithmetic Mean — Formulae
Type of data Formula
Individual series \(\bar{X} = \frac{\sum X_i}{N}\)
Discrete (frequency) \(\bar{X} = \frac{\sum f_i X_i}{\sum f_i}\)
Continuous (class intervals) \(\bar{X} = \frac{\sum f_i m_i}{\sum f_i}\) where \(m_i\) = mid-point
Weighted \(\bar{X}_w = \frac{\sum w_i X_i}{\sum w_i}\)

38.2.1 Properties of AM

TipProperties of AM
  • Sum of deviations from the mean is zero: \(\sum (X - \bar{X}) = 0\).
  • Sum of squared deviations from the mean is minimum.
  • Affected by every observation — including extreme values.
  • Algebraic combination: AM of pooled groups computable from group means and sizes.
  • Most rigorously defined; widely used in further statistical analysis.

38.2.2 Limitations of AM

  • Severely affected by extreme values (outliers).
  • Cannot be computed for open-end class intervals.
  • Not suitable for qualitative data.
  • Can be misleading — “average Indian family has 1.8 children”.

38.3 Median

The median is the middle value when observations are arranged in ascending or descending order. It divides the data into two equal halves.

TipMedian — Formulae
Data Formula
Ungrouped, odd N Middle value, i.e., \((N+1)/2\)-th
Ungrouped, even N Average of \(N/2\)-th and \((N/2 + 1)\)-th
Continuous \(Median = L + \frac{N/2 - cf}{f} \times h\) where L = lower limit of median class, cf = cumulative freq before, f = freq of median class, h = class width
TipProperties of Median
  • Positional measure — depends on rank, not magnitude.
  • Not affected by extreme values — robust.
  • Can be computed for open-end intervals.
  • Suitable for qualitative ordinal data (rankings).
  • Limitation: not amenable to further algebraic treatment.

38.4 Quartiles, Deciles, Percentiles

These are positional measures that generalise the median: - Quartiles (Q₁, Q₂, Q₃) divide data into 4 equal parts; Q₂ = Median. - Deciles (D₁ … D₉) divide into 10 equal parts. - Percentiles (P₁ … P₉₉) divide into 100 equal parts.

38.5 Mode

The mode is the value that occurs most frequently — derived from French à la mode (“fashionable”).

TipMode — Formulae
Data Working
Ungrouped Inspect; the most frequent value
Continuous \(Mode = L + \frac{f_1 - f_0}{2f_1 - f_0 - f_2} \times h\) where L = lower limit of modal class, f₁ = modal-class freq, f₀ = preceding, f₂ = succeeding, h = width
TipProperties of Mode
  • Not affected by extreme values.
  • Can be calculated for qualitative data.
  • May be non-existent or non-unique (bimodal, multimodal).
  • Suitable for categorical and nominal data.

38.6 Empirical Relationship (Karl Pearson)

For moderately skewed distributions:

\[Mode = 3 \times Median - 2 \times Mean\]

This is Karl Pearson’s empirical relationship.

38.7 Geometric Mean (GM)

The GM is the n-th root of the product of n observations:

\[GM = \sqrt[n]{X_1 \cdot X_2 \cdot \ldots \cdot X_n} = \left(\prod_{i=1}^{n} X_i\right)^{1/n}\]

Logarithmic form for computation: \[\log GM = \frac{\sum \log X_i}{N}\]

Use cases: averaging ratios, rates, index numbers, compound growth rates (CAGR).

38.7.1 CAGR

\[CAGR = \left(\frac{\text{End Value}}{\text{Begin Value}}\right)^{1/n} - 1\]

38.8 Harmonic Mean (HM)

The HM is the reciprocal of the AM of reciprocals:

\[HM = \frac{N}{\sum (1/X_i)}\]

Use cases: averaging rates (speeds for equal distances), prices for equal expenditures.

38.9 Relationship — AM ≥ GM ≥ HM

For any set of positive observations: \[AM \geq GM \geq HM\]

with equality only when all observations are equal.

Also: \(GM^2 = AM \times HM\).

38.10 Choice of Measure

TipWhen to Use Which Measure
Situation Best measure
Symmetric, no outliers, further computation needed AM
Skewed distribution or outliers Median
Categorical or modal-value question Mode
Averaging growth rates / ratios GM
Averaging rates over equal distances HM
Open-end class intervals Median or Mode
Quick eyeball estimate Mode

flowchart TB
  CT[Central Tendency] --> MA[Mathematical]
  CT --> PO[Positional]
  MA --> AM[Arithmetic Mean]
  MA --> GM[Geometric Mean]
  MA --> HM[Harmonic Mean]
  PO --> M[Median]
  PO --> MO[Mode]
  PO --> Q[Quartiles, Deciles, Percentiles]
    classDef default fill:#003366,color:#ffffff,stroke:#ffcc00,stroke-width:3px,rx:10px,ry:10px;

NoteDistractor warning

PYQ trap: Empirical relation is Mode = 3 Median − 2 Mean (Karl Pearson). Don’t reverse the multipliers.

38.11 Practice Questions

Q 01 AM Easy

The arithmetic mean of 5, 7, 9, 11, 13 is:

  • A7
  • B9
  • C11
  • D13
View solution
Correct Option: B
(5+7+9+11+13)/5 = 45/5 = **9**.
Q 02 Median Easy

The median of 3, 5, 7, 8, 11, 15, 18 is:

  • A7
  • B8
  • C11
  • D9.5
View solution
Correct Option: B
7 values → middle = 4th value = **8**.
Q 03 Mode Easy

The mode of 2, 3, 3, 5, 5, 5, 6, 8 is:

  • A3
  • B5
  • C6
  • D8
View solution
Correct Option: B
**5** occurs three times — most frequent.
Q 04 Empirical Medium

Karl Pearson's empirical relation is:

  • AMean = 3 Median − 2 Mode
  • BMode = 3 Median − 2 Mean
  • CMedian = 2 Mean + Mode
  • DMode = Mean − Median
View solution
Correct Option: B
**Mode = 3 Median − 2 Mean** for moderately skewed data.
Q 05 AM property Medium

The sum of deviations of observations from the AM is:

  • AAlways positive
  • BAlways negative
  • CZero
  • DEqual to the AM
View solution
Correct Option: C
$\sum (X - \bar{X}) = 0$.
Q 06 Outlier Medium

Which measure is **least affected** by extreme values?

  • AAM
  • BGM
  • CHM
  • DMedian
View solution
Correct Option: D
**Median** is positional, robust to outliers.
Q 07 CAGR Medium

Compound annual growth rate of an asset growing from ₹100 to ₹121 in 2 years is:

  • A10 %
  • B10.5 %
  • C11 %
  • D21 %
View solution
Correct Option: A
(121/100)^(1/2) − 1 = 1.1 − 1 = **10 %**.
Q 08 HM use Medium

A traveller covers equal distances at 40 kmph and 60 kmph. The average speed is best measured by:

  • AAM = 50
  • BHM = 48
  • CGM ≈ 48.99
  • DMedian = 50
View solution
Correct Option: B
For equal distances at different speeds, **HM** is correct. HM = 2/(1/40 + 1/60) = 2/(0.0417) = **48 kmph**.
Q 09 Inequality Hard

For any set of positive observations:

  • AAM ≤ GM ≤ HM
  • BAM ≥ GM ≥ HM
  • CAM = GM = HM
  • DGM ≥ AM ≥ HM
View solution
Correct Option: B
**AM ≥ GM ≥ HM**; equality only when all values equal.
Q 10 GM-AM-HM Hard

The relationship between GM, AM, and HM is:

  • AGM² = AM × HM
  • BAM² = GM × HM
  • CHM² = AM × GM
  • DAM = GM = HM
View solution
Correct Option: A
**GM² = AM × HM**.
Q 11 Positional Medium

Which is a **positional** measure of central tendency?

  • AAM
  • BGM
  • CHM
  • DMedian
View solution
Correct Option: D
Median and Mode are **positional**; AM, GM, HM are mathematical.
Q 12 Mode Medium

For a data set with two values of equal highest frequency, the distribution is:

  • AUnimodal
  • BBimodal
  • CSymmetric
  • DMultimodal beyond 2
View solution
Correct Option: B
Two modes ⇒ **bimodal** distribution.
Q 13 Quartile Medium

Q₂ (the second quartile) is the same as:

  • AAM
  • BMedian
  • CMode
  • DGM
View solution
Correct Option: B
Q₂ = 50th percentile = **Median**.
Q 14 Symmetry Medium

For a perfectly symmetric distribution:

  • AMean = Median = Mode
  • BMean > Median > Mode
  • CMean < Median < Mode
  • DMode = Mean only
View solution
Correct Option: A
In symmetric distribution all three coincide.
Q 15 Skew Medium

For a *positively* (right) skewed distribution:

  • AMean > Median > Mode
  • BMode > Median > Mean
  • CMean = Median = Mode
  • DMean < Median
View solution
Correct Option: A
Right skew → tail pulls **Mean** highest; Mode lowest.
Q 16 Index Medium

For averaging *index numbers* and *growth rates*, the best measure is:

  • AAM
  • BGM
  • CHM
  • DMode
View solution
Correct Option: B
**GM** preferred for ratios, percentages, and compound growth.
Q 17 AM compute Medium

Two groups have AMs 30 and 50 with 40 and 60 items. Combined AM:

  • A35
  • B40
  • C42
  • D45
View solution
Correct Option: C
(40×30 + 60×50)/100 = (1200 + 3000)/100 = **42**.
Q 18 Open-end Medium

For open-end class intervals, the most appropriate measure is:

  • AAM
  • BGM
  • CHM
  • DMedian
View solution
Correct Option: D
**Median or Mode** can be computed without knowing extreme class boundaries.
Q 19 Categorical Easy

Best measure for **qualitative / categorical** data (e.g., favourite colour):

  • AMean
  • BMedian
  • CMode
  • DGM
View solution
Correct Option: C
**Mode** — the only measure meaningful for nominal data.
Q 20 Use Medium

Match each measure with its best use-case:

Measure Use-case
(i) AM (a) Averaging growth rates / index numbers
(ii) GM (b) Skewed distribution; outliers present
(iii) Median (c) Symmetric distribution; further computation
(iv) HM (d) Speeds over equal distances
  • A(i)-(c), (ii)-(a), (iii)-(b), (iv)-(d)
  • B(i)-(a), (ii)-(b), (iii)-(c), (iv)-(d)
  • C(i)-(b), (ii)-(d), (iii)-(a), (iv)-(c)
  • D(i)-(d), (ii)-(c), (iii)-(b), (iv)-(a)
View solution
Correct Option: A
AM — symmetric; GM — index/growth; Median — skewed; HM — speeds.

38.12 Quick Recall

ImportantQuick recall
  • Central tendency = single representative value (Croxton & Cowden).
  • Mathematical: AM, GM, HM; Positional: Median, Mode (+ quartiles, deciles, percentiles).
  • AM = Σ X/N; properties — Σ(X − X̄) = 0; minimises Σ(X − X̄)²; sensitive to outliers.
  • Median — middle value; positional; not affected by outliers; usable for open-end / ordinal data.
  • Mode — most frequent; suitable for categorical/nominal.
  • Karl Pearson’s empirical relation: Mode = 3 Median − 2 Mean (moderately skewed).
  • GM — for ratios, growth rates, CAGR; log form for computation.
  • HM — for speeds / rates over equal distance/expenditure.
  • Inequality: AM ≥ GM ≥ HM, equality when all equal. GM² = AM × HM.
  • Skew: Right (positive) → Mean > Median > Mode; Left → reverse; Symmetric → all equal.