Variance, Standard Deviation & Data Representation

Table of Contents

The Logic of Dispersion
#

In Grade 10 you learned to summarise data with the mean, median, and mode — measures of central tendency (where the data clusters). But knowing the centre isn’t enough. Two classes can both average 60% on a test, yet one class has everyone between 55% and 65% while the other has marks scattered from 20% to 100%.

Dispersion measures tell you how spread out the data is. Grade 11 introduces two precise measures: the variance and the standard deviation.

1. Why Not Just Use the Range?
#

The range ($\text{max} - \text{min}$) is a simple measure of spread, but it only uses two data points. One extreme value changes the range dramatically while telling you nothing about what’s happening in between.

We need a measure that uses every data point. The natural idea: measure how far each point is from the mean, then average those distances.

The Problem with Simple Averages of Deviations
#

If you calculate $x - \bar{x}$ for every data point and add them up, you always get zero — the positive deviations cancel the negative ones perfectly. (This is a mathematical property of the mean.)

$x$	$x - \bar{x}$
4	$-4$
7	$-1$
8	$0$
10	$+2$
11	$+3$
Sum	0

The fix: Square every deviation first. Squaring makes everything positive and also penalises large deviations more heavily (which is what we want — a point 10 units away is more concerning than two points each 5 units away).

2. Variance ($\sigma^2$)
#

The variance is the mean of the squared deviations:

$$\boxed{\sigma^2 = \frac{\sum (x_i - \bar{x})^2}{n}}$$

Symbol	Meaning
$x_i$	Each individual data value
$\bar{x}$	The mean of all data values
$n$	The number of data values
$\sigma^2$	The variance

The variance tells you the average squared distance from the mean. Its unit is the square of the original unit (e.g., if data is in cm, variance is in cm²).

3. Standard Deviation ($\sigma$)
#

To get back to the original units, take the square root of the variance:

$$\boxed{\sigma = \sqrt{\frac{\sum (x_i - \bar{x})^2}{n}}}$$

What it means in plain English: On average, each data point sits about $\sigma$ units away from the mean.

Interpreting Standard Deviation
#

$\sigma$ value	What it means
Small $\sigma$	Data is tightly clustered around the mean — consistent
Large $\sigma$	Data is widely spread — variable, less predictable
$\sigma = 0$	Every data value is identical (no spread at all)

The 68% Rule: In roughly symmetric data, about 68% of the data falls within one standard deviation of the mean: $[\bar{x} - \sigma;\; \bar{x} + \sigma]$.

4. Worked Example — Standard Deviation by Hand
#

Data: $4;\; 7;\; 8;\; 10;\; 11$ ($n = 5$)

Step 1 — Calculate the mean:

$$\bar{x} = \frac{4 + 7 + 8 + 10 + 11}{5} = \frac{40}{5} = 8$$

Step 2 — Build the deviation table:

$x_i$	$x_i - \bar{x}$	$(x_i - \bar{x})^2$
4	$-4$	16
7	$-1$	1
8	$0$	0
10	$+2$	4
11	$+3$	9
Sum	0 ✓	30

Check: The deviations column must sum to 0. If it doesn’t, your mean is wrong.

Step 3 — Variance:

$$\sigma^2 = \frac{30}{5} = 6$$

Step 4 — Standard deviation:

$$\sigma = \sqrt{6} \approx 2.45$$

Interpretation: On average, each data point is about 2.45 units away from the mean of 8.

5. Standard Deviation from a Frequency Table
#

When data is given with frequencies, the formula adjusts to account for repeated values:

$$\boxed{\sigma = \sqrt{\frac{\sum f_i(x_i - \bar{x})^2}{\sum f_i}}}$$

The mean is: $\bar{x} = \frac{\sum f_i x_i}{\sum f_i}$

Worked Example — Frequency Table
#

Test scores for 20 learners:

Score ($x_i$)	Frequency ($f_i$)	$f_i \cdot x_i$
3	2	6
5	5	25
7	8	56
9	4	36
10	1	10
Total	20	133

Step 1 — Mean: $\bar{x} = \frac{133}{20} = 6.65$

Step 2 — Squared deviations × frequency:

$x_i$	$f_i$	$x_i - \bar{x}$	$(x_i - \bar{x})^2$	$f_i(x_i - \bar{x})^2$
3	2	$-3.65$	$13.3225$	$26.645$
5	5	$-1.65$	$2.7225$	$13.6125$
7	8	$0.35$	$0.1225$	$0.98$
9	4	$2.35$	$5.5225$	$22.09$
10	1	$3.35$	$11.2225$	$11.2225$
Total	20			74.55

Step 3 — Variance: $\sigma^2 = \frac{74.55}{20} = 3.7275$

Step 4 — Standard deviation: $\sigma = \sqrt{3.7275} \approx 1.93$

6. Grouped Data (Class Intervals)
#

When data is given in intervals (e.g., 10–20, 20–30, …), you don’t know the exact values. Use the midpoint of each class as $x_i$:

$$\text{midpoint} = \frac{\text{lower boundary} + \text{upper boundary}}{2}$$

Worked Example — Grouped Data
#

Heights of 50 Grade 11 learners:

Height (cm)	Midpoint ($x_i$)	Frequency ($f_i$)	$f_i \cdot x_i$
140–150	145	4	580
150–160	155	12	1860
160–170	165	20	3300
170–180	175	10	1750
180–190	185	4	740
Total		50	8230

Mean: $\bar{x} = \frac{8230}{50} = 164.6$ cm

Now calculate $\sum f_i(x_i - \bar{x})^2$:

$x_i$	$f_i$	$(x_i - 164.6)^2$	$f_i(x_i - 164.6)^2$
145	4	$384.16$	$1536.64$
155	12	$92.16$	$1105.92$
165	20	$0.16$	$3.20$
175	10	$108.16$	$1081.60$
185	4	$416.16$	$1664.64$
Total	50		5392.00

$$\sigma = \sqrt{\frac{5392}{50}} = \sqrt{107.84} \approx 10.38 \text{ cm}$$

Interpretation: The heights are spread about 10.4 cm either side of the mean on average.

⚠️ Grouped data gives an approximation — we assumed all values in each class are at the midpoint. The true $\sigma$ may differ slightly.

7. Histograms, Frequency Polygons & Ogives
#

Histograms
#

Class intervals on the x-axis, frequency on the y-axis
Bars touch (no gaps) — the data is continuous
The tallest bar = the modal class (the interval with the highest frequency)
Bar width equals the class width

Frequency Polygons
#

Plot the midpoint of each class against its frequency
Connect the points with straight lines
Extend to the x-axis one class interval before and after the data (so the polygon closes)

Ogives (Cumulative Frequency Curves)
#

An ogive shows the running total of frequencies. It answers the question: “How many data values are less than or equal to this boundary?”

How to draw an ogive:

Create a cumulative frequency column (running total)
Plot cumulative frequency against the upper boundary of each class
Start with the point (lower boundary of first class, 0)
Connect with a smooth S-shaped curve

Worked Example — Reading an Ogive
#

Using the height data from Section 6:

Upper boundary	Cumulative frequency
140	0
150	4
160	16
170	36
180	46
190	50

Reading values from the ogive:

Median ($Q_2$): at $\frac{50}{2} = 25$ → read across from 25 on the y-axis to the curve → read down to get approximately 164 cm
$Q_1$: at $\frac{50}{4} = 12.5$ → approximately 158 cm
$Q_3$: at $\frac{3 \times 50}{4} = 37.5$ → approximately 172 cm
IQR = $172 - 158 = 14$ cm

Exam tip: When a question says “use the ogive to estimate the median,” draw horizontal and vertical dashed lines on the graph and label the coordinates. Show your working on the graph — marks are awarded for this.

8. Comparing Two Datasets
#

A very common exam question: “Compare the performance of Class A and Class B.”

The Strategy
#

Compare…	Using…
Centre (who did better overall?)	Mean or median
Spread (who was more consistent?)	Standard deviation or IQR

Worked Example — Comparing Classes
#

Class A: $\bar{x} = 62$, $\sigma = 8$
Class B: $\bar{x} = 62$, $\sigma = 15$

Centre: Both classes have the same mean (62%), so their overall performance was the same.

Spread: Class A has a much lower standard deviation ($\sigma = 8$ vs $\sigma = 15$), so Class A’s marks were more consistent (clustered closer to the mean). Class B had a wider range of marks — some very high, some very low.

Another Example — Different Means
#

Team X: $\bar{x} = 45$, $\sigma = 5$
Team Y: $\bar{x} = 52$, $\sigma = 12$

Centre: Team Y scored higher on average (52 vs 45).

Spread: Team X was more consistent ($\sigma = 5$ vs $\sigma = 12$). Team Y had higher scores overall but with much more variation between players.

Exam phrasing: “The data for [X] is more consistent/spread out than [Y] because the standard deviation is lower/higher.”

9. Symmetric vs Skewed Data
#

The shape of the data distribution tells you about the relationship between mean and median.

Shape	Mean vs Median	Box plot clue	Histogram clue
Symmetric	Mean ≈ Median	Median centred in box, whiskers roughly equal	Bell-shaped, mirror image
Positively skewed (right)	Mean > Median	Median closer to $Q_1$, long right whisker	Tail stretches to the right
Negatively skewed (left)	Mean < Median	Median closer to $Q_3$, long left whisker	Tail stretches to the left

Why the mean shifts: The mean is pulled toward extreme values (outliers/tail), while the median stays in the middle of the ranked data. So in positively skewed data, a few very high values drag the mean above the median.

Which Measure to Use?
#

Data shape	Best centre measure	Best spread measure
Symmetric	Mean	Standard deviation
Skewed	Median	IQR

10. Outliers
#

An outlier is a data value that is unusually far from the rest. The formal test:

$$\boxed{x < Q_1 - 1.5 \times IQR \quad \text{or} \quad x > Q_3 + 1.5 \times IQR}$$

Where $IQR = Q_3 - Q_1$.

Worked Example — Detecting Outliers
#

Data: $2;\; 15;\; 18;\; 20;\; 22;\; 23;\; 25;\; 27;\; 28;\; 30;\; 65$

$Q_1 = 18$, $Q_3 = 28$, $IQR = 10$

Lower fence: $18 - 1.5(10) = 3$

Upper fence: $28 + 1.5(10) = 43$

Any value below 3 or above 43 is an outlier.

$x = 2$: below 3 → outlier ✓
$x = 65$: above 43 → outlier ✓

Impact of Outliers on Statistics
#

Measure	Affected by outliers?
Mean	Yes — dragged toward the outlier
Median	No — only depends on the middle value(s)
Standard deviation	Yes — outliers inflate squared deviations
IQR	No — only depends on $Q_1$ and $Q_3$
Range	Yes — completely determined by extremes

Exam tip: If a question asks “which measure of central tendency is more appropriate?”, check for skewness or outliers. If present, the median is better than the mean.

🚨 Common Mistakes
#

Mistake	Why it’s wrong	Fix
Using $s_x$ instead of $\sigma_x$ on the calculator	$s_x$ is the sample standard deviation (divides by $n-1$); school maths uses $\sigma_x$ (divides by $n$)	Always select $\sigma_x$ in STAT mode
Forgetting to multiply by frequency	In a frequency table, each $(x - \bar{x})^2$ must be weighted by its frequency	Always include the $f_i$ column in your table
Forgetting the square root	$\sigma^2$ is the variance; $\sigma$ is the standard deviation	The last step is always $\sigma = \sqrt{\sigma^2}$
Using midpoints as exact values	Grouped data uses midpoints as estimates, not exact values	Acknowledge “estimated” in your answer
Reading ogive at the wrong axis	The median is at cumulative frequency $\frac{n}{2}$, not at $x = \frac{n}{2}$	Start on the y-axis, go across to the curve, then down to the x-axis
Confusing skewness direction	“Positively skewed” means the tail goes to the right (positive direction)	Remember: the name matches the direction of the tail, not the peak
Stating “data is consistent” without referencing $\sigma$	You must mention the standard deviation value to justify a consistency claim	“The data is more consistent because $\sigma = 5$ is lower than $\sigma = 12$”

💡 Pro Tips for Exams
#

1. Calculator Method
#

Use STAT mode: enter all data → select 1-VAR stats → read $\bar{x}$ and $\sigma_x$ directly. The by-hand method is for understanding and for “show your working” questions; the calculator is for speed.

2. The “Deviation Sum = 0” Check
#

After calculating all your deviations ($x_i - \bar{x}$), add them up. If the sum isn’t 0, your mean is wrong. Fix it before continuing.

3. Comparing Datasets — The Two-Sentence Template
#

When asked to compare two datasets, always make two statements:

Centre: “Class A performed better/worse on average because [mean/median] = …”
Spread: “Class A was more/less consistent because [σ/IQR] = …”

This guarantees full marks on comparison questions.

4. Reading Ogives Accurately
#

Always draw dashed lines on the ogive when reading values. Start from the y-axis (cumulative frequency), draw a horizontal line to the curve, then a vertical line down to the x-axis. Label both coordinates.

🔗 Related Grade 11 topics:
Combined Events & Probability — contingency tables link statistics and probability
Quadratic Equations — the variance formula involves squaring, a key algebraic skill

🏠 Back to Statistics

The Logic of Dispersion#

1. Why Not Just Use the Range?#

The Problem with Simple Averages of Deviations#

2. Variance ($\sigma^2$)#

3. Standard Deviation ($\sigma$)#

Interpreting Standard Deviation#

4. Worked Example — Standard Deviation by Hand#

5. Standard Deviation from a Frequency Table#

Worked Example — Frequency Table#

6. Grouped Data (Class Intervals)#

Worked Example — Grouped Data#

7. Histograms, Frequency Polygons & Ogives#

Histograms#

Frequency Polygons#

Ogives (Cumulative Frequency Curves)#

Worked Example — Reading an Ogive#

8. Comparing Two Datasets#

The Strategy#

Worked Example — Comparing Classes#

Another Example — Different Means#

9. Symmetric vs Skewed Data#

Which Measure to Use?#

10. Outliers#

Worked Example — Detecting Outliers#

Impact of Outliers on Statistics#

🚨 Common Mistakes#

💡 Pro Tips for Exams#

1. Calculator Method#

2. The “Deviation Sum = 0” Check#

3. Comparing Datasets — The Two-Sentence Template#

4. Reading Ogives Accurately#

Related