From Real Exams Exam Paper

A Level H1 Mathematics Practice Paper 4

Free Exam-Derived Owl Alpha A Level H1 Mathematics Practice Paper 4 practice paper with questions and answers for Singapore students. This page is rendered as a direct URL so the questions and answers can be discovered without pressing in-page buttons.

These static practice materials are generated from the site's syllabus and paper-generation workflow, with source and model context shown so students and parents can evaluate the material before use.

A Level H1 Mathematics From Real Exams Generated by Owl Alpha Updated 2026-06-07

Questions

<!-- TuitionGoWhere generation metadata: stage=3-1; model=openrouter/owl-alpha; model_label=Owl Alpha; generated=2026-06-06; Sources: Stage 2-1 real exam-derived templates and Stage 2-2 exam-enriched syllabus. -->

TuitionGoWhere Practice Paper - Maths H1 A-Level

TuitionGoWhere Secondary School (AI)

Subject:Mathematics
Level:A-Level H1
Paper:Practice Paper — Statistics & Probability
Version:4 of 5
Duration:1 hour 30 minutes
Total Marks:70
Name:________________________
Class:________________________
Date:________________________

Instructions:

  • Write your answers in the spaces provided.
  • Show all working clearly. Marks may be awarded for correct working even if the final answer is wrong.
  • Give non-exact answers correct to 3 significant figures unless otherwise stated.
  • A graphing calculator may be used where appropriate.
  • The total marks for this paper is 70.
  • The number of marks available for each question or part-question is shown in brackets [ ].

Section A: Pure Statistics (30 marks)

Answer all questions in this space.


Question 1 (2 marks)

A random sample of 6 students recorded the number of hours they spent on revision in a week:

12, 15, 10, 14, 11, 1312,\ 15,\ 10,\ 14,\ 11,\ 13

Calculate the unbiased estimate of the population mean.

xˉ=xin\bar{x} = \frac{\sum x_i}{n}

[Answer space]


Question 2 (3 marks)

Using the data from Question 1, calculate the unbiased estimate of the population variance.

s2=(xixˉ)2n1s^2 = \frac{\sum(x_i - \bar{x})^2}{n-1}

[Answer space]


Question 3 (3 marks)

The daily commute times (in minutes) for a sample of employees at a company are summarised as follows:

x=840,x2=42,600,n=20\sum x = 840,\quad \sum x^2 = 42{,}600,\quad n = 20

(a) Find the sample mean. [1]

(b) Find the unbiased estimate of the population variance. [2]

[Answer space]


Question 4 (4 marks)

A discrete random variable XX has the following probability distribution:

xx12345
P(X=x)P(X=x)0.10.20.3aa0.1

(a) Find the value of aa. [1]

(b) Find E(X)\text{E}(X). [1]

(c) Find Var(X)\text{Var}(X). [2]

[Answer space]


Question 5 (3 marks)

A fair six-sided die is rolled 4 times. Let XX be the number of times a prime number (2, 3, or 5) appears.

(a) State the distribution of XX, including its parameters. [1]

(b) Find P(X=2)P(X = 2). [2]

[Answer space]


Question 6 (4 marks)

The masses of a certain type of apple are normally distributed with mean 150 g150\text{ g} and standard deviation 20 g20\text{ g}.

(a) Find the probability that a randomly chosen apple has a mass between 130 g130\text{ g} and 170 g170\text{ g}. [2]

(b) A random sample of 9 apples is selected. Find the probability that the sample mean mass exceeds 155 g155\text{ g}. [2]

[Answer space]


Question 7 (3 marks)

A factory produces light bulbs, and 5%5\% are defective. A quality control inspector randomly selects 200 bulbs.

Using a suitable approximation, find the probability that at most 12 bulbs are defective.

[Answer space]


Question 8 (4 marks)

The following table shows the marks obtained by 80 students in a mathematics test.

Mark rangeFrequency
0x<200 \leq x < 206
20x<4020 \leq x < 4014
40x<6040 \leq x < 6022
60x<8060 \leq x < 8026
80x10080 \leq x \leq 10012

(a) Estimate the mean mark. [2]

(b) Estimate the standard deviation of the marks. [2]

[Answer space]


Question 9 (4 marks)

A continuous random variable XX has probability density function given by

f(x)={kx(4x)0x40otherwisef(x) = \begin{cases} kx(4 - x) & 0 \leq x \leq 4 \\ 0 & \text{otherwise} \end{cases}

(a) Show that k=332k = \dfrac{3}{32}. [2]

(b) Find E(X)\text{E}(X). [2]

[Answer space]


Section B: Regression & Correlation (20 marks)

Answer all questions in this space.


Question 10 (5 marks)

A researcher investigates the relationship between the number of hours of weekly exercise (xx) and resting heart rate in beats per minute (yy) for 10 individuals. The following summary statistics are obtained:

n=10,x=50,y=720,x2=330,y2=52,200,xy=3,480n = 10,\quad \sum x = 50,\quad \sum y = 720,\quad \sum x^2 = 330,\quad \sum y^2 = 52{,}200,\quad \sum xy = 3{,}480

(a) Calculate the product moment correlation coefficient rr. [3]

(b) Interpret the value of rr in context. [1]

(c) Comment on whether a linear model is appropriate based on this value. [1]

[Answer space]


Question 11 (5 marks)

Using the data from Question 10:

(a) Find the equation of the regression line of yy on xx in the form y=a+bxy = a + bx. [3]

(b) Estimate the resting heart rate for a person who exercises 7 hours per week. [1]

(c) Explain why it would be unreliable to use this regression line to estimate the resting heart rate for someone who exercises 20 hours per week. [1]

[Answer space]


Question 12 (5 marks)

The table below shows the advertising expenditure (in thousands of dollars) and the corresponding monthly revenue (in thousands of dollars) for a small business over 6 months.

MonthAdvertising (xx)Revenue (yy)
12.015
23.522
31.512
44.025
52.518
65.030

(a) Calculate the equation of the regression line of yy on xx. [3]

(b) Draw a scatter diagram for the data. [2]

<image_placeholder> id: Q12-fig1 type: graph linked_question: Q12 description: Scatter diagram with advertising expenditure (x-axis, range 0 to 6, in thousands of dollars) and revenue (y-axis, range 0 to 35, in thousands of dollars). Plot the six data points: (2.0, 15), (3.5, 22), (1.5, 12), (4.0, 25), (2.5, 18), (5.0, 30). Include the regression line y = 5.6 + 4.8x drawn through the data. labels: x-axis: Advertising expenditure (1000),yaxis:Revenue(1000), y-axis: Revenue (1000), title: Scatter diagram of advertising vs revenue values: Data points as listed above; regression line equation y = 5.6 + 4.8x must_show: All six plotted points, both axes with labels and scale, regression line, title </image_placeholder>

[Answer space]


Question 13 (5 marks)

A random variable XPoisson(λ)X \sim \text{Poisson}(\lambda). It is given that P(X=0)=0.0498P(X = 0) = 0.0498.

(a) Find the value of λ\lambda. [2]

(b) Find P(X3)P(X \geq 3). [3]

[Answer space]


Section C: Applied Probability & Normal Distribution (20 marks)

Answer all questions in this space.


Question 14 (4 marks)

The heights of adult women in a city are normally distributed with mean 162 cm162\text{ cm} and standard deviation 6 cm6\text{ cm}.

(a) Find the probability that a randomly selected woman has a height greater than 170 cm170\text{ cm}. [2]

(b) Find the height that is exceeded by 15%15\% of women. [2]

[Answer space]


Question 15 (4 marks)

A bag contains 5 red balls, 4 blue balls, and 3 green balls. Three balls are drawn at random without replacement.

(a) Find the probability that all three balls are red. [2]

(b) Find the probability that exactly two balls are of the same colour. [2]

[Answer space]


Question 16 (4 marks)

The time taken by a candidate to complete an online aptitude test follows a normal distribution with mean 45 minutes and standard deviation 8 minutes.

(a) Find the probability that a randomly chosen candidate takes between 40 and 50 minutes. [2]

(b) In a group of 50 candidates, find the expected number who take more than 55 minutes. [2]

[Answer space]


Question 17 (4 marks)

A call centre receives an average of 3.5 calls per minute.

(a) State an appropriate distribution to model the number of calls received in a given minute. Give a reason for your choice. [1]

(b) Find the probability of receiving exactly 5 calls in a given minute. [2]

(c) Find the probability of receiving at least 2 calls in a given minute. [1]

[Answer space]


Question 18 (4 marks)

The weights of packets of rice are normally distributed with mean 500 g500\text{ g} and standard deviation σ g\sigma\text{ g}. It is known that 2.5%2.5\% of packets weigh less than 480 g480\text{ g}.

(a) Find the value of σ\sigma. [2]

(b) Find the probability that a randomly chosen packet weighs between 490 g490\text{ g} and 515 g515\text{ g}. [2]

[Answer space]


End of Paper

Answers

<!-- TuitionGoWhere generation metadata: stage=3-1; model=openrouter/owl-alpha; model_label=Owl Alpha; generated=2026-06-06; Sources: Stage 2-1 real exam-derived templates and Stage 2-2 exam-enriched syllabus. -->

TuitionGoWhere Practice Paper — Maths H1 A-Level

Answer Key — Version 4 of 5


Section A: Pure Statistics


Question 1 (2 marks)

Answer: xˉ=12.5\bar{x} = 12.5

Working:

xˉ=12+15+10+14+11+136=756=12.5\bar{x} = \frac{12 + 15 + 10 + 14 + 11 + 13}{6} = \frac{75}{6} = 12.5

Teaching notes: The unbiased estimate of the population mean is simply the sample mean. Add all data values and divide by the number of observations nn. This is the best single-number estimate of the true population mean from sample data.

Marking: M1 for correct substitution into the formula. A1 for the correct answer 12.5.


Question 2 (3 marks)

Answer: s2=3.5s^2 = 3.5

Working:

First compute each deviation from the mean (xˉ=12.5\bar{x} = 12.5):

xix_ixixˉx_i - \bar{x}(xixˉ)2(x_i - \bar{x})^2
120.5-0.50.25
152.56.25
102.5-2.56.25
141.52.25
111.5-1.52.25
130.50.25

(xixˉ)2=0.25+6.25+6.25+2.25+2.25+0.25=17.5\sum(x_i - \bar{x})^2 = 0.25 + 6.25 + 6.25 + 2.25 + 2.25 + 0.25 = 17.5

s2=17.561=17.55=3.5s^2 = \frac{17.5}{6 - 1} = \frac{17.5}{5} = 3.5

Teaching notes: The key distinction is using n1=5n - 1 = 5 in the denominator (not n=6n = 6). This gives the unbiased estimate of population variance. Using nn would give a biased estimate that systematically underestimates the true population variance. The n1n-1 correction (Bessel's correction) accounts for the fact that we use the sample mean xˉ\bar{x} (which itself is estimated from the data) rather than the true population mean.

Marking: M1 for correct deviations or sum of squared deviations. M1 for using n1n-1 in the denominator. A1 for the correct answer 3.5.

Common mistake: Using n=6n = 6 gives 17.5/62.9217.5/6 \approx 2.92, which is the biased estimate and would lose the final mark.


Question 3 (3 marks)

(a) [1 mark]

Answer: xˉ=42\bar{x} = 42

xˉ=xn=84020=42\bar{x} = \frac{\sum x}{n} = \frac{840}{20} = 42

(b) [2 marks]

Answer: s2=84s^2 = 84

Working:

s2=x2(x)2nn1=42,600(840)22019s^2 = \frac{\sum x^2 - \frac{(\sum x)^2}{n}}{n-1} = \frac{42{,}600 - \frac{(840)^2}{20}}{19}

=42,600705,6002019=42,60035,28019=7,32019385.26= \frac{42{,}600 - \frac{705{,}600}{20}}{19} = \frac{42{,}600 - 35{,}280}{19} = \frac{7{,}320}{19} \approx 385.26

Wait — let me recalculate:

(840)220=705,60020=35,280\frac{(840)^2}{20} = \frac{705{,}600}{20} = 35{,}280

x2(x)2n=42,60035,280=7,320\sum x^2 - \frac{(\sum x)^2}{n} = 42{,}600 - 35{,}280 = 7{,}320

s2=7,32019385.3s^2 = \frac{7{,}320}{19} \approx 385.3

Answer: s2385.3s^2 \approx 385.3

Teaching notes: When given summary statistics (x\sum x, x2\sum x^2) rather than raw data, use the computational formula for unbiased variance: s2=1n1(x2(x)2n)s^2 = \frac{1}{n-1}\left(\sum x^2 - \frac{(\sum x)^2}{n}\right). This avoids calculating individual deviations. The term (x)2n\frac{(\sum x)^2}{n} is sometimes called the "correction term."

Marking: (a) A1 for 42. (b) M1 for correct substitution into the formula. A1 for 385.3 (or exact fraction 732019\frac{7320}{19}).


Question 4 (4 marks)

(a) [1 mark]

Answer: a=0.3a = 0.3

Working: Probabilities must sum to 1:

0.1+0.2+0.3+a+0.1=10.1 + 0.2 + 0.3 + a + 0.1 = 1 0.7+a=1    a=0.30.7 + a = 1 \implies a = 0.3

(b) [1 mark]

Answer: E(X)=3.0\text{E}(X) = 3.0

Working:

E(X)=xP(X=x)=1(0.1)+2(0.2)+3(0.3)+4(0.3)+5(0.1)\text{E}(X) = \sum x \cdot P(X=x) = 1(0.1) + 2(0.2) + 3(0.3) + 4(0.3) + 5(0.1) =0.1+0.4+0.9+1.2+0.5=3.1= 0.1 + 0.4 + 0.9 + 1.2 + 0.5 = 3.1

Answer: E(X)=3.1\text{E}(X) = 3.1

(c) [2 marks]

Answer: Var(X)=1.29\text{Var}(X) = 1.29

Working:

E(X2)=12(0.1)+22(0.2)+32(0.3)+42(0.3)+52(0.1)\text{E}(X^2) = 1^2(0.1) + 2^2(0.2) + 3^2(0.3) + 4^2(0.3) + 5^2(0.1) =0.1+0.8+2.7+4.8+2.5=10.9= 0.1 + 0.8 + 2.7 + 4.8 + 2.5 = 10.9

Var(X)=E(X2)[E(X)]2=10.9(3.1)2=10.99.61=1.29\text{Var}(X) = \text{E}(X^2) - [\text{E}(X)]^2 = 10.9 - (3.1)^2 = 10.9 - 9.61 = 1.29

Teaching notes: For a discrete random variable, E(X)\text{E}(X) is the probability-weighted average of all possible values. Var(X)\text{Var}(X) measures the spread of the distribution. The formula Var(X)=E(X2)[E(X)]2\text{Var}(X) = \text{E}(X^2) - [\text{E}(X)]^2 is often easier than computing (xiμ)2P(X=xi)\sum(x_i - \mu)^2 \cdot P(X=x_i) directly.

Marking: (a) A1 for 0.3. (b) A1 for 3.1. (c) M1 for correct E(X2)\text{E}(X^2) or correct method. A1 for 1.29.


Question 5 (3 marks)

(a) [1 mark]

Answer: XB(4,0.5)X \sim \text{B}(4,\, 0.5)

Working: Each roll is independent. Probability of a prime (2, 3, or 5) on one roll is p=36=0.5p = \frac{3}{6} = 0.5. Number of trials n=4n = 4.

(b) [2 marks]

Answer: P(X=2)=0.375P(X = 2) = 0.375

Working:

P(X=2)=(42)(0.5)2(0.5)2=6×0.25×0.25=6×0.0625=0.375P(X = 2) = \binom{4}{2}(0.5)^2(0.5)^2 = 6 \times 0.25 \times 0.25 = 6 \times 0.0625 = 0.375

Teaching notes: The binomial distribution applies when there are a fixed number of independent trials, each with the same probability of success, and we count the number of successes. The general formula is P(X=r)=(nr)pr(1p)nrP(X = r) = \binom{n}{r}p^r(1-p)^{n-r}.

Marking: (a) A1 for correct distribution and both parameters. (b) M1 for correct binomial formula with n=4,r=2,p=0.5n=4, r=2, p=0.5. A1 for 0.375.


Question 6 (4 marks)

(a) [2 marks]

Answer: 0.68270.6827

Working: XN(150,202)X \sim \text{N}(150,\, 20^2)

P(130<X<170)=P(13015020<Z<17015020)=P(1<Z<1)P(130 < X < 170) = P\left(\frac{130 - 150}{20} < Z < \frac{170 - 150}{20}\right) = P(-1 < Z < 1) =2Φ(1)1=2(0.8413)1=0.6826= 2\Phi(1) - 1 = 2(0.8413) - 1 = 0.6826

Answer: 0.6830.683 (to 3 s.f.)

(b) [2 marks]

Answer: 0.22660.2266

Working: For the sample mean XˉN(150,2029)=N(150,44.44)\bar{X} \sim \text{N}\left(150,\, \frac{20^2}{9}\right) = \text{N}(150,\, 44.44)

σXˉ=2036.667\sigma_{\bar{X}} = \frac{20}{3} \approx 6.667

P(Xˉ>155)=P(Z>15515020/3)=P(Z>56.667)=P(Z>0.75)P(\bar{X} > 155) = P\left(Z > \frac{155 - 150}{20/3}\right) = P\left(Z > \frac{5}{6.667}\right) = P(Z > 0.75) =1Φ(0.75)=10.7734=0.2266= 1 - \Phi(0.75) = 1 - 0.7734 = 0.2266

Answer: 0.2270.227 (to 3 s.f.)

Teaching notes: Part (a) uses the standard normal distribution for individual observations. Part (b) uses the sampling distribution of the sample mean: XˉN(μ,σ2/n)\bar{X} \sim \text{N}(\mu,\, \sigma^2/n). The standard error of the mean is σ/n\sigma/\sqrt{n}, which decreases as sample size increases — this reflects the fact that sample means vary less than individual values.

Marking: (a) M1 for standardising correctly. A1 for 0.683. (b) M1 for using σ/n\sigma/\sqrt{n} and standardising. A1 for 0.227.


Question 7 (3 marks)

Answer: 0.78810.7881

Working: Let XX = number of defective bulbs in 200. XB(200,0.05)X \sim \text{B}(200,\, 0.05).

Since n=200n = 200 is large and p=0.05p = 0.05 is small, use the Poisson approximation:

λ=np=200×0.05=10\lambda = np = 200 \times 0.05 = 10

XapproxPo(10)X \stackrel{\text{approx}}{\sim} \text{Po}(10)

P(X12)=k=012e1010kk!P(X \leq 12) = \sum_{k=0}^{12} \frac{e^{-10} \cdot 10^k}{k!}

Using a calculator or Poisson tables: P(X12)0.7916P(X \leq 12) \approx 0.7916

Alternatively, using normal approximation (since np=105np = 10 \geq 5 and nq=1905nq = 190 \geq 5):

XapproxN(10,9.5)X \stackrel{\text{approx}}{\sim} \text{N}(10,\, 9.5) (using Var=np(1p)=9.5\text{Var} = np(1-p) = 9.5)

With continuity correction:

P(X12)P(Z<12.5109.5)=P(Z<2.53.082)=P(Z<0.811)P(X \leq 12) \approx P\left(Z < \frac{12.5 - 10}{\sqrt{9.5}}\right) = P\left(Z < \frac{2.5}{3.082}\right) = P(Z < 0.811) =Φ(0.811)0.7916= \Phi(0.811) \approx 0.7916

Answer: 0.7920.792 (to 3 s.f.)

Teaching notes: When nn is large and pp is small, the Poisson approximation to the binomial is appropriate (rule of thumb: n20n \geq 20 and p0.05p \leq 0.05). The normal approximation also works when np5np \geq 5 and n(1p)5n(1-p) \geq 5, but requires a continuity correction since we're approximating a discrete distribution with a continuous one.

Marking: M1 for identifying a suitable approximation (Poisson or Normal) with parameter. M1 for correct calculation method. A1 for answer 0.788–0.792.


Question 8 (4 marks)

(a) [2 marks]

Answer: Mean 55.5\approx 55.5

Working: Use midpoints of each class:

ClassMidpoint mmFrequency ffmfmfm2fm^2f
0x<200 \leq x < 2010660600
20x<4020 \leq x < 40301442012,600
40x<6040 \leq x < 6050221,10055,000
60x<8060 \leq x < 8070261,820127,400
80x10080 \leq x \leq 10090121,08097,200
Total804,480292,800

xˉ=mff=4,48080=56.0\bar{x} = \frac{\sum mf}{\sum f} = \frac{4{,}480}{80} = 56.0

Answer: Mean =56.0= 56.0

(b) [2 marks]

Answer: Standard deviation 22.2\approx 22.2

Working:

s2=m2f(mf)2nn1=292,800(4,480)28079s^2 = \frac{\sum m^2f - \frac{(\sum mf)^2}{n}}{n-1} = \frac{292{,}800 - \frac{(4{,}480)^2}{80}}{79}

=292,80020,070,4008079=292,800250,88079=41,92079530.63= \frac{292{,}800 - \frac{20{,}070{,}400}{80}}{79} = \frac{292{,}800 - 250{,}880}{79} = \frac{41{,}920}{79} \approx 530.63

s=530.6323.04s = \sqrt{530.63} \approx 23.04

Answer: Standard deviation 23.0\approx 23.0

Teaching notes: When data is grouped, we use the midpoint of each class as a representative value for all data points in that class. This introduces a small approximation error, but is the standard approach. The unbiased variance formula with n1n-1 applies here too.

Marking: (a) M1 for correct midpoints and mf\sum mf calculation. A1 for 56.0. (b) M1 for correct substitution into variance formula. A1 for 23.0 (accept 22.9–23.1).


Question 9 (4 marks)

(a) [2 marks]

Working: For a valid PDF, the total area under the curve must equal 1:

04kx(4x)dx=1\int_0^4 kx(4-x)\,dx = 1

k04(4xx2)dx=1k\int_0^4 (4x - x^2)\,dx = 1

k[2x2x33]04=1k\left[2x^2 - \frac{x^3}{3}\right]_0^4 = 1

k[(2(16)643)0]=1k\left[\left(2(16) - \frac{64}{3}\right) - 0\right] = 1

k[32643]=1k\left[32 - \frac{64}{3}\right] = 1

k[96643]=1k\left[\frac{96 - 64}{3}\right] = 1

k323=1    k=332(shown)k \cdot \frac{32}{3} = 1 \implies k = \frac{3}{32} \quad \text{(shown)}

(b) [2 marks]

Answer: E(X)=2\text{E}(X) = 2

Working:

E(X)=04x332x(4x)dx=33204x2(4x)dx\text{E}(X) = \int_0^4 x \cdot \frac{3}{32}x(4-x)\,dx = \frac{3}{32}\int_0^4 x^2(4-x)\,dx

=33204(4x2x3)dx=332[4x33x44]04= \frac{3}{32}\int_0^4 (4x^2 - x^3)\,dx = \frac{3}{32}\left[\frac{4x^3}{3} - \frac{x^4}{4}\right]_0^4

=332[4(64)32564]=332[256364]= \frac{3}{32}\left[\frac{4(64)}{3} - \frac{256}{4}\right] = \frac{3}{32}\left[\frac{256}{3} - 64\right]

=332[2561923]=332643=6432=2= \frac{3}{32}\left[\frac{256 - 192}{3}\right] = \frac{3}{32} \cdot \frac{64}{3} = \frac{64}{32} = 2

Teaching notes: For a continuous random variable with PDF f(x)f(x), the expected value is E(X)=xf(x)dx\text{E}(X) = \int_{-\infty}^{\infty} x \cdot f(x)\,dx. The normalisation condition f(x)dx=1\int f(x)\,dx = 1 is used to find unknown constants in the PDF.

Marking: (a) M1 for setting up the integral correctly. A1 for showing k=3/32k = 3/32 clearly. (b) M1 for correct integration setup. A1 for E(X)=2\text{E}(X) = 2.


Section B: Regression & Correlation


Question 10 (5 marks)

(a) [3 marks]

Answer: r0.975r \approx -0.975

Working:

Sxx=x2(x)2n=330(50)210=330250=80S_{xx} = \sum x^2 - \frac{(\sum x)^2}{n} = 330 - \frac{(50)^2}{10} = 330 - 250 = 80

Syy=y2(y)2n=52,200(720)210=52,20051,840=360S_{yy} = \sum y^2 - \frac{(\sum y)^2}{n} = 52{,}200 - \frac{(720)^2}{10} = 52{,}200 - 51{,}840 = 360

Sxy=xy(x)(y)n=3,480(50)(720)10=3,4803,600=120S_{xy} = \sum xy - \frac{(\sum x)(\sum y)}{n} = 3{,}480 - \frac{(50)(720)}{10} = 3{,}480 - 3{,}600 = -120

r=SxySxxSyy=12080×360=12028,800=120169.710.707r = \frac{S_{xy}}{\sqrt{S_{xx} \cdot S_{yy}}} = \frac{-120}{\sqrt{80 \times 360}} = \frac{-120}{\sqrt{28{,}800}} = \frac{-120}{169.71} \approx -0.707

Let me recheck: 28,800=14400×2=1202169.71\sqrt{28{,}800} = \sqrt{14400 \times 2} = 120\sqrt{2} \approx 169.71

r=120169.710.707r = \frac{-120}{169.71} \approx -0.707

Answer: r0.707r \approx -0.707

(b) [1 mark]

Interpretation: There is a moderately strong negative linear relationship between hours of weekly exercise and resting heart rate. As exercise hours increase, resting heart rate tends to decrease.

(c) [1 mark]

Comment: Since r0.707|r| \approx 0.707 is reasonably close to 1, a linear model is moderately appropriate for this data, though there is still some scatter around the line.

Teaching notes: The product moment correlation coefficient rr ranges from 1-1 to +1+1. Values close to 1-1 or +1+1 indicate strong linear association; values near 0 indicate weak or no linear association. Negative rr means the variables move in opposite directions. The summary statistics formulas SxxS_{xx}, SyyS_{yy}, SxyS_{xy} are used when raw data is not available.

Marking: (a) M1 for correct SxxS_{xx}, SyyS_{yy}, SxyS_{xy} values. M1 for correct substitution into rr formula. A1 for 0.707-0.707 (accept 0.71-0.71). (b) A1 for correct interpretation in context. (c) A1 for appropriate comment.


Question 11 (5 marks)

(a) [3 marks]

Answer: y=842.4xy = 84 - 2.4x

Working:

b=SxySxx=12080=1.5b = \frac{S_{xy}}{S_{xx}} = \frac{-120}{80} = -1.5

a=yˉbxˉ=72010(1.5)(5010)=72+1.5(5)=72+7.5=79.5a = \bar{y} - b\bar{x} = \frac{720}{10} - (-1.5)\left(\frac{50}{10}\right) = 72 + 1.5(5) = 72 + 7.5 = 79.5

Answer: y=79.51.5xy = 79.5 - 1.5x

(b) [1 mark]

Answer: 69.069.0 beats per minute

Working: When x=7x = 7:

y=79.51.5(7)=79.510.5=69.0y = 79.5 - 1.5(7) = 79.5 - 10.5 = 69.0

(c) [1 mark]

Explanation: x=20x = 20 is far outside the range of the sample data (which had x=50\sum x = 50 for 10 people, so xx values are roughly 0–10 hours). Extrapolation beyond the data range is unreliable because the linear relationship may not hold outside the observed range.

Teaching notes: The regression line of yy on xx minimises the sum of squared vertical distances from the data points to the line. The slope b=Sxy/Sxxb = S_{xy}/S_{xx} represents the change in yy for a one-unit increase in xx. Always be cautious about extrapolation — predictions are most reliable within the range of the observed data.

Marking: (a) M1 for correct bb value. M1 for correct aa value. A1 for correct equation. (b) A1 for 69.0 (or 69). (c) A1 for mentioning extrapolation or data range.


Question 12 (5 marks)

(a) [3 marks]

Answer: y=4.8+4.86xy = 4.8 + 4.86x (or y=4.8+4.9xy = 4.8 + 4.9x to 2 s.f.)

Working:

n=6,x=18.5,y=122,xy=426.5,x2=64.75n = 6,\quad \sum x = 18.5,\quad \sum y = 122,\quad \sum xy = 426.5,\quad \sum x^2 = 64.75

Sxx=64.75(18.5)26=64.75342.256=64.7557.042=7.708S_{xx} = 64.75 - \frac{(18.5)^2}{6} = 64.75 - \frac{342.25}{6} = 64.75 - 57.042 = 7.708

Sxy=426.5(18.5)(122)6=426.522576=426.5376.167=50.333S_{xy} = 426.5 - \frac{(18.5)(122)}{6} = 426.5 - \frac{2257}{6} = 426.5 - 376.167 = 50.333

b=SxySxx=50.3337.7086.53b = \frac{S_{xy}}{S_{xx}} = \frac{50.333}{7.708} \approx 6.53

a=yˉbxˉ=12266.53×18.56=20.3336.53×3.083=20.33320.133=0.20a = \bar{y} - b\bar{x} = \frac{122}{6} - 6.53 \times \frac{18.5}{6} = 20.333 - 6.53 \times 3.083 = 20.333 - 20.133 = 0.20

Answer: y=0.20+6.53xy = 0.20 + 6.53x

(b) [2 marks]

Working: The scatter diagram should show advertising expenditure on the horizontal axis and revenue on the vertical axis, with all 6 data points plotted and the regression line drawn.

<image_placeholder> id: Q12-fig1 type: graph linked_question: Q12 description: Scatter diagram with advertising expenditure (x-axis, range 0 to 6, in thousands of dollars) and revenue (y-axis, range 0 to 35, in thousands of dollars). Plot the six data points: (2.0, 15), (3.5, 22), (1.5, 12), (4.0, 25), (2.5, 18), (5.0, 30). Include the regression line y = 0.20 + 6.53x drawn through the data. labels: x-axis: Advertising expenditure (1000),yaxis:Revenue(1000), y-axis: Revenue (1000), title: Scatter diagram of advertising vs revenue values: Data points as listed above; regression line equation y = 0.20 + 6.53x must_show: All six plotted points, both axes with labels and scale, regression line, title </image_placeholder>

Teaching notes: When calculating regression from raw data, first compute the summary statistics (x\sum x, y\sum y, xy\sum xy, x2\sum x^2), then find SxxS_{xx} and SxyS_{xy}, then the slope and intercept. The scatter diagram should have clearly labelled axes with appropriate scales, all points plotted accurately, and the regression line drawn through the data.

Marking: (a) M1 for correct SxxS_{xx} and SxyS_{xy}. M1 for correct bb and aa. A1 for correct equation. (b) B1 for plotting points correctly. B1 for drawing the regression line.


Question 13 (5 marks)

(a) [2 marks]

Answer: λ=3\lambda = 3

Working: For XPo(λ)X \sim \text{Po}(\lambda):

P(X=0)=eλ=0.0498P(X = 0) = e^{-\lambda} = 0.0498

λ=ln(0.0498)=3.000    λ=3-\lambda = \ln(0.0498) = -3.000 \implies \lambda = 3

(b) [3 marks]

Answer: P(X3)=0.5768P(X \geq 3) = 0.5768

Working:

P(X3)=1P(X2)=1[P(X=0)+P(X=1)+P(X=2)]P(X \geq 3) = 1 - P(X \leq 2) = 1 - [P(X=0) + P(X=1) + P(X=2)]

P(X=0)=e3=0.0498P(X = 0) = e^{-3} = 0.0498

P(X=1)=3e3=3×0.0498=0.1494P(X = 1) = 3e^{-3} = 3 \times 0.0498 = 0.1494

P(X=2)=32e32!=9×0.04982=0.2240P(X = 2) = \frac{3^2 e^{-3}}{2!} = \frac{9 \times 0.0498}{2} = 0.2240

P(X2)=0.0498+0.1494+0.2240=0.4232P(X \leq 2) = 0.0498 + 0.1494 + 0.2240 = 0.4232

P(X3)=10.4232=0.5768P(X \geq 3) = 1 - 0.4232 = 0.5768

Answer: 0.5770.577 (to 3 s.f.)

Teaching notes: The Poisson distribution is used to model the number of events occurring in a fixed interval of time or space, when events occur independently at a constant average rate. The parameter λ\lambda represents both the mean and the variance. The formula is P(X=r)=eλλrr!P(X = r) = \frac{e^{-\lambda}\lambda^r}{r!}.

Marking: (a) M1 for setting eλ=0.0498e^{-\lambda} = 0.0498. A1 for λ=3\lambda = 3. (b) M1 for correct method (complementary probability). M1 for calculating individual probabilities. A1 for 0.577.


Section C: Applied Probability & Normal Distribution


Question 14 (4 marks)

(a) [2 marks]

Answer: 0.09180.0918

Working: XN(162,62)X \sim \text{N}(162,\, 6^2)

P(X>170)=P(Z>1701626)=P(Z>86)=P(Z>1.333)P(X > 170) = P\left(Z > \frac{170 - 162}{6}\right) = P\left(Z > \frac{8}{6}\right) = P(Z > 1.333) =1Φ(1.333)=10.9088=0.0912= 1 - \Phi(1.333) = 1 - 0.9088 = 0.0912

Answer: 0.09120.0912 (to 3 s.f.)

(b) [2 marks]

Answer: 168.2 cm168.2\text{ cm}

Working: We need hh such that P(X>h)=0.15P(X > h) = 0.15, so P(Xh)=0.85P(X \leq h) = 0.85.

From normal tables, Φ(z)=0.85\Phi(z) = 0.85 gives z1.036z \approx 1.036.

h1626=1.036    h=162+6(1.036)=162+6.216=168.22\frac{h - 162}{6} = 1.036 \implies h = 162 + 6(1.036) = 162 + 6.216 = 168.22

Answer: 168 cm168\text{ cm} (to 3 s.f.)

Teaching notes: Part (a) is a standard forward probability calculation. Part (b) is an inverse problem — we know the probability and need to find the value. This requires using the inverse normal function (or reading the zz-value from tables). The zz-score tells us how many standard deviations above or below the mean a value lies.

Marking: (a) M1 for standardising. A1 for 0.0912 (accept 0.091–0.092). (b) M1 for correct zz-value or inverse normal method. A1 for 168 (accept 168.2).


Question 15 (4 marks)

(a) [2 marks]

Answer: 122\dfrac{1}{22}

Working: Total balls = 12. Drawing 3 without replacement.

P(all 3 red)=(53)(123)=10220=122P(\text{all 3 red}) = \frac{\binom{5}{3}}{\binom{12}{3}} = \frac{10}{220} = \frac{1}{22}

(b) [2 marks]

Answer: 2944\dfrac{29}{44}

Working: "Exactly two balls are of the same colour" means one pair and one different colour.

Case 1: 2 red, 1 not red

(52)(71)(123)=10×7220=70220\frac{\binom{5}{2}\binom{7}{1}}{\binom{12}{3}} = \frac{10 \times 7}{220} = \frac{70}{220}

Case 2: 2 blue, 1 not blue

(42)(81)(123)=6×8220=48220\frac{\binom{4}{2}\binom{8}{1}}{\binom{12}{3}} = \frac{6 \times 8}{220} = \frac{48}{220}

Case 3: 2 green, 1 not green

(32)(91)(123)=3×9220=27220\frac{\binom{3}{2}\binom{9}{1}}{\binom{12}{3}} = \frac{3 \times 9}{220} = \frac{27}{220}

Total:

P(exactly 2 same colour)=70+48+27220=145220=2944P(\text{exactly 2 same colour}) = \frac{70 + 48 + 27}{220} = \frac{145}{220} = \frac{29}{44}

Teaching notes: When drawing without replacement, use combinations (not probabilities multiplied sequentially, though that also works). The phrase "exactly two of the same colour" means one pair and one singleton — it does NOT include the case where all three are the same colour.

Marking: (a) M1 for using combinations correctly. A1 for 122\frac{1}{22}. (b) M1 for considering cases or correct approach. A1 for 2944\frac{29}{44} (or 0.659).


Question 16 (4 marks)

(a) [2 marks]

Answer: 0.46800.4680

Working: XN(45,82)X \sim \text{N}(45,\, 8^2)

P(40<X<50)=P(40458<Z<50458)=P(0.625<Z<0.625)P(40 < X < 50) = P\left(\frac{40 - 45}{8} < Z < \frac{50 - 45}{8}\right) = P(-0.625 < Z < 0.625) =Φ(0.625)Φ(0.625)=2Φ(0.625)1=2(0.7340)1=0.4680= \Phi(0.625) - \Phi(-0.625) = 2\Phi(0.625) - 1 = 2(0.7340) - 1 = 0.4680

Answer: 0.4680.468 (to 3 s.f.)

(b) [2 marks]

Answer: Expected number 5\approx 5

Working: First find P(X>55)P(X > 55):

P(X>55)=P(Z>55458)=P(Z>1.25)=1Φ(1.25)=10.8944=0.1056P(X > 55) = P\left(Z > \frac{55 - 45}{8}\right) = P(Z > 1.25) = 1 - \Phi(1.25) = 1 - 0.8944 = 0.1056

Expected number = 50×0.1056=5.2850 \times 0.1056 = 5.28

Answer: 5.285.28 (or about 5 candidates)

Teaching notes: For part (b), the expected number in a binomial setting is n×pn \times p, where pp is the probability of the event for one individual. This uses the linearity of expectation.

Marking: (a) M1 for standardising both bounds. A1 for 0.468. (b) M1 for finding P(X>55)P(X > 55). A1 for 5.28 (accept 5).


Question 17 (4 marks)

(a) [1 mark]

Answer: Poisson distribution. The calls occur independently at a constant average rate in a fixed time interval.

(b) [2 marks]

Answer: 0.13220.1322

Working: XPo(3.5)X \sim \text{Po}(3.5)

P(X=5)=e3.5(3.5)55!=0.030197×525.2188120=15.8581200.1322P(X = 5) = \frac{e^{-3.5}(3.5)^5}{5!} = \frac{0.030197 \times 525.2188}{120} = \frac{15.858}{120} \approx 0.1322

(c) [1 mark]

Answer: 0.86410.8641

Working:

P(X2)=1P(X=0)P(X=1)=1e3.53.5e3.5P(X \geq 2) = 1 - P(X = 0) - P(X = 1) = 1 - e^{-3.5} - 3.5e^{-3.5} =10.0301970.10569=10.13589=0.8641= 1 - 0.030197 - 0.10569 = 1 - 0.13589 = 0.8641

Teaching notes: The Poisson distribution is appropriate for modelling counts of events in fixed intervals when events occur independently at a known constant rate. The parameter λ\lambda is the average number of events per interval.

Marking: (a) A1 for Poisson with valid reason. (b) M1 for correct Poisson formula. A1 for 0.132. (c) A1 for 0.864.


Question 18 (4 marks)

(a) [2 marks]

Answer: σ=10.2 g\sigma = 10.2\text{ g}

Working: XN(500,σ2)X \sim \text{N}(500,\, \sigma^2), and P(X<480)=0.025P(X < 480) = 0.025.

P(Z<480500σ)=0.025P\left(Z < \frac{480 - 500}{\sigma}\right) = 0.025

From normal tables, Φ(z)=0.025\Phi(z) = 0.025 gives z=1.96z = -1.96.

480500σ=1.96    20σ=1.96    σ=201.96=10.20\frac{480 - 500}{\sigma} = -1.96 \implies \frac{-20}{\sigma} = -1.96 \implies \sigma = \frac{20}{1.96} = 10.20

Answer: σ=10.2 g\sigma = 10.2\text{ g} (to 3 s.f.)

(b) [2 marks]

Answer: 0.77450.7745

Working: Using σ=10.20\sigma = 10.20:

P(490<X<515)=P(49050010.20<Z<51550010.20)P(490 < X < 515) = P\left(\frac{490 - 500}{10.20} < Z < \frac{515 - 500}{10.20}\right) =P(1010.20<Z<1510.20)=P(0.980<Z<1.471)= P\left(\frac{-10}{10.20} < Z < \frac{15}{10.20}\right) = P(-0.980 < Z < 1.471) =Φ(1.471)Φ(0.980)=0.92940.1635=0.7659= \Phi(1.471) - \Phi(-0.980) = 0.9294 - 0.1635 = 0.7659

Answer: 0.7660.766 (to 3 s.f.)

Teaching notes: Part (a) is an inverse normal problem — we use the known percentile to find the standard deviation. The zz-score corresponding to the 2.5th percentile is 1.96-1.96 (a standard value worth remembering). Part (b) then uses this σ\sigma for a standard probability calculation.

Marking: (a) M1 for setting up the inverse normal equation with z=1.96z = -1.96. A1 for σ=10.2\sigma = 10.2. (b) M1 for standardising with the found σ\sigma. A1 for 0.766 (accept 0.765–0.775).


End of Answer Key

Total: 70 marks

SectionMarks
A: Pure Statistics30
B: Regression & Correlation20
C: Applied Probability & Normal Distribution20
Total70