IB Questionbank

User interface language: English | Español

HL Paper 2

Charlotte decides to model the shape of a cupcake to calculate its volume.

From rotating a photograph of her cupcake she estimates that its cross-section passes through the points $(0, 3.5), (4, 6), (6.5, 4), (7, 3)$ and $(7.5, 0)$ , where all units are in centimetres. The cross-section is symmetrical in the $x$ -axis, as shown below:

She models the section from $(0, 3.5)$ to $(4, 6)$ as a straight line.

Charlotte models the section of the cupcake that passes through the points $(4, 6), (6.5, 4), (7, 3)$ and $(7.5, 0)$ with a quadratic curve.

Charlotte thinks that a quadratic with a maximum point at $(4, 6)$ and that passes through the point $(7.5, 0)$ would be a better fit.

Believing this to be a better model for her cupcake, Charlotte finds the volume of revolution about the $x$ -axis to estimate the volume of the cupcake.

Find the equation of the line passing through these two points.

[2]

Find the equation of the least squares regression quadratic curve for these four points.

[2]

b.i.

By considering the gradient of this curve when $x = 4$ , explain why it may not be a good model.

[1]

b.ii.

Find the equation of the new model.

[4]

Write down an expression for her estimate of the volume as a sum of two integrals.

[4]

d.i.

Find the value of Charlotte’s estimate.

[1]

d.ii.

Markscheme

$y = \frac{5}{8} x + \frac{7}{2} (y = 0.625 x + 3.5)$ A1A1

Note: Award A1 for $0.625 x$ , A1 for $3.5$ .
Award a maximum of A0A1 if not part of an equation.

[2 marks]

$y = - 0.975 x^{2} + 9.56 x - 16.7$ (M1)A1

$(y = - 0.974630 x^{2} + 9.55919 x - 16.6569 \dots)$

[2 marks]

b.i.

gradient of curve is positive at $x = 4$ R1

Note: Accept a sensible rationale that refers to the gradient.

[1 mark]

b.ii.

METHOD 1

let $y = a x^{2} + b x + c$

differentiating or using $x = \frac{- b}{2 a}$ (M1)

$8 a + b = 0$

substituting in the coordinates
$7 . 5^{2} a + 7.5 b + c = 0$ (A1)
$4^{2} a + 4 b + c = 6$ (A1)

solve to get
$y = - \frac{24}{49} x^{2} + \frac{192}{49} x - \frac{90}{49}$ OR $y = - 0.490 x^{2} + 3.92 x - 1.84$ A1

Note: Use of quadratic regression with points using the symmetry of the graph is a valid method.

METHOD 2

$y = a {(x - 4)}^{2} + 6$ (M1)

$0 = a {(7.5 - 4)}^{2} + 6$ (M1)

$a = - \frac{24}{49}$ (A1)

$y = - \frac{24}{49} {(x - 4)}^{2} + 6$ OR $y = - 0.490 {(x - 4)}^{2} + 6$ A1

[4 marks]

$π \int_{0}^{4} {(\frac{5}{8} x + 3.5)}^{2} d x + π \int_{4}^{7.5} {(- \frac{24}{49} {(x - 4)}^{2} + 6)}^{2} d x$ (M1)(M1) (M1)A1

Note: Award (M1)(M1)(M1)A0 if $π$ is omitted but response is otherwise correct. Award (M1) for an integral that indicates volume, (M1) for their part (a) within their volume integral, (M1) for their part (b)(i) within their volume integral, A1 for their correct two integrals with all correct limits.

[4 marks]

d.i.

$501 {cm}^{3} (501.189 \dots)$ A1

[1 mark]

d.ii.

Examiners report

[N/A]

b.i.

[N/A]

b.ii.

[N/A]

d.i.

[N/A]

d.ii.

It is known that the weights of male Persian cats are normally distributed with mean $6.1 kg$ and variance $0 . 5^{2} {kg}^{2}$ .

A group of $80$ male Persian cats are drawn from this population.

The male cats are now joined by $80$ female Persian cats. The female cats are drawn from a population whose weights are normally distributed with mean $4.5 kg$ and standard deviation $0.45 kg$ .

Ten female cats are chosen at random.

Sketch a diagram showing the above information.

[2]

Find the proportion of male Persian cats weighing between $5.5 kg$ and $6.5 kg$ .

[2]

Determine the expected number of cats in this group that have a weight of less than $5.3 kg$ .

[3]

Find the probability that exactly one of them weighs over $4.62 kg$ .

[4]

d.i.

Let $N$ be the number of cats weighing over $4.62 kg$ .

Find the variance of $N$ .

[1]

d.ii.

A cat is selected at random from all $160$ cats.

Find the probability that the cat was female, given that its weight was over $4.7 kg$ .

[4]

Markscheme

A1A1

Note: Award A1 for a normal curve with mean labelled $6.1$ or $μ$ , A1 for indication of SD $(0.5) :$ marks on horizontal axis at $5.6$ and/or $6.6$ OR $μ - 0.5$ and/or $μ + 0.5$ on the correct side and approximately correct position.

[2 marks]

$X ~ N (6.1, 0 . 5^{2})$

$P (5.5 < X < 6.5)$ OR labelled sketch of region (M1)

$= 0.673 (0.673074 \dots)$ A1

[2 marks]

$(P (X < 5.3) =) 0.0547992 \dots$ (A1)

$0.0547992 \dots \times 80$ (M1)

$= 4.38 (4.38393 \dots)$ A1

[3 marks]

$Y ~ N (4.5, 0 . 45^{2})$ ,

$(P (Y > 4.62) =) 0.394862 \dots$ (A1)

use of binomial seen or implied (M1)

using $B (10, 0.394862 \dots)$ (M1)

$0.0430 (0.0429664 \dots)$ A1

[4 marks]

d.i.

$n p (1 - p) = 2.39 (2.38946 \dots)$ A1

[1 mark]

d.ii.

$P (F \cap (W > 4.7)) = 0.5 \times 0.3284 (= 0.1642)$ (A1)

attempt use of tree diagram OR use of $P (F W > 4.7) = \frac{P (F \cap (W > 4.7))}{P (W > 4.7)}$ (M1)

$\frac{0.5 \times 0.3284}{0.5 \times 0.9974 + 0.5 \times 0.3284}$ (A1)

$= 0.248 (0.247669 \dots)$ A1

[4 marks]

Examiners report

[N/A]

d.i.

[N/A]

d.ii.

[N/A]

A city has two cable companies, X and Y. Each year 20 % of the customers using company X move to company Y and 10 % of the customers using company Y move to company X. All additional losses and gains of customers by the companies may be ignored.

Initially company X and company Y both have 1200 customers.

Write down a transition matrix T representing the movements between the two companies in a particular year.

[2]

Find the eigenvalues and corresponding eigenvectors of T.

[4]

Hence write down matrices P and D such that T = PDP⁻¹.

[2]

Find an expression for the number of customers company X has after $n$ years, where $n \in \mathbb{N}$ .

[5]

Hence write down the number of customers that company X can expect to have in the long term.

[1]

Markscheme

$\left( {\begin{array}{*{20}{c}} {0.8}&{0.1} \\ {0.2}&{0.9} \end{array}} \right)$ M1A1

[2 marks]

$\left| {\begin{array}{*{20}{c}} {0.8 - \lambda }&{0.1} \\ {0.2}&{0.9 - \lambda } \end{array}} \right| = 0$ M1

$\lambda = 1$ and 0.7 A1

eigenvectors $\left( {\begin{array}{*{20}{c}} 1 \\ 2 \end{array}} \right)$ and $\left( {\begin{array}{*{20}{c}} 1 \\ { - 1} \end{array}} \right)$ (M1)A1

Note: Accept any scalar multiple of the eigenvectors.

[4 marks]

EITHER

P = $\left( {\begin{array}{*{20}{c}} 1&1 \\ 2&{ - 1} \end{array}} \right)$ D = $\left( {\begin{array}{*{20}{c}} 1&0 \\ 0&{0.7} \end{array}} \right)$ A1A1

P = $\left( {\begin{array}{*{20}{c}} 1&1 \\ { - 1}&2 \end{array}} \right)$ D = $\left( {\begin{array}{*{20}{c}} {0.7}&0 \\ 0&1 \end{array}} \right)$ A1A1

[2 marks]

P⁻¹ = $\frac{1}{3}\left( {\begin{array}{*{20}{c}} 1&1 \\ 2&{ - 1} \end{array}} \right)$ A1

$\frac{1}{3}\left( {\begin{array}{*{20}{c}} 1&1 \\ 2&{ - 1} \end{array}} \right)\left( {\begin{array}{*{20}{c}} 1&0 \\ 0&{{{0.7}^n}} \end{array}} \right)\left( {\begin{array}{*{20}{c}} 1&1 \\ 2&{ - 1} \end{array}} \right)\left( {\begin{array}{*{20}{c}} {1200} \\ {1200} \end{array}} \right)$ M1A1

attempt to multiply matrices M1

so in company A, after $n$ years, $400\left( {2 + {{0.7}^n}} \right)$ A1

[5 marks]

400 × 2 = 800 A1

[1 mark]

Examiners report

[N/A]

In a small village there are two doctors’ clinics, one owned by Doctor Black and the other owned by Doctor Green. It was noted after each year that $3.5 %$ of Doctor Black’s patients moved to Doctor Green’s clinic and $5 %$ of Doctor Green’s patients moved to Doctor Black’s clinic. All additional losses and gains of patients by the clinics may be ignored.

At the start of a particular year, it was noted that Doctor Black had $2100$ patients on their register, compared to Doctor Green’s $3500$ patients.

Write down a transition matrix $T$ indicating the annual population movement between clinics.

[2]

Find a prediction for the ratio of the number of patients Doctor Black will have, compared to Doctor Green, after two years.

[2]

Find a matrix $P$ , with integer elements, such that $T = P D P^{- 1}$ , where $D$ is a diagonal matrix.

[6]

Hence, show that the long-term transition matrix $T^{\infty}$ is given by $T^{\infty} = (\begin{matrix} \frac{10}{17} & \frac{10}{17} \\ \frac{7}{17} & \frac{7}{17} \end{matrix})$ .

[6]

Hence, or otherwise, determine the expected ratio of the number of patients Doctor Black would have compared to Doctor Green in the long term.

[2]

Markscheme

$T = (\begin{matrix} 0.965 & 0.05 \\ 0.035 & 0.95 \end{matrix})$ M1A1

Note: Award M1A1 for $T = (\begin{matrix} 0.95 & 0.035 \\ 0.05 & 0.965 \end{matrix})$ .
Award the A1 for a transposed $T$ if used correctly in part (b) i.e. preceded by $1 \times 2$ matrix $(2100 3500)$ rather than followed by a $2 \times 1$ matrix.

[2 marks]

${(\begin{matrix} 0.965 & 0.05 \\ 0.035 & 0.95 \end{matrix})}^{2} (\begin{matrix} 2100 \\ 3500 \end{matrix})$ (M1)

$= (\begin{matrix} 2294 \\ 3306 \end{matrix})$

so ratio is $2294 : 3306 (= 1147 : 1653, 0.693889 \dots)$ A1

[2 marks]

to solve $A x = λ x :$

$|\begin{matrix} 0.965 - λ & 0.05 \\ 0.035 & 0.95 - λ \end{matrix}| = 0$ (M1)

$(0.965 - λ) (0.95 - λ) - 0.05 \times 0.035 = 0$

$λ = 0.915$ OR $λ = 1$ (A1)

attempt to find eigenvectors for at least one eigenvalue (M1)

when $λ = 0.915, x = (\begin{matrix} 1 \\ - 1 \end{matrix})$ (or any real multiple) (A1)

when $λ = 1, x = (\begin{matrix} 10 \\ 7 \end{matrix})$ (or any real multiple) (A1)

therefore $P = (\begin{matrix} 1 & 10 \\ - 1 & 7 \end{matrix})$ (accept integer valued multiples of their eigenvectors and columns in either order) A1

[6 marks]

$P^{- 1} = {(\begin{matrix} 1 & 10 \\ - 1 & 7 \end{matrix})}^{- 1} = \frac{1}{17} (\begin{matrix} 7 & - 10 \\ 1 & 1 \end{matrix})$ (A1)

Note: This mark is independent, and may be seen anywhere in part (d).

$D = (\begin{matrix} 0.915 & 0 \\ 0 & 1 \end{matrix})$ (A1)

$T^{n} = P D^{n} P^{- 1} = (\begin{matrix} 1 & 10 \\ - 1 & 7 \end{matrix}) (\begin{matrix} 0 . 915^{n} & 0 \\ 0 & 1^{n} \end{matrix}) \frac{1}{17} (\begin{matrix} 7 & - 10 \\ 1 & 1 \end{matrix})$ (M1)A1

Note: Award (M1)A0 for finding $P^{- 1} D^{n} P$ correctly.

as $n \to \infty, D^{n} = (\begin{matrix} 0 . 915^{n} & 0 \\ 0 & 1^{n} \end{matrix}) \to (\begin{matrix} 0 & 0 \\ 0 & 1 \end{matrix})$ R1

so $T^{n} \to \frac{1}{17} (\begin{matrix} 1 & 10 \\ - 1 & 7 \end{matrix}) (\begin{matrix} 0 & 0 \\ 0 & 1 \end{matrix}) (\begin{matrix} 7 & - 10 \\ 1 & 1 \end{matrix})$ A1

$= (\begin{matrix} \frac{10}{17} & \frac{10}{17} \\ \frac{7}{17} & \frac{7}{17} \end{matrix})$ AG

Note: The AG line must be seen for the final A1 to be awarded.

[6 marks]

METHOD ONE

$(\begin{matrix} \frac{10}{17} & \frac{10}{17} \\ \frac{7}{17} & \frac{7}{17} \end{matrix}) (\begin{matrix} 2100 \\ 3500 \end{matrix}) = (\begin{matrix} 3294 \\ 2306 \end{matrix})$ (M1)

so ratio is $3294 : 2306 (1647 : 1153, 1.42844 \dots, 0.700060 \dots)$ A1

METHOD TWO

long term ratio is the eigenvector associated with the largest eigenvalue (M1)

$10 : 7$ A1

[2 marks]

Examiners report

[N/A]

The masses in kilograms of melons produced by a farm can be modelled by a normal distribution with a mean of $2.6 kg$ and a standard deviation of $0.5 kg$ .

Find the probability that two melons picked at random and independently of each other will

One year due to favourable weather conditions it is thought that the mean mass of the melons has increased.

The owner of the farm decides to take a random sample of $16$ melons to test this hypothesis at the $5 %$ significance level, assuming the standard deviation of the masses of the melons has not changed.

Unknown to the farmer the favourable weather conditions have led to all the melons having $10 %$ greater mass than the model described above.

Find the probability that a melon selected at random will have a mass greater than $3.0 kg$ .

[2]

both have a mass greater than $3.0 kg$ .

[2]

b.i.

have a total mass greater than $6.0 kg$ .

[2]

b.ii.

Write down the null and alternative hypotheses for the test.

[1]

Find the critical region for this test.

[4]

Find the mean and standard deviation of the mass of the melons for this year.

[3]

Find the probability of a Type II error in the owner’s test.

[2]

Markscheme

* This sample question was produced by experienced DP mathematics senior examiners to aid teachers in preparing for external assessment in the new MAA course. There may be minor differences in formatting compared to formal exam papers.

Let $X$ represent the mass of a melon

$P (X > 3.0) = 0.212 (0.2118 \dots)$ (M1)A1

[2 marks]

$0.2118 \dots \times 0.2118 \dots$ (M1)

$= 0.0449 (0.04488 \dots)$ A1

[2 marks]

b.i.

Let $T$ represent the total mass

$E (T) = 5.2$ A1

$Var (T) = 0 . 5^{2} + 0 . 5^{2} = 0.5$ (M1)A1

$T ~ N (5.2, 0.5)$

$P (T > 6.0) = 0.129 (0.1289 \dots)$ A1

[4 marks]

b.ii.

Let $μ$ be the mean mass of the melons produced by the farm.

$H_{0} : μ = 2.6 kg$ , $H_{1} : μ > 2.6 kg$ only A1

Note: Accept $H_{0} :$ The mean mass of melons produced by the farm is equal to $2.6 kg$
$H_{1} :$ The mean mass of melons produced by the farm is greater than $2.6 kg$

Note: Award A0 if $2.6 kg$ does not appear in the hypothesis.

[1 mark]

Under $H_{0}$ $\bar{X} ~ N (2.6, \frac{0 . 5^{2}}{16})$ A1

$P (\bar{X} > a) = 0.05$ (M1)

$a = 2.81 (2.805606 \dots)$ (A1)

Critical region is $\bar{X} > 2.81$ A1

[4 marks]

Let $W$ represent the new mass of the melons

$E (W) = 1.1 \times 2.6 = 2.86$ A1

Standard deviation of $W = 1.1 \times 0.5$ (M1)

$= 0.55$ A1

Note: award M1A0 for $Var (W) = 1 . 1^{2} \times 0 . 5^{2} = 0.3025$

[3 marks]

$P (Type II error) =$

$P (\bar{X} < 2.81 μ = 2.86, σ = \frac{0.55}{4})$ (M1)

$= 0.346 (0.346204 \dots)$ A1

Note: Accept $0.358$ from use of the three‐figure answer to part (d)

[2 marks]

Examiners report

[N/A]

b.i.

[N/A]

b.ii.

[N/A]

The random variable X is thought to follow a binomial distribution B (4, $p$ ). In order to investigate this belief, a random sample of 100 observations on X was taken with the following results.

An automatic machine is used to fill bottles of water. The amount delivered, $Y$ ml, may be assumed to be normally distributed with mean $\mu$ ml and standard deviation 8 ml. Initially, the machine is adjusted so that the value of $\mu$ is 500. In order to check that the value of $\mu$ remains equal to 500, a random sample of 10 bottles is selected at regular intervals, and the mean amount of water, ${\bar y}$ , in these bottles is calculated. The following hypotheses are set up.

H₀: $\mu$ = 500; H₁: $\mu$ ≠ 500

The critical region is defined to be $\left( {\bar y < 495} \right) \cup \left( {\bar y > 505} \right)$ .

State suitable hypotheses for testing this belief.

[1]

a.i.

Calculate the mean of these data and hence estimate the value of $p$ .

[5]

a.ii.

Calculate an appropriate value of ${\chi ^2}$ and state your conclusion, using a 1% significance level.

[13]

a.iii.

Find the significance level of this procedure.

[5]

b.i.

Some time later, the actual value of $\mu$ is 503. Find the probability of a Type II error.

[3]

b.ii.

Markscheme

H₀: The data are B (4, $p$ ); H₁ : The data are not B (4, $p$ ) A1

[1 mark]

a.i.

Mean $= \frac{{1 \times 32 + \ldots + 4 \times 14}}{{100}}$ M1A1

= 1.8 A1

$4\widehat p = 1.8 \Rightarrow \widehat p = 0.45$ M1A1

[5 marks]

a.ii.

The expected frequencies are

A1A1A1A1A1

The last two classes must be combined because the expected frequency for $x$ = 4 is less than 5. R1

${\chi ^2} = \frac{{{{17}^2}}}{{9.15}} + \frac{{{{32}^2}}}{{29.95}} + \frac{{{{19}^2}}}{{36.75}} + \frac{{{{32}^2}}}{{24.15}} - 100$ M2

= 18.0 A2

DF = 2 (A1)

Critical value = 9.21 A1

We conclude, at the 1% significance level, that X does not fit a binomial model. R1

Special case: award the following marks to candidates who do not combine classes.

${\chi ^2} = \frac{{{{17}^2}}}{{9.15}} + \frac{{{{32}^2}}}{{29.95}} + \frac{{{{19}^2}}}{{36.75}} + \frac{{{{18}^2}}}{{20.05}} + \frac{{{{14}^2}}}{{4.1}} - 100$ M2

= 39.6 A0

DF = 3 (A1)

Critical value = 11.345 A1

We conclude, at the 1% significance level, that X does not fit a binomial model. R1

[13 marks]

a.iii.

Under H₀, the distribution of ${\bar y}$ is N (500, 6.4). (A1)

Significance level = P ${\bar y}$ < 495 or > 505 | H₀ M2

= 2 × 0.02405 (A1)

= 0.0481 A1 N5

[5 marks]

b.i.

The distribution of ${\bar y}$ is now N (503, 6.4). (A1)

P(Type ΙΙ error) = P(495 < ${\bar y}$ < 505) (M1)

= 0.785 A1 N3

[3 marks]

b.ii.

Examiners report

[N/A]

a.i.

[N/A]

a.ii.

[N/A]

a.iii.

[N/A]

b.i.

[N/A]

b.ii.

A student investigating the relationship between chemical reactions and temperature finds the Arrhenius equation on the internet.

$k = A e^{- \frac{c}{T}}$

This equation links a variable $k$ with the temperature $T$ , where $A$ and $c$ are positive constants and $T > 0$ .

The Arrhenius equation predicts that the graph of $\ln k$ against $\frac{1}{T}$ is a straight line.

Write down

The following data are found for a particular reaction, where $T$ is measured in Kelvin and $k$ is measured in ${cm}^{3} {mol}^{- 1} s^{- 1}$ :

Find an estimate of

Show that $\frac{d k}{d T}$ is always positive.

[3]

Given that $\lim_{T \to \infty} k = A$ and $\lim_{T \to 0} k = 0$ , sketch the graph of $k$ against $T$ .

[3]

(i) the gradient of this line in terms of $c$ ;

(ii) the $y$ -intercept of this line in terms of $A$ .

[4]

Find the equation of the regression line for $\ln k$ on $\frac{1}{T}$ .

[2]

$c$ .

It is not required to state units for this value.

[1]

e.i.

$A$ .

It is not required to state units for this value.

[2]

e.ii.

Markscheme

attempt to use chain rule, including the differentiation of $\frac{1}{T}$ (M1)

$\frac{d k}{d T} = A \times \frac{c}{T^{2}} \times e^{- \frac{c}{T}}$ A1

this is the product of positive quantities so must be positive R1

Note: The R1 may be awarded for correct argument from their derivative. R1 is not possible if their derivative is not always positive.

[3 marks]

A1A1A1

Note: Award A1 for an increasing graph, entirely in first quadrant, becoming concave down for larger values of $T$ , A1 for tending towards the origin and A1 for asymptote labelled at $k = A$ .

[3 marks]

taking $\ln$ of both sides OR substituting $y = \ln x$ and $x = \frac{1}{T}$ (M1)

$\ln k = \ln A - \frac{c}{T}$ OR $y = - c x + \ln A$ (A1)

(i) so gradient is $- c$ A1

(ii) $y$ -intercept is $\ln A$ A1

Note: The implied (M1) and (A1) can only be awarded if both correct answers are seen. Award zero if only one value is correct and no working is seen.

[4 marks]

an attempt to convert data to $\frac{1}{T}$ and $\ln k$ (M1)

e.g. at least one correct row in the following table

line is $\ln k = - 13400 \times \frac{1}{T} + 15.0 (= - 13383.1 \dots \times \frac{1}{T} + 15.0107 \dots)$ A1

[2 marks]

$c = 13400 (13383.1 \dots)$ A1

[1 mark]

e.i.

attempt to rearrange or solve graphically $\ln A = 15.0107 \dots$ (M1)

$A = 3 300 000 (3 304 258 \dots)$ A1

Note: Accept an $A$ value of $3269017$ … from use of $3 sf$ value.

[2 marks]

e.ii.

Examiners report

This question caused significant difficulties for many candidates and many did not even attempt the question. Very few candidates were able to differentiate the expression in part (a) resulting in difficulties for part (b). Responses to parts (c) to (e) illustrated a lack of understanding of linearizing a set of data. Those candidates that were able to do part (d) frequently lost a mark as their answer was given in x and y.

[N/A]

e.i.

[N/A]

e.ii.

Willow finds that she receives approximately 70 emails per working day.

She decides to model the number of emails received per working day using the random variable $X$ , where $X$ follows a Poisson distribution with mean 70.

In order to test her model, Willow records the number of emails she receives per working day over a period of 6 months. The results are shown in the following table.

From the table, calculate

Archie works for a different company and knows that he receives emails according to a Poisson distribution, with a mean of $\lambda$ emails per day.

Using this distribution model, find ${\text{P}}\left( {X < 60} \right)$ .

[2]

a.i.

Using this distribution model, find the standard deviation of $X$ .

[2]

a.ii.

an estimate for the mean number of emails received per working day.

[3]

b.i.

an estimate for the standard deviation of the number of emails received per working day.

[2]

b.ii.

Give one piece of evidence that suggests Willow’s Poisson distribution model is not a good fit.

[1]

Suppose that the probability of Archie receiving more than 10 emails in total on any one day is 0.99. Find the value of λ.

[3]

Now suppose that Archie received exactly 20 emails in total in a consecutive two day period. Show that the probability that he received exactly 10 of them on the first day is independent of λ.

[5]

Markscheme

${\text{P}}\left( {X < 60} \right)$

$= {\text{P}}\left( {X \leqslant 59} \right)$ (M1)

= 0.102 A1

[2 marks]

a.i.

standard deviation = $\sqrt {70}$ (= 8.37) (M1)A1

[2 marks]

a.ii.

use of midpoints (accept consistent use of 45, 55 etc.) (M1)

$\frac{{44.5 \times 2 + 54.5 \times 15 + 64.5 \times 40 + 74.5 \times 53 + 94.5 + 104.5 \times 3 + 114.5 \times 6}}{{2 + 15 + 40 + 53 + 0 + 1 + 3 + 6}}$ (M1)

$= \frac{{8530}}{{120}}\left( { = 71.1} \right)$ A1

Note: If 45, 55, etc. are used consistently instead of midpoints (implied by the answer 71.58…) award M1M1A0.

[3 marks]

b.i.

13.9 (M1)A1

[2 marks]

b.ii.

valid reason given to include the examples below R1

variance is 192 which is not close to the mean (accept not equal to) standard deviation too high (using parts (a)(ii) and (b)(ii))

relative frequency of $X$ ≤ 59 is 0.142 which is too high (using part (a)(i))

Poisson would give a frequency of roughly 14 for 80 ≤ $X$ ≤ 89

Note: Reasons which do not use values found in previous parts must be backed up with numerical evidence.

[1 mark]

${\text{P}}\left( {Y > 10} \right) = 0.99$

$1 - {\text{P}}\left( {Y \leqslant 10} \right) = 0.99 \Rightarrow {\text{P}}\left( {Y \leqslant 10} \right) = 0.01$ (M1)

attempt to solve a correct equation (M1)

λ = 20.1 A1

[3 marks]

in 1 day, no of emails is X ~ Po(λ)

in 2 days, no of emails is Y ~ Po(2λ) (A1)

P(10 on first day | 20 in 2 days) (M1)

$\frac{{{\text{P}}\left( {X = 10} \right) \times {\text{P}}\left( {X = 10} \right)}}{{{\text{P}}\left( {Y = 20} \right)}}$ (M1)

$= \frac{{{{\left( {\frac{{{\lambda ^{10}}{e^{ - \lambda }}}}{{10{\text{!}}}}} \right)}^2}}}{{\frac{{{{\left( {2\lambda } \right)}^{20}}{e^{ - 2\lambda }}}}{{20{\text{!}}}}}}$ A1

$= \frac{{{\lambda ^{20}}{e^{ - 2\lambda }}}}{{{2^{20}}{\lambda ^{20}}{e^{ - 2\lambda }}}} \times \frac{{20{\text{!}}}}{{{{\left( {10{\text{!}}} \right)}^2}}}$ A1

$= \frac{{20{\text{!}}}}{{{2^{20}}{{\left( {10{\text{!}}} \right)}^2}}}$

which is independent of λ AG

[5 marks]

Examiners report

[N/A]

a.i.

[N/A]

a.ii.

[N/A]

b.i.

[N/A]

b.ii.

[N/A]

Long term experience shows that if it is sunny on a particular day in Vokram, then the probability that it will be sunny the following day is $0.8$ . If it is not sunny, then the probability that it will be sunny the following day is $0.3$ .

The transition matrix $T$ is used to model this information, where $T = (\begin{matrix} 0.8 & 0.3 \\ 0.2 & 0.7 \end{matrix})$ .

The matrix $T$ can be written as a product of three matrices, $P D P^{- 1}$ , where $D$ is a diagonal matrix.

It is sunny today. Find the probability that it will be sunny in three days’ time.

[2]

Find the eigenvalues and eigenvectors of $T$ .

[5]

Write down the matrix $P$ .

[1]

c.i.

Write down the matrix $D$ .

[1]

c.ii.

Hence find the long-term percentage of sunny days in Vokram.

[4]

Markscheme

finding $T^{3}$ OR use of tree diagram (M1)

$T^{3} = (\begin{matrix} 0.65 & 0.525 \\ 0.35 & 0.475 \end{matrix})$

the probability of sunny in three days’ time is $0.65$ A1

[2 marks]

attempt to find eigenvalues (M1)

Note: Any indication that $det (T - λ I) = 0$ has been used is sufficient for the (M1).

$|\begin{matrix} 0.8 - λ & 0.3 \\ 0.2 & 0.7 - λ \end{matrix}| = (0.8 - λ) (0.7 - λ) - 0.06 = 0$

$(λ^{2} - 1.5 λ + 0.5 = 0)$

$λ = 1, λ = 0.5$ A1

attempt to find either eigenvector (M1)

$0.8 x + 0.3 y = x \Rightarrow - 0.2 x + 0.3 y = 0$ so an eigenvector is $(\begin{matrix} 3 \\ 2 \end{matrix})$ A1

$0.8 x + 0.3 y = 0.5 x \Rightarrow 0.3 x + 0.3 y = 0$ so an eigenvector is $(\begin{matrix} 1 \\ - 1 \end{matrix})$ A1

Note: Accept multiples of the stated eigenvectors.

[5 marks]

$P = (\begin{matrix} 3 & 1 \\ 2 & - 1 \end{matrix})$ OR $P = (\begin{matrix} 1 & 3 \\ - 1 & 2 \end{matrix})$ A1

Note: Examiners should be aware that different, correct, matrices $P$ may be seen.

[1 mark]

c.i.

$D = (\begin{matrix} 1 & 0 \\ 0 & 0.5 \end{matrix})$ OR $D = (\begin{matrix} 0.5 & 0 \\ 0 & 1 \end{matrix})$ A1

Note: $P$ and $D$ must be consistent with each other.

[1 mark]

c.ii.

$0 . 5^{n} \to 0$ (M1)

$D^{n} = (\begin{matrix} 1 & 0 \\ 0 & 0 \end{matrix})$ OR $D^{n} = (\begin{matrix} 0 & 0 \\ 0 & 1 \end{matrix})$ (A1)

Note: Award A1 only if their $D^{n}$ corresponds to their $P$

$P D^{n} P^{- 1} = (\begin{matrix} 0.6 & 0.6 \\ 0.4 & 0.4 \end{matrix})$ (M1)

$60 %$ A1

[4 marks]

Examiners report

[N/A]

c.i.

[N/A]

c.ii.

[N/A]

A continuous random variable $X$ has probability density function $f$ given by

$f(x) = \left\{ {\begin{array}{*{20}{l}} {\frac{{{x^2}}}{a} + b,}&{0 \leqslant x \leqslant 4} \\ 0&{{\text{otherwise}}} \end{array}} \right.{\text{where }}a{\text{ and }}b{\text{ are positive constants.}}$

It is given that ${\text{P}}(X \geqslant 2) = 0.75$ .

Eight independent observations of $X$ are now taken and the random variable $Y$ is the number of observations such that $X \geqslant 2$ .

Show that $a = 32$ and $b = \frac{1}{{12}}$ .

[5]

Find ${\text{E}}(X)$ .

[2]

Find ${\text{Var}}(X)$ .

[2]

Find the median of $X$ .

[3]

Find ${\text{E}}(Y)$ .

[2]

Find ${\text{P}}(Y \geqslant 3)$ .

[1]

Markscheme

* This question is from an exam for a previous syllabus, and may contain minor differences in marking or structure.

$\int\limits_0^4 {\left( {\frac{{{x^2}}}{a} + b} \right){\text{d}}x = 1 \Rightarrow \left[ {\frac{{{x^3}}}{{3a}} + bx} \right]_0^4 = 1 \Rightarrow \frac{{64}}{{3a}} + 4b = 1}$ M1A1

$\int\limits_2^4 {\left( {\frac{{{x^2}}}{a} + b} \right){\text{d}}x = 0.75 \Rightarrow \frac{{56}}{{3a}} + 2b = 0.75}$ M1A1

Note: $\int\limits_0^2 {\left( {\frac{{{x^2}}}{a} + b} \right)} \,dx = 0.25 \Rightarrow \frac{8}{{3a}} + 2b = 0.25$ could be seen/used in place of either of the above equations.

evidence of an attempt to solve simultaneously (or check given a,b values are consistent) M1

$a = 32,{\text{ }}b = \frac{1}{{12}}$ AG

[5 marks]

${\text{E}}\left( X \right) = \int\limits_0^4 {x\left( {\frac{{{x^2}}}{{32}} + \frac{1}{{12}}} \right){\text{d}}x}$ (M1)

${\text{E}}(X) = \frac{8}{3}\,\,\,( = 2.67)$ A1

[2 marks]

${\text{E}}\left( {{X^2}} \right) = \int\limits_0^4 {{x^2}\left( {\frac{{{x^2}}}{{32}} + \frac{1}{{12}}} \right){\text{d}}x}$ (M1)

${\text{Var}}(X) = {\text{E}}({X^2}) - {[{\text{E}}(X)]^2} = \frac{{16}}{{15}}\,\,\,( = 1.07)$ A1

[2 marks]

$\int\limits_0^m {\left( {\frac{{{x^2}}}{{32}} + \frac{1}{{12}}} \right){\text{d}}x = 0.5}$ (M1)

$\frac{{{m^3}}}{{96}} + \frac{m}{{12}} = 0.5\,\,\,( \Rightarrow {m^3} + 8m - 48 = 0)$ (A1)

$m = 2.91$ A1

[3 marks]

$Y \sim B(8,{\text{ }}0.75)$ (M1)

${\text{E}}(Y) = 8 \times 0.75 = 6$ A1

[2 marks]

${\text{P}}(Y \geqslant 3) = 0.996$ A1

[1 mark]

Examiners report

[N/A]

Arianne plays a game of darts.

The distance that her darts land from the centre, $O$ , of the board can be modelled by a normal distribution with mean $10 cm$ and standard deviation $3 cm$ .

Find the probability that

Each of Arianne’s throws is independent of her previous throws.

In a competition a player has three darts to throw on each turn. A point is scored if a player throws all three darts to land within a central area around $O$ . When Arianne throws a dart the probability that it lands within this area is $0.8143$ .

In the competition Arianne has ten turns, each with three darts.

a dart lands less than $13 cm$ from $O$ .

[2]

a.i.

a dart lands more than $15 cm$ from $O$ .

[1]

a.ii.

Find the probability that Arianne throws two consecutive darts that land more than $15 cm$ from $O$ .

[2]

Find the probability that Arianne does not score a point on a turn of three darts.

[2]

Find Arianne’s expected score in the competition.

[4]

d.i.

Find the probability that Arianne scores at least $5$ points in the competition.

[1]

d.ii.

Find the probability that Arianne scores at least $5$ points and less than $8$ points.

[2]

d.iii.

Given that Arianne scores at least $5$ points, find the probability that Arianne scores less than $8$ points.

[2]

d.iv.

Markscheme

Let $X$ be the random variable “distance from $O$ ”.

$X ~ N (10, 3^{2})$

$P (X < 13) = 0.841 (0.841344 \dots)$ (M1)(A1)

[2 marks]

a.i.

$(P (X > 15) =) 0.0478 (0.0477903)$ A1

[1 mark]

a.ii.

$P (X > 15) \times P (X > 15)$ (M1)

$= 0.00228 (0.00228391 \dots)$ A1

[2 marks]

$1 - {(0.8143)}^{3}$ (M1)

$0.460 (0.460050 \dots)$ A1

[2 marks]

let $Y$ be the random variable “number of points scored”

evidence of use of binomial distribution (M1)

$Y ~ B (10, 0.539949 \dots)$ (A1)

$(E (Y) =) 10 \times 0.539949 \dots$ (M1)

$= 5.40$ A1

[4 marks]

d.i.

$(P (Y \geq 5) =) 0.717 (0.716650 \dots)$ A1

[1 mark]

d.ii.

$P (5 \leq Y < 8)$ (M1)

$= 0.628 (0.627788 \dots)$ A1

Note: Award M1 for a correct probability statement or indication of correct lower and upper bounds, $5$ and $7$ .

[2 marks]

d.iii.

$\frac{P (5 \leq Y < 8)}{P (Y \geq 5)} (= \frac{0.627788 \dots}{0.716650 \dots})$ (M1)

$= 0.876 (0.876003 \dots)$ A1

[2 marks]

d.iv.

Examiners report

Most candidates started this question well and applied the normal distribution correctly in part (a). The same goes for part (b) where the candidates were able to combine probabilities correctly. Part (c) was not very well done, and there were a surprising number of incorrect approaches on a seemingly straightforward problem. This suggests that candidates were not interpreting the problem correctly and there was a lack of careful reading to be sure of the scenario being described. In part (d) most candidates recognized the need to model the situation with the binomial distribution. However, many candidates did not choose a correct probability for the Bernoulli trial in this question and oversimplified the problem. This again seems to be a problem of “interpretation” rather than “conceptual understanding”.

a.i.

a.ii.

d.i.

d.ii.

d.iii.

d.iv.

A Chocolate Shop advertises free gifts to customers that collect three vouchers. The vouchers are placed at random into 10% of all chocolate bars sold at this shop. Kati buys some of these bars and she opens them one at a time to see if they contain a voucher. Let ${\text{P}}(X = n)$ be the probability that Kati obtains her third voucher on the $n{\text{th}}$ bar opened.

(It is assumed that the probability that a chocolate bar contains a voucher stays at 10% throughout the question.)

It is given that ${\text{P}}(X = n) = \frac{{{n^2} + an + b}}{{2000}} \times {0.9^{n - 3}}$ for $n \geqslant 3,{\text{ }}n \in \mathbb{N}$ .

Kati’s mother goes to the shop and buys $x$ chocolate bars. She takes the bars home for Kati to open.

Show that ${\text{P}}(X = 3) = 0.001$ and ${\text{P}}(X = 4) = 0.0027$ .

[3]

Find the values of the constants $a$ and $b$ .

[5]

Deduce that $\frac{{{\text{P}}(X = n)}}{{{\text{P}}(X = n - 1)}} = \frac{{0.9(n - 1)}}{{n - 3}}$ for $n > 3$ .

[4]

(i) Hence show that $X$ has two modes ${m_1}$ and ${m_2}$ .

(ii) State the values of ${m_1}$ and ${m_2}$ .

[5]

Determine the minimum value of $x$ such that the probability Kati receives at least one free gift is greater than 0.5.

[3]

Markscheme

* This question is from an exam for a previous syllabus, and may contain minor differences in marking or structure.

${\text{P}}(X = 3) = {(0.1)^3}$ A1

$= 0.001$ AG

${\text{P}}(X = 4) = {\text{P}}(VV\bar VV) + {\text{P}}(V\bar VVV) + {\text{P}}(\bar VVVV)$ (M1)

$= 3 \times {(0.1)^3} \times 0.9$ (or equivalent) A1

$= 0.0027$ AG

[3 marks]

METHOD 1

attempting to form equations in $a$ and $b$ M1

$\frac{{9 + 3a + b}}{{2000}} = \frac{1}{{1000}}{\text{ }}(3a + b = - 7)$ A1

$\frac{{16 + 4a + b}}{{2000}} \times \frac{9}{{10}} = \frac{{27}}{{10\,000}}{\text{ }}(4a + b = - 10)$ A1

attempting to solve simultaneously (M1)

$a = - 3,{\text{ }}b = 2$ A1

METHOD 2

${\text{P}}(X = n) = \left( {\begin{array}{*{20}{c}} {n - 1} \\ 2 \end{array}} \right) \times {0.1^3} \times {0.9^{n - 3}}$ M1

$= \frac{{(n - 1)(n - 2)}}{{2000}} \times {0.9^{n - 3}}$ (M1)A1

$= \frac{{{n^2} - 3n + 2}}{{2000}} \times {0.9^{n - 3}}$ A1

$a = - 3,b = 2$ A1

Note: Condone the absence of ${0.9^{n - 3}}$ in the determination of the values of $a$ and $b$ .

[5 marks]

METHOD 1

EITHER

${\text{P}}(X = n) = \frac{{{n^2} - 3n + 2}}{{2000}} \times {0.9^{n - 3}}$ (M1)

${\text{P}}(X = n) = \left( {\begin{array}{*{20}{c}} {n - 1} \\ 2 \end{array}} \right) \times {0.1^3} \times {0.9^{n - 3}}$ (M1)

THEN

$= \frac{{(n - 1)(n - 2)}}{{2000}} \times {0.9^{n - 3}}$ A1

${\text{P}}(X = n - 1) = \frac{{(n - 2)(n - 3)}}{{2000}} \times {0.9^{n - 4}}$ A1

$\frac{{{\text{P}}(X = n)}}{{{\text{P}}(X = n - 1)}} = \frac{{(n - 1)(n - 2)}}{{(n - 2)(n - 3)}} \times 0.9$ A1

$= \frac{{0.9(n - 1)}}{{n - 3}}$ AG

METHOD 2

$\frac{{{\text{P}}(X = n)}}{{{\text{P}}(X = n - 1)}} = \frac{{\frac{{{n^2} - 3n + 2}}{{2000}} \times {{0.9}^{n - 3}}}}{{\frac{{{{(n - 1)}^2} - 3(n - 1) + 2}}{{2000}} \times {{0.9}^{n - 4}}}}$ (M1)

$= \frac{{0.9({n^2} - 3n + 2)}}{{({n^2} - 5n + 6)}}$ A1A1

Note: Award A1 for a correct numerator and A1 for a correct denominator.

$= \frac{{0.9(n - 1)(n - 2)}}{{(n - 2)(n - 3)}}$ A1

$= \frac{{0.9(n - 1)}}{{n - 3}}$ AG

[4 marks]

(i) attempting to solve $\frac{{0.9(n - 1)}}{{n - 3}} = 1$ for $n$ M1

$n = 21$ A1

$\frac{{0.9(n - 1)}}{{n - 3}} < 1 \Rightarrow n > 21$ R1

$\frac{{0.9(n - 1)}}{{n - 3}} > 1 \Rightarrow n < 21$ R1

$X$ has two modes AG

Note: Award R1R1 for a clearly labelled graphical representation of the two inequalities (using $\frac{{{\text{P}}(X = n)}}{{{\text{P}}(X = n - 1)}}$ ).

(ii) the modes are 20 and 21 A1

[5 marks]

METHOD 1

$Y \sim {\text{B}}(x,{\text{ }}0.1)$ (A1)

attempting to solve ${\text{P}}(Y \geqslant 3) > 0.5$ (or equivalent eg $1 - {\text{P}}(Y \leqslant 2) > 0.5$ ) for $x$ (M1)

Note: Award (M1) for attempting to solve an equality (obtaining $x = 26.4$ ).

$x = 27$ A1

METHOD 2

$\sum\limits_{n = 0}^x {{\text{P}}(X = n) > 0.5}$ (A1)

attempting to solve for $x$ (M1)

$x = 27$ A1

[3 marks]

Examiners report

[N/A]

The weights, in grams, of individual packets of coffee can be modelled by a normal distribution, with mean $102 g$ and standard deviation $8 g$ .

Find the probability that a randomly selected packet has a weight less than $100 g$ .

[2]

The probability that a randomly selected packet has a weight greater than $w$ grams is $0.444$ . Find the value of $w$ .

[2]

A packet is randomly selected. Given that the packet has a weight greater than $105 g$ , find the probability that it has a weight greater than $110 g$ .

[3]

From a random sample of $500$ packets, determine the number of packets that would be expected to have a weight lying within $1.5$ standard deviations of the mean.

[3]

Packets are delivered to supermarkets in batches of $80$ . Determine the probability that at least $20$ packets from a randomly selected batch have a weight less than $95 g$ .

[4]

Markscheme

* This question is from an exam for a previous syllabus, and may contain minor differences in marking or structure.

$Χ ~ N (102, 8^{2})$

$P (Χ < 100) = 0.401$ (M1)A1

[2 marks]

$P (Χ > w) = 0.444$ (M1)

$\Rightarrow w = 103 (g)$ A1

[2 marks]

$P (Χ > 100 Χ > 105) = \frac{P (Χ > 100 \cap Χ > 105)}{P (Χ > 105)}$ (M1)

$= \frac{P (Χ > 100)}{P (Χ > 105)}$ (A1)

$= \frac{0.15865 \dots}{0.35383 \dots}$

$= 0.448$ A1

[3 marks]

EITHER

$P (90 < Χ < 114) = 0.866 \dots$ (A1)

$P (- 1.5 < Z < 1.5) = 0.866 \dots$ (A1)

THEN

$0.866 \dots \times 500$ (M1)

$= 433$ A1

[3 marks]

$p = P (Χ < 95) = 0.19078 \dots$ (A1)

recognising $Y ~ B (80, p)$ (M1)

now using $Y ~ B (80, 0.19078 \dots)$ (M1)

$P (Y \geq 20) = 0.116$ A1

[4 marks]

Examiners report

[N/A]

Hank sets up a bird table in his garden to provide the local birds with some food. Hank notices that a specific bird, a large magpie, visits several times per month and he names him Bill. Hank models the number of times per month that Bill visits his garden as a Poisson distribution with mean $3.1$ .

Over the course of $3$ consecutive months, find the probability that Bill visits the garden:

Using Hank’s model, find the probability that Bill visits the garden on exactly four occasions during one particular month.

[1]

on exactly $12$ occasions.

[2]

b.i.

during the first and third month only.

[3]

b.ii.

Find the probability that over a $12$ -month period, there will be exactly $3$ months when Bill does not visit the garden.

[4]

After the first year, a number of baby magpies start to visit Hank’s garden. It may be assumed that each of these baby magpies visits the garden randomly and independently, and that the number of times each baby magpie visits the garden per month is modelled by a Poisson distribution with mean $2.1$ .

Determine the least number of magpies required, including Bill, in order that the probability of Hank’s garden having at least $30$ magpie visits per month is greater than $0.2$ .

[4]

Markscheme

$X_{1} ~ Po (3.1)$

$P (X_{1} = 4) = 0.173 (0.173349 \dots)$ A1

[1 mark]

$X_{2} ~ Po (3 \times 3.1) = Po (9.3)$ (M1)

$P (X_{2} = 12) = 0.0799 (0.0798950 \dots)$ A1

[2 marks]

b.i.

${(P (X_{1} > 0))}^{2} \times P (X_{1} = 0)$ (M1)

$0 . 95495^{2} \times 0.04505$ (A1)

$= 0.0411 (0.0410817 \dots)$ A1

[3 marks]

b.ii.

$P (X_{1} = 0) = 0.04505$ (A1)

$X_{1} ~ B (12, 0.04505)$ (M1)(A1)

Note: Award M1 for recognizing binomial probability, and A1 for correct parameters.

$= 0.0133 (0.013283 \dots)$ A1

[4 marks]

METHOD ONE

(M1)(A1)(A1)

Note: Award M1 for evidence of a cumulative Poisson with $λ = 3.1 + 2.1 n$ , A1 for $0.136705$ and A1 for $0.253384$ .

so require $12$ magpies (including Bill) A1

METHOD TWO

evidence of a cumulative Poisson with $λ = 3.1 + 2.1 n$ (M1)

sketch of curve and $y = 0.2$ (A1)

(intersect at) $10.5810 \dots$ (A1)

rounding up gives $n = 11$

so require $12$ magpies (including Bill) A1

[4 marks]

Examiners report

[N/A]

b.i.

[N/A]

b.ii.

[N/A]

It is given that one in five cups of coffee contain more than 120 mg of caffeine.
It is also known that three in five cups contain more than 110 mg of caffeine.

Assume that the caffeine content of coffee is modelled by a normal distribution.
Find the mean and standard deviation of the caffeine content of coffee.

Markscheme

* This question is from an exam for a previous syllabus, and may contain minor differences in marking or structure.

let $X$ be the random variable “amount of caffeine content in coffee”

${\text{P}}(X > 120) = 0.2,{\text{ P}}(X > 110) = 0.6$ (M1)

$( \Rightarrow {\text{P}}(X < 120) = 0.8,{\text{ P}}(X < 110) = 0.4)$

Note: Award M1 for at least one correct probability statement.

$\frac{{120 - \mu }}{\sigma } = 0.84162 \ldots ,{\text{ }}\frac{{110 - \mu }}{\sigma } = - 0.253347 \ldots$ (M1)(A1)(A1)

Note: Award M1 for attempt to find at least one appropriate $z$ -value.

$120 - \mu = 0.84162\sigma ,{\text{ }}110 - \mu = - 0.253347\sigma$

attempt to solve simultaneous equations (M1)

$\mu = 112,{\text{ }}\sigma = 9.13$ A1

[6 marks]

Examiners report

[N/A]

John likes to go sailing every day in July. To help him make a decision on whether it is safe to go sailing he classifies each day in July as windy or calm. Given that a day in July is calm, the probability that the next day is calm is 0.9. Given that a day in July is windy, the probability that the next day is calm is 0.3. The weather forecast for the 1st July predicts that the probability that it will be calm is 0.8.

Draw a tree diagram to represent this information for the first three days of July.

[3]

Find the probability that the 3rd July is calm.

[2]

Find the probability that the 1st July was calm given that the 3rd July is windy.

[4]

Markscheme

* This question is from an exam for a previous syllabus, and may contain minor differences in marking or structure.

M17/5/MATHL/HP2/ENG/TZ2/05.a/M M1A2

Note: Award M1 for 3 stage tree-diagram, A2 for 0.8, 0.9, 0.3 probabilities correctly placed.

[3 marks]

$0.2 \times 0.7 \times 0.3 + 0.2 \times 0.3 \times 0.9 + 0.8 \times 0.1 \times 0.3 + 0.8 \times 0.9 \times 0.9 = 0.768$ (M1)A1

[2 marks]

${\text{P}}({\text{1st July is calm | 3rd July is windy)}} = \frac{{{\text{P}}({\text{1st July is calm and 3rd July is windy)}}}}{{{\text{P}}({\text{3rd July is windy)}}}}$ (M1)

$= \frac{{0.8 \times 0.1 \times 0.7 + 0.8 \times 0.9 \times 0.1}}{{1 - 0.768}}$

OR $\,\,\,\,\,$ $\frac{{0.8 \times 0.1 \times 0.7 + 0.8 \times 0.9 \times 0.1}}{{0.2 \times 0.7 \times 0.7 + 0.2 \times 0.3 \times 0.1 + 0.8 \times 0.1 \times 0.7 + 0.8 \times 0.9 \times 0.1}}$

OR $\,\,\,\,\,$ $\frac{{0.128}}{{0.232}}$ (A1)(A1)

Note: Award A1 for correct numerator, A1 for correct denominator.

$= 0.552$ A1

[4 marks]

Examiners report

[N/A]

The curve $y = f\left( x \right)$ is shown in the graph, for $0 \leqslant x \leqslant 10$ .

The curve $y = f\left( x \right)$ passes through the following points.

It is required to find the area bounded by the curve, the $x$ -axis, the $y$ -axis and the line $x = 10$ .

One possible model for the curve $y = f\left( x \right)$ is a cubic function.

Use the trapezoidal rule to find an estimate for the area.

[3]

Use all the coordinates in the table to find the equation of the least squares cubic regression curve.

[3]

b.i.

Write down the coefficient of determination.

[1]

b.ii.

Write down an expression for the area enclosed by the cubic regression curve, the $x$ -axis, the $y$ -axis and the line $x = 10$ .

[1]

c.i.

Find the value of this area.

[2]

c.ii.

Markscheme

Area = $\frac{2}{2}\left( {2 + 2\left( {4.5 + 4.2 + 3.3 + 4.5} \right) + 8} \right)$ M1A1

Area = 43 A1

[3 marks]

$y = 0.0389{x^3} - 0.534{x^2} + 2.06x + 2.06$ M1A2

[3 marks]

b.i.

${R^2} = 0.991$ A1

[1 mark]

b.ii.

Area = $\int\limits_0^{10} {y\,dx}$ A1

[1 mark]

c.i.

42.5 A2

[2 marks]

c.ii.

Examiners report

[N/A]

b.i.

[N/A]

b.ii.

[N/A]

c.i.

[N/A]

c.ii.

A café serves sandwiches and cakes. Each customer will choose one of the following three options; buy only a sandwich, buy only a cake or buy both a sandwich and a cake.

The probability that a customer buys a sandwich is 0.72 and the probability that a customer buys a cake is 0.45.

Find the probability that a customer chosen at random will buy

On a typical day 200 customers come to the café.

It is known that 46 % of the customers who come to the café are male, and that 80 % of these buy a sandwich.

both a sandwich and a cake.

[3]

a.i.

only a sandwich.

[1]

a.ii.

Find the expected number of cakes sold on a typical day.

[1]

b.i.

Find the probability that more than 100 cakes will be sold on a typical day.

[3]

b.ii.

A customer is selected at random. Find the probability that the customer is male and buys a sandwich.

[1]

c.i.

A female customer is selected at random. Find the probability that she buys a sandwich.

[4]

c.ii.

Markscheme

use of formula or Venn diagram (M1)

0.72 + 0.45 − 1 (A1)

= 0.17 A1

[3 marks]

a.i.

0.72 − 0.17 = 0.55 A1

[1 mark]

a.ii.

200 × 0.45 = 90 A1

[1 mark]

b.i.

let X be the number of customers who order cake

X ~ B(200,0.45) (M1)

P(X > 100) = P(X ≥ 101)(= 1 − P(X ≤ 100)) (M1)

= 0.0681 A1

[3 marks]

b.ii.

0.46 × 0.8 = 0.368 A1

[1 mark]

c.i.

METHOD 1

$0.368 + 0.54 \times {\text{P}}\left( {S\left| F \right.} \right) = 0.72$ M1A1A1

Note: Award M1 for an appropriate tree diagram. Award M1 for LHS, M1 for RHS.

${\text{P}}\left( {S\left| F \right.} \right) = 0.652$ A1

METHOD 2

${\text{P}}\left( {S\left| F \right.} \right) = \frac{{{\text{P}}\left( {S \cap F} \right)}}{{{\text{P}}\left( F \right)}}$ (M1)

$= \frac{{0.72 - 0.368}}{{0.54}}$ A1A1

Note: Award A1 for numerator, A1 for denominator.

${\text{P}}\left( {S\left| F \right.} \right) = 0.652$ A1

[4 marks]

c.ii.

Examiners report

[N/A]

a.i.

[N/A]

a.ii.

[N/A]

b.i.

[N/A]

b.ii.

[N/A]

c.i.

[N/A]

c.ii.

Dana has collected some data regarding the heights $h$ (metres) of waves against a pier at $50$ randomly chosen times in a single day. This data is shown in the table below.

She wishes to perform a $χ^{2}$ -test at the $5 %$ significance level to see if the height of waves could be modelled by a normal distribution. Her null hypothesis is

$H_{0} :$ The data can be modelled by a normal distribution.

From the table she calculates the mean of the heights in her sample to be $0.828 m$ and the standard deviation of the heights $s_{n}$ to be $0.257 m$ .

She calculates the expected values for each interval under this null hypothesis, and some of these values are shown in the table below.

Use the given value of $s_{n}$ to find the value of $s_{n - 1}$ .

[2]

Find the value of $a$ and the value of $b$ , giving your answers correct to one decimal place.

[3]

Find the value of the $χ^{2}$ test statistic $(χ_{c a l c}^{2})$ for this test.

[2]

Determine the degrees of freedom for Dana’s test.

[2]

It is given that the critical value for this test is $9.49$ .

State the conclusion of the test in context. Use your answer to part (c) to justify your conclusion.

[2]

Markscheme

$s_{n - 1} = \sqrt{\frac{50}{49}} \times 0.257$ (M1)

Note: M1 is for the use of the correct formula

$= 0.260$ A1

[2 marks]

Using $\bar{x} = 0.828$ and $s_{n - 1} = 0.260$ (M1)

$a = 7.3, b = 7.6$ A1A1

[3 marks]

$χ_{c a l c}^{2} = 3.35$ (M1)A1

[2 marks]

Combining columns with expected values less than $5$ leaves $7$ cells (M1)

$7 - 1 - 2 = 4$ A1

[2 marks]

$3.35 < 9.49$ R1

hence insufficient evidence to reject $H_{0}$ that the heights of the waves are normally distributed. A1

Note: The A1 can be awarded independently of the R1.

[2 marks]

Examiners report

[N/A]

The times taken for male runners to complete a marathon can be modelled by a normal distribution with a mean 196 minutes and a standard deviation 24 minutes.

It is found that 5% of the male runners complete the marathon in less than ${T_1}$ minutes.

The times taken for female runners to complete the marathon can be modelled by a normal distribution with a mean 210 minutes. It is found that 58% of female runners complete the marathon between 185 and 235 minutes.

Find the probability that a runner selected at random will complete the marathon in less than 3 hours.

[2]

Calculate ${T_1}$ .

[2]

Find the standard deviation of the times taken by female runners.

[4]

Markscheme

* This question is from an exam for a previous syllabus, and may contain minor differences in marking or structure.

$T \sim N(196,{\text{ }}{24^2})$

${\text{P}}(T < 180) = 0.252$ (M1)A1

[2 marks]

${\text{P}}(T < {T_1}) = 0.05$ (M1)

${T_1} = 157$ A1

[2 marks]

$F \sim N(210,{\text{ }}{\sigma ^2})$

${\text{P}}(F < 235) = 0.79$ (M1)

$\frac{{235 - 210}}{\sigma } = 0.806421$ or equivalent (M1)(A1)

$\sigma = 31.0$ A1

[4 marks]

Examiners report

[N/A]

A Principal would like to compare the students in his school with a national standard. He decides to give a test to eight students made up of four boys and four girls. One of the teachers offers to find the volunteers from his class.

The marks out of $40$ , for the students who took the test, are:

$25, 29, 38, 37, 12, 18, 27, 31 .$

For the eight students find

The national standard mark is $25.2$ out of $40$ .

Two additional students take the test at a later date and the mean mark for all ten students is $28.1$ and the standard deviation is $8.4$ .

For further analysis, a standardized score out of $100$ for the ten students is obtained by multiplying the scores by $2$ and adding $20$ .

For the ten students, find

Name the type of sampling that best describes the method used by the Principal.

[1]

the mean mark.

[2]

b.i.

the standard deviation of the marks.

[1]

b.ii.

Perform an appropriate test at the $5 %$ significance level to see if the mean marks achieved by the students in the school are higher than the national standard. It can be assumed that the marks come from a normal population.

[5]

State one reason why the test might not be valid.

[1]

their mean standardized score.

[1]

e.i.

the standard deviation of their standardized score.

[2]

e.ii.

Markscheme

quota A1

[1 mark]

$27.125 \approx 27.1$ (M1)A1

[2 marks]

b.i.

$8.29815 \dots \approx 8.30$ A1

[1 mark]

b.ii.

(let $μ$ be the national mean)

$H_{0} : μ = 25.2$

$H_{1} : μ > 25.2$ A1

Note: Accept hypotheses in words if they are clearly expressed and ‘population mean’ or ‘school mean’ is referred to. Do not accept $H_{0} : μ = μ_{0}$ unless $μ_{0}$ is explicitly defined as “national standard mark” or given as $25.2$ .

recognizing $t$ -test (M1)

$p$ -value $= 0.279391 \dots$ A1

$0.279391 \dots > 0.05$ R1

Note: The R1 mark is for the comparison of their $p$ -value with $0.05$ .

insufficient evidence to reject the null hypothesis (that the mean for the school is $25.2$ ) A1

Note: Award the final A1 only if the null hypothesis is also correct (e.g. $μ_{0} = 25.2$ or (population) mean $= 25.2$ ) and the conclusion is consistent with both the direction of the inequality and the alternative hypothesis.

[5 marks]

EITHER

the sampling process is not random R1

For example:

the school asked for volunteers

the students were selected from a single class

the quota might not be representative of the student population R1

For example:

the school may have only $4$ boys and $400$ girls.

Note: Do not accept ‘the sample is too small’.

[1 mark]

$(28.1 \times 2 + 20 =) 76.2$ A1

[1 mark]

e.i.

$8.4 \times 2$ (A1)

$= 16.8$ A1

[2 marks]

e.ii.

Examiners report

The most common answer to this question was ‘convenience sampling’. Though it is a convenience sample because four boys and four girls were required the most appropriate response was ‘quota sampling’.

Some candidates still try to calculate a mean and standard deviation by hand. This is not expected.

b.i.

[N/A]

b.ii.

This was surprisingly poorly answered given that statistical testing forms a large part of the course. Candidates need to give the null and alternative hypotheses, find a p-value, compare this to the significance level and write their conclusion, in context of the question; examination questions may ask for each element individually or the question may say “Perform the test” wherein it is expected that each individual element will be clearly stated (as the test is incomplete if any are omitted). Many candidates had the null hypothesis as an inequality. The easiest way to write the null hypothesis is $H_{0} : μ=25.2$ , but it could also be stated in words so long as it is clear that the population mean is being referred to rather than the sample mean. For example, H₀: The mean score of the whole school is equal to 25.2.

The answer that the sample was self-selecting or unrepresentative was the expected response. The sample being small was also accepted if the additional reason of therefore ‘not able to assume a normal population’ was also given. In general, a small sample can be valid (though will probably not be reliable).

Some candidates missed the point of this question, that it was concerned with transformations of the mean and standard deviation, and instead tried to work out the actual values for the extra two candidates.

e.i.

[N/A]

e.ii.

The number of marathons that Audrey runs in any given year can be modelled by a Poisson distribution with mean 1.3 .

Calculate the probability that Audrey will run at least two marathons in a particular year.

[2]

Find the probability that she will run at least two marathons in exactly four out of the following five years.

[4]

Markscheme

$X \sim {\text{Po}}\left( {1.3} \right)$

${\text{P}}\left( {X \geqslant 2} \right) = 0.373$ (M1)A1

[2 marks]

$V \sim {\text{B}}\left( {5{\text{, }}0.373} \right)$ (M1)A1

Note: Award (M1) for recognition of binomial or equivalent, A1 for correct parameters.

${\text{P}}\left( {V = 4} \right) = 0.0608$ (M1)A1

[4 marks]

Examiners report

[N/A]

Packets of biscuits are produced by a machine. The weights $X$ , in grams, of packets of biscuits can be modelled by a normal distribution where $X \sim {\text{N}}(\mu ,{\text{ }}{\sigma ^2})$ . A packet of biscuits is considered to be underweight if it weighs less than 250 grams.

The manufacturer makes the decision that the probability that a packet is underweight should be 0.002. To do this $\mu$ is increased and $\sigma$ remains unchanged.

The manufacturer is happy with the decision that the probability that a packet is underweight should be 0.002, but is unhappy with the way in which this was achieved. The machine is now adjusted to reduce $\sigma$ and return $\mu$ to 253.

Given that $\mu = 253$ and $\sigma = 1.5$ find the probability that a randomly chosen packet of biscuits is underweight.

[2]

Calculate the new value of $\mu$ giving your answer correct to two decimal places.

[3]

Calculate the new value of $\sigma$ .

[2]

Markscheme

* This question is from an exam for a previous syllabus, and may contain minor differences in marking or structure.

${\text{P}}(X < 250) = 0.0228$ (M1)A1

[2 marks]

$\frac{{250 - \mu }}{{1.5}} = - 2.878 \ldots$ (M1)(A1)

$\Rightarrow \mu = 254.32$ A1

Notes: Only award A1 here if the correct 2dp answer is seen. Award M0 for use of ${1.5^2}$ .

[3 marks]

$\frac{{250 - 253}}{\sigma } = - 2.878 \ldots$ (A1)

$\Rightarrow \sigma = 1.04$ A1

[2 marks]

Examiners report

[N/A]

In a reforested area of pine trees, heights of trees planted in a specific year seem to follow a normal distribution. A sample of 100 such trees is selected to test the validity of this hypothesis. The results of measuring tree heights, to the nearest centimetre, are recorded in the first two columns of the table below.

Describe what is meant by

a goodness of fit test (a complete explanation required);

[2]

a.i.

the level of significance of a hypothesis test.

[1]

a.ii.

Find the mean and standard deviation of the sample data in the table above. Show how you arrived at your answers.

[4]

Most of the expected frequencies have been calculated in the third column. (Frequencies have been rounded to the nearest integer, and frequencies in the first and last classes have been extended to include the rest of the data beyond 15 and 225. Find the values of $a$ , $b$ and $c$ and show how you arrived at your answers.

[4]

In order to test for the goodness of fit, the test statistic was calculated to be 1.0847. Show how this was done.

[3]

State your hypotheses, critical number, decision rule and conclusion (using a 5% level of significance).

[5]

Markscheme

A goodness of fit test is a statistical test of the hypothesis that a set of observed counts of $k$ cells of a certain large population is consistent with a set of theoretical counts. (R1)

The test statistic has a ${\chi ^2}$ distribution with $k - n$ degrees of freedom. One degree of freedom is lost for every parameter that has to be estimated from the sample. (R1)

[2 marks]

a.i.

The level of significance of a hypothesis test is the maximal probability that we reject a true null hypothesis. (R1)

[1 mark]

a.ii.

We use the class midpoints in the calculation of the mean and standard deviation.

$\bar x = \frac{{\sum {{x_i}f\left( {{x_i}} \right)} }}{{\sum {f\left( {{x_i}} \right)} }} = \frac{{30 \times 6 + 60 \times 11 + 90 \times 15 + \ldots }}{{100}} = \frac{{13350}}{{100}}$ (M1)

= 133.5 (A1)

${\text{s}} = \sqrt {\frac{{\sum {x_i^2f\left( {{x_i}} \right)} }}{{\sum {f\left( {{x_i}} \right)} }} - {{\left( {\bar x} \right)}^2}} = \sqrt {\frac{{900 \times 6 + 3600 \times 11 + \ldots }}{{100}} - {{133.5}^2}}$ (M1)

= 56.345 (= 56.3 to 3 sf) (A1)

[4 marks]

Every frequency is the product of the number of observations and the probability of a number in each class. Since by hypothesis we have a normal distribution, the probabilities can be read from a normal table with mean 133.5 and standard deviation 56.345 (M1)

E₁ = 100 × P(45 ≤ $x$ ≤ 75) ≈ 9 so $a$ = 9 (A1)

E₂ = 100 × P(135 ≤ $x$ ≤ 165) ≈ 20 so b = 20 (A1)

E₃ = 100 × P(195 ≤ $x$ ≤ 225) ≈ 9 so $c$ = 9 (A1)

[4 marks]

The test statistic is a ${\chi ^2}$ variable. Hence (M1)

${\chi ^2} = \sum {\frac{{{{\left( {{f_e} - {f_o}} \right)}^2}}}{{{f_e}}}} = \frac{{{{\left( {6 - 6} \right)}^2}}}{6} + \frac{{{{\left( {9 - 11} \right)}^2}}}{9} + \ldots \frac{{{{\left( {5 - 6} \right)}^2}}}{5}$ (M1)

= 1.0847 (A1)

[3 marks]

H₀: The distribution of tree heights is normally distributed

H₁: The distribution is not normal (M1)

Since the mean and standard deviation were estimated from the sample, the number of degrees of freedom is 8 – 1 – 2 = 5 (A1)

The critical number is $\chi _{5{\text{,}}\,\,0.05}^2$ = 110705

If ${\chi ^2}$ > 11.0705 we reject H₀ (A1)

Since ${\chi ^2}$ = 1.0847 < 11.0705, we fail to reject H₀ (R1)

Conclusion: we do not have enough evidence to claim that the distribution of tree heights is not normal (R1)

[5 marks]

Examiners report

[N/A]

a.i.

[N/A]

a.ii.

[N/A]

The random variable X has a normal distribution with mean μ = 50 and variance σ ² = 16 .

Sketch the probability density function for X, and shade the region representing P(μ − 2σ < X < μ + σ).

[2]

Find the value of P(μ − 2σ < X < μ + σ).

[2]

Find the value of k for which P(μ − kσ < X < μ + kσ) = 0.5.

[2]

Markscheme

* This question is from an exam for a previous syllabus, and may contain minor differences in marking or structure.

normal curve centred on 50 A1

vertical lines at $x$ = 42 and $x$ = 54, with shading in between A1

[2 marks]

P(42 < X < 54) (= P(− 2 < Z < 1)) (M1)

= 0.819 A1

[2 marks]

P(μ − kσ < X < μ + kσ) = 0.5 ⇒ P(X < μ + kσ) = 0.75 (M1)

k = 0.674 A1

Note: Award M1A0 for k = −0.674.

[2 marks]

Examiners report

[N/A]

Loreto is a manager at the Da Vinci health centre. If the mean rate of patients arriving at the health centre exceeds $1.5$ per minute then Loreto will employ extra staff. It is assumed that the number of patients arriving in any given time period follows a Poisson distribution.

Loreto performs a hypothesis test to determine whether she should employ extra staff. She finds that $320$ patients arrived during a randomly selected $3$ -hour clinic.

Loreto is also concerned about the average waiting time for patients to see a nurse. The health centre aims for at least $95 %$ of patients to see a nurse in under $20$ minutes.

Loreto assumes that the waiting times for patients are independent of each other and decides to perform a hypothesis test at a $10 %$ significance level to determine whether the health centre is meeting its target.

Loreto surveys $150$ patients and finds that $11$ of them waited more than $20$ minutes.

Write down null and alternative hypotheses for Loreto’s test.

[2]

a.i.

Using the data from Loreto’s sample, perform the hypothesis test at a $5 %$ significance level to determine if Loreto should employ extra staff.

[5]

a.ii.

Write down null and alternative hypotheses for this test.

[2]

b.i.

Perform the test, clearly stating the conclusion in context.

[5]

b.ii.

Markscheme

let $X$ be the random variable “number of patients arriving in a minute”, such that $X ~ Po (m)$

$H_{0} : m = 1.5$ A1

$H_{1} : m > 1.5$ A1

Note: Allow a value of $270$ for $m$ . Award at most A0A1 if it is not clear that it is the population mean being referred to e.g
$H_{0} :$ The number of patients is equal to 1.5 every minute
$H_{1} :$ The number of patients exceeds 1.5 every minute.
Referring to the “expected” number of patients or the use of $μ$ or $λ$ is sufficient for A1A1.

[2 marks]

a.i.

under $H_{0}$ let $Y$ be the number of patients in $3$ hours

$Y ~ Po (270)$ (A1)

$P (Y \geq 320) (= 1 - P (Y \leq 319)) = 0.00166 (0.00165874)$ (M1)A1

since $0.00166 < 0.05$ R1

(reject $H_{0}$ )

Loreto should employ more staff A1

[5 marks]

a.ii.

$H_{0}$ : The probability of a patient waiting less than $20$ minutes is $0.95$ A1

$H_{1}$ : The probability of a patient waiting less than $20$ minutes is less than $0.95$ A1

[2 marks]

b.i.

Under $H_{0}$ let $W$ be the number of patients waiting more than $20$ minutes

$W ~ B (150, 0.05)$ (A1)

$P (W \geq 11) = 0.132 (0.132215 \dots)$ (M1)A1

since $0.132 > 0.1$ R1

(fail to reject $H_{0}$ )

insufficient evidence to suggest they are not meeting their target A1

Note: Do not accept “they are meeting target” for the A1. Accept use of $B (150, 0.95)$ and $P (W \leq 139)$ and any consistent use of a random variable, appropriate $p$ -value and significance level.

[5 marks]

b.ii.

Examiners report

In part (a) there was a general lack of consistency in how candidates wrote down their null and alternative hypotheses. It was surprising how many candidates solved a Poisson $PDF$ rather than $CDF$ to find their $p$ -value. This suggests a lack of understanding of the nature of distributions or more specifically the concepts of hypothesis testing. In part (b), which was challenging, there were issues for many candidates in interpreting the situation. This is understandable since it was difficult, but as previously mentioned interpretation is a general issue in the paper. When writing down the conclusion of the tests, there was often very loose use of the terms accept/reject and candidates seemed unclear of the significance and importance of the correct use of these terms.

a.i.

a.ii.

b.i.

b.ii.

The marks achieved by eight students in a class test are given in the following list.

The teacher increases all the marks by 2. Write down the new value for

the standard deviation.

Markscheme

2.22 A1

[1 mark]

Examiners report

[N/A]

A random variable $X$ is normally distributed with mean $\mu$ and standard deviation $\sigma$ , such that ${\text{P}}(X < 30.31) = 0.1180$ and ${\text{P}}(X > 42.52) = 0.3060$ .

Find $\mu$ and $\sigma$ .

[6]

Find ${\text{P}}\left( {\left| {X - \mu } \right| < 1.2\sigma } \right)$ .

[2]

Markscheme

* This question is from an exam for a previous syllabus, and may contain minor differences in marking or structure.

${\text{P}}(X < 42.52) = 0.6940$ (M1)

either ${\text{P}}\left( {Z < \frac{{30.31 - \mu }}{\sigma }} \right) = 0.1180{\text{ or P}}\left( {Z < \frac{{42.52 - \mu }}{\sigma }} \right) = 0.6940$ (M1)

$\frac{{30.31 - \mu }}{\sigma } = \underbrace {{\Phi ^{ - 1}}(0.1180)}_{ - 1.1850 \ldots }$ (A1)

$\frac{{42.52 - \mu }}{\sigma } = \underbrace {{\Phi ^{ - 1}}(0.6940)}_{0.5072 \ldots }$ (A1)

attempting to solve simultaneously (M1)

$\mu = 38.9$ and $\sigma = 7.22$ A1

[6 marks]

${\text{P}}(\mu - 1.2\sigma < X < \mu + 1.2\sigma )$ (or equivalent eg. $2{\text{P}}(\mu < X < \mu + 1.2\sigma )$ ) (M1)

$= 0.770$ A1

Note: Award (M1)A1 for ${\text{P}}( - 1.2 < Z < 1.2) = 0.770$ .

[2 marks]

Examiners report

[N/A]

A geneticist uses a Markov chain model to investigate changes in a specific gene in a cell as it divides. Every time the cell divides, the gene may mutate between its normal state and other states.

The model is of the form

$(\begin{matrix} X_{n + 1} \\ Z_{n + 1} \end{matrix}) = M (\begin{matrix} X_{n} \\ Z_{n} \end{matrix})$

where $X_{n}$ is the probability of the gene being in its normal state after dividing for the $n th$ time, and $Z_{n}$ is the probability of it being in another state after dividing for the $n th$ time, where $n \in ℕ$ .

Matrix $M$ is found to be $(\begin{matrix} 0.94 & b \\ 0.06 & 0.98 \end{matrix})$ .

The gene is in its normal state when $n = 0$ . Calculate the probability of it being in its normal state

Write down the value of $b$ .

[1]

a.i.

What does $b$ represent in this context?

[1]

a.ii.

Find the eigenvalues of $M$ .

[3]

Find the eigenvectors of $M$ .

[3]

when $n = 5$ .

[2]

d.i.

in the long term.

[2]

d.ii.

Markscheme

$0.02$ A1

[1 mark]

a.i.

the probability of mutating from ‘not normal state’ to ‘normal state’ A1

Note: The A1 can only be awarded if it is clear that transformation is from the mutated state.

[1 mark]

a.ii.

$det (\begin{matrix} 0.94 - λ & 0.02 \\ 0.06 & 0.98 - λ \end{matrix}) = 0$ (M1)

Note: Award M1 for an attempt to find eigenvalues. Any indication that $det (M - λ I) = 0$ has been used is sufficient for the (M1).

$(0.94 - λ) (0.98 - λ) - 0.0012 = 0$ OR $λ^{2} - 1.92 λ + 0.92 = 0$ (A1)

$λ = 1, 0.92 (\frac{23}{25})$ A1

[3 marks]

$(\begin{matrix} 0.94 & 0.02 \\ 0.06 & 0.98 \end{matrix}) (\begin{matrix} x \\ y \end{matrix}) = (\begin{matrix} x \\ y \end{matrix})$ OR $(\begin{matrix} 0.94 & 0.02 \\ 0.06 & 0.98 \end{matrix}) (\begin{matrix} x \\ y \end{matrix}) = 0.92 (\begin{matrix} x \\ y \end{matrix})$ (M1)

Note: Award M1 can be awarded for attempting to find either eigenvector.

$0.02 y - 0.06 x = 0$ OR $0.02 y + 0.02 x = 0$

$(\begin{matrix} 1 \\ 3 \end{matrix})$ and $(\begin{matrix} 1 \\ - 1 \end{matrix})$ A1A1

Note: Accept any multiple of the given eigenvectors.

[3 marks]

${(\begin{matrix} 0.94 & 0.02 \\ 0.06 & 0.98 \end{matrix})}^{5} (\begin{matrix} 1 \\ 0 \end{matrix})$ OR $(\begin{matrix} 0.744 & 0.0852 \\ 0.256 & 0.915 \end{matrix}) (\begin{matrix} 1 \\ 0 \end{matrix})$ (M1)

Note: Condone omission of the initial state vector for the M1.

$0.744 (0.744311 \dots)$ A1

[2 marks]

d.i.

$(\begin{matrix} 0.25 \\ 0.75 \end{matrix})$ (A1)

Note: Award A1 for $(\begin{matrix} 0.25 \\ 0.75 \end{matrix})$ OR $(\begin{matrix} 0.25 & 0.25 \\ 0.75 & 0.75 \end{matrix})$ seen.

$0.25$ A1

[2 marks]

d.ii.

Examiners report

There was some difficulty in interpreting the meaning of the values in the transition matrix, but most candidates did well with the rest of the question. In part (d) there was frequently evidence of a correct method, but a failure to identify the correct probabilities. It was surprising to see a significant number of candidates diagonalizing the matrix in part (d) and this often led to errors. Clearly this was not necessary.

a.i.

[N/A]

a.ii.

[N/A]

d.i.

[N/A]

d.ii.

Scientists have developed a type of corn whose protein quality may help chickens gain weight faster than the present type used. To test this new type, 20 one-day-old chicks were fed a ration that contained the new corn while another control group of 20 chicks was fed the ordinary corn. The data below gives the weight gains in grams, for each group after three weeks.

The scientists wish to investigate the claim that Group B gain weight faster than Group A. Test this claim at the 5% level of significance, noting which hypothesis test you are using. You may assume that the weight gain for each group is normally distributed, with the same variance, and independent from each other.

[6]

The data from the two samples above are combined to form a single set of data. The following frequency table gives the observed frequencies for the combined sample. The data has been divided into five intervals.

Test, at the 5% level, whether the combined data can be considered to be a sample from a normal population with a mean of 380.

[10]

Markscheme

This is a t-test of the difference of two means. Our assumptions are that the two populations are approximately normal, samples are random, and they are independent from each other. (R1)

H₀: μ₁ − μ₂ = 0

H₁: μ₁ − μ₂ < 0 (A1)

t = −2.460, (A1)

degrees of freedom = 38 (A1)

Since the value of critical t = −1.686 we reject H₀. (A1)

Hence group B grows faster. (R1)

[6 marks]

This is a ${\chi ^2}$ goodness-of-fit test.

To finish the table, the frequencies of the respective cells have to be calculated. Since the standard deviation is not given, it has to be estimated using the data itself. s = 49.59, eg the third expected frequency is 40 × 0.308 = 12.32, since P(350.5 < W < 390.5) = 0.3078...

The table of observed and expected frequencies is:

(M1)(A2)

Since the first expected frequency is 3.22, we combine the two cells, so that the first two rows become one row, that is,

(M1)

Number of degrees of freedom is 4 – 1 – 1 = 2 (C1)

H₀: The distribution is normal with mean 380

H₁: The distribution is not normal with mean 380 (A1)

The test statistic is

$\chi _{calc}^2 = \sum {\frac{{{{\left( {{f_e} - {f_0}} \right)}^2}}}{{{f_e}}}} = \frac{{{{\left( {11 - 11.04} \right)}^2}}}{{11.04}} + \frac{{{{\left( {8 - 12.32} \right)}^2}}}{{12.32}} + \frac{{{{\left( {15 - 10.48} \right)}^2}}}{{10.48}} + \frac{{{{\left( {6 - 6.17} \right)}^2}}}{{6.17}}$

= 3.469 (A1)

With 2 degrees of freedom, the critical number is ${\chi ^2}$ = 5.99 (A2)

So, we do not have enough evidence to reject the null hypothesis. Therefore, there is no evidence to say that the distribution is not normal with mean 380. (R1)

[10 marks]

Examiners report

[N/A]

The random variable X has a binomial distribution with parameters n and p.
It is given that E(X) = 3.5.

Find the least possible value of n.

[2]

It is further given that P(X ≤ 1) = 0.09478 correct to 4 significant figures.

Determine the value of n and the value of p.

[5]

Markscheme

* This question is from an exam for a previous syllabus, and may contain minor differences in marking or structure.

np = 3.5 (A1)

p ≤ 1 ⇒ least n = 4 A1

[2 marks]

(1 − p)ⁿ + np(1 − p)ⁿ⁻¹ = 0.09478 M1A1

attempt to solve above equation with np = 3.5 (M1)

n = 12, p = $\frac{7}{{24}}$ (=0.292) A1A1

Note: Do not accept n as a decimal.

[5 marks]

Examiners report

[N/A]

A survey of British holidaymakers found that $15 %$ of those surveyed took a holiday in the Lake District in $2019$ .

A random sample of $16$ British holidaymakers was taken. The number of people in the sample who took a holiday in the Lake District in $2019$ can be modelled by a binomial distribution.

State two assumptions made in order for this model to be valid.

[2]

a.i.

Find the probability that at least three people from the sample took a holiday in the Lake District in $2019$ .

[2]

a.ii.

From a random sample of $n$ holidaymakers, the probability that at least one of them took a holiday in the Lake District in $2019$ is greater than $0.999$ .

Determine the least possible value of $n$ .

[3]

Markscheme

people’s holidays are independent of each other R1

the proportion is constant (at $0.15$ ) R1

[2 marks]

a.i.

$Χ ~ B (16, 0.15)$

$P (Χ \geq 3) = 0.439$ (M1)A1

[2 marks]

a.ii.

probability of at least one $= 1 -$ probability of none

$\Rightarrow 1 - 0 . 85^{n} > 0.999$ OR $0 . 85^{n} < 0.001$ (A1)

attempt to solve inequality (M1)

$n \geq 42.503 \dots$

so least possible $n = 43$ A1

[3 marks]

Examiners report

[N/A]

a.i.

[N/A]

a.ii.

[N/A]

A discrete random variable $X$ follows a Poisson distribution ${\text{Po}}(\mu )$ .

Show that ${\text{P}}(X = x + 1) = \frac{\mu }{{x + 1}} \times {\text{P}}(X = x),{\text{ }}x \in \mathbb{N}$ .

[3]

Given that ${\text{P}}(X = 2) = 0.241667$ and ${\text{P}}(X = 3) = 0.112777$ , use part (a) to find the value of $\mu$ .

[3]

Markscheme

* This question is from an exam for a previous syllabus, and may contain minor differences in marking or structure.

METHOD 1

${\text{P}}(X = x + 1) = \frac{{{\mu ^{x + 1}}}}{{(x + 1)!}}{{\text{e}}^{ - \mu }}$ A1

$= \frac{\mu }{{x + 1}} \times \frac{{{\mu ^x}}}{{x!}}{{\text{e}}^{ - \mu }}$ M1A1

$= \frac{\mu }{{x + 1}} \times {\text{P}}(X = x)$ AG

METHOD 2

$\frac{\mu }{{x + 1}} \times {\text{P}}(X = x) = \frac{\mu }{{x + 1}} \times \frac{{{\mu ^x}}}{{x!}}{{\text{e}}^{ - \mu }}$ A1

$= \frac{{{\mu ^{x + 1}}}}{{(x + 1)!}}{{\text{e}}^{ - \mu }}$ M1A1

$= {\text{P}}(X = x + 1)$ AG

METHOD 3

$\frac{{{\text{P}}(X = x + 1)}}{{{\text{P}}(X = x)}} = \frac{{\frac{{{\mu ^{x + 1}}}}{{(x + 1)!}}{{\text{e}}^{ - \mu }}}}{{\frac{{{\mu ^x}}}{{x!}}{{\text{e}}^{ - \mu }}}}$ (M1)

$= \frac{{{\mu ^{x + 1}}}}{{{\mu ^x}}} \times \frac{{x!}}{{(x + 1)!}}$ A1

$= \frac{\mu }{{x + 1}}$ A1

and so ${\text{P}}(X = x + 1) = \frac{\mu }{{x + 1}} \times {\text{P}}(X = x)$ AG

[3 marks]

${\text{P}}(X = 3) = \frac{\mu }{3} \bullet {\text{P}}(X = 2){\text{ }}\left( {0.112777 = \frac{\mu }{3} \bullet 0.241667} \right)$ A1

attempting to solve for $\mu$ (M1)

$\mu = 1.40$ A1

[3 marks]

Examiners report

[N/A]

Jenna is a keen book reader. The number of books she reads during one week can be modelled by a Poisson distribution with mean $2.6$ .

Determine the expected number of weeks in one year, of $52$ weeks, during which Jenna reads at least four books.

Markscheme

* This question is from an exam for a previous syllabus, and may contain minor differences in marking or structure.

let $Χ$ be the random variable “number of books Jenna reads per week.”

then $Χ ~ Po (2.6)$

$P (Χ \geq 4) = 0 .264 (0.263998 \dots)$ (M1)(A1)

$0.263998 \dots \times 52$ (M1)

$= 13.7$ A1

Note: Accept $14$ weeks.

[4 marks]

Examiners report

[N/A]

It is known that 56 % of Infiglow batteries have a life of less than 16 hours, and 94 % have a life less than 17 hours. It can be assumed that battery life is modelled by the normal distribution ${\text{N}}\left( {\mu ,\,\,{\sigma ^2}} \right)$ .

Find the value of $\mu$ and the value of $\sigma$ .

[6]

Find the probability that a randomly selected Infiglow battery will have a life of at least 15 hours.

[2]

Markscheme

* This question is from an exam for a previous syllabus, and may contain minor differences in marking or structure.

use of inverse normal (implied by ±0.1509… or ±1.554…) (M1)

P(X < 16) = 0.56

$\Rightarrow \frac{{16 - \mu }}{\sigma } = 0.1509 \ldots$ (A1)

P(X < 17) = 0.94

$\Rightarrow \frac{{17 - \mu }}{\sigma } = 1.554 \ldots$ (A1)

attempt to solve a pair of simultaneous equations (M1)

$\mu$ = 15.9, $\sigma$ = 0.712 A1A1

[6 marks]

correctly shaded diagram or intent to find P(X ≥ 15) (M1)

= 0.895 A1

Note: Accept answers rounding to 0.89 or 0.90. Award M1A0 for the answer 0.9.

[2 marks]

Examiners report

[N/A]

The continuous random variable X has probability density function $f$ given by

$f\left( x \right) = \left\{ {\begin{array}{*{20}{c}} {3ax}&,&{0 \leqslant x < 0.5} \\ {a\left( {2 - x} \right)}&,&{0.5 \leqslant x < 2} \\ 0&,&{{\text{otherwise}}} \end{array}} \right.$

Show that $a = \frac{2}{3}$ .

[3]

Find ${\text{P}}\left( {X < 1} \right)$ .

[3]

Given that ${\text{P}}\left( {s < X < 0.8} \right) = 2 \times {\text{P}}\left( {2s < X < 0.8} \right)$ , and that 0.25 < s < 0.4 , find the value of s.

[7]

Markscheme

* This question is from an exam for a previous syllabus, and may contain minor differences in marking or structure.

$a\left[ {\int_0^{0.5} {3x\,{\text{d}}x} + \int_{0.5}^2 {\left( {2 - x} \right)} \,{\text{d}}x} \right] = 1$ M1

Note: Award the M1 for the total integral equalling 1, or equivalent.

$a\left( {\frac{3}{2}} \right) = 1$ (M1)A1

$a = \frac{2}{3}$ AG

[3 marks]

EITHER

$\int_0^{0.5} {2x\,{\text{d}}x} + \frac{2}{3}\int_{0.5}^1 {\left( {2 - x} \right)} \,{\text{d}}x$ (M1)(A1)

$= \frac{2}{3}$ A1

$\frac{2}{3}\int_1^2 {\left( {2 - x} \right)} \,{\text{d}}x = \frac{1}{3}$ (M1)

so ${\text{P}}\left( {X < 1} \right) = \frac{2}{3}$ (M1)A1

[3 marks]

${\text{P}}\left( {s < X < 0.8} \right) = \int_s^{0.5} {2x\,{\text{d}}x} + \frac{2}{3}\int_{0.5}^{0.8} {\left( {2 - x} \right)} \,{\text{d}}x$ M1A1

$= \left[ {{x^2}} \right]_s^{0.5} + 0.27$

$0.25 - {s^2} + 0.27$ (A1)

${\text{P}}\left( {2s < X < 0.8} \right) = \frac{2}{3}\int_{2s}^{0.8} {\left( {2 - x} \right)} \,{\text{d}}x$ A1

$= \frac{2}{3}\left[ {2x - \frac{{{x^2}}}{2}} \right]_{2s}^{0.8}$

$\frac{2}{3}\left( {1.28 - \left( {4s - 2{s^2}} \right)} \right)$

equating

$0.25 - {s^2} + 0.27 = \frac{4}{3}\left( {1.28 - \left( {4s - 2{s^2}} \right)} \right)$ (A1)

attempt to solve for s (M1)

s = 0.274 A1

[7 marks]

Examiners report

[N/A]

Steffi the stray cat often visits Will’s house in search of food. Let $X$ be the discrete random variable “the number of times per day that Steffi visits Will’s house”.

The random variable $X$ can be modelled by a Poisson distribution with mean 2.1.

Let Y be the discrete random variable “the number of times per day that Steffi is fed at Will’s house”. Steffi is only fed on the first four occasions that she visits each day.

Find the probability that on a randomly selected day, Steffi does not visit Will’s house.

[2]

Copy and complete the probability distribution table for Y.

[4]

Hence find the expected number of times per day that Steffi is fed at Will’s house.

[3]

In any given year of 365 days, the probability that Steffi does not visit Will for at most $n$ days in total is 0.5 (to one decimal place). Find the value of $n$ .

[3]

Show that the expected number of occasions per year on which Steffi visits Will’s house and is not fed is at least 30.

[4]

Markscheme

* This question is from an exam for a previous syllabus, and may contain minor differences in marking or structure.

$X \sim {\text{Po}}\left( {2.1} \right)$

${\text{P}}\left( {X = 0} \right) = 0.122\left( { = {{\text{e}}^{ - 2.1}}} \right)$ (M1)A1

[2 marks]

A1A1A1A1

Note: Award A1 for each correct probability for Y = 1, 2, 3, 4. Accept 0.162 for P(Y = 4).

[4 marks]

${\text{E}}\left( Y \right) = \sum {y{\text{P}}\left( {Y = y} \right)}$ (M1)

$= 1 \times 0.257 \ldots + 2 \times 0.270 \ldots + 3 \times 0.189 + 4 \times 0.161 \ldots$ (A1)

$= 2.01$ A1

[3 marks]

let $T$ be the no of days per year that Steffi does not visit

$T \sim B\left( {365{\text{,}}\,\,0.122 \ldots } \right)$ (M1)

require $0.45 \leqslant {\text{P}}\left( {T \leqslant n} \right) < 0.55$ (M1)

${\text{P}}\left( {T \leqslant 44} \right) = 0.51$

$n = 44$ A1

[3 marks]

METHOD 1

let $V$ be the discrete random variable “number of times Steffi is not fed per day”

${\text{E}}\left( V \right) = 1 \times {\text{P}}\left( {X = 5} \right) + 2 \times {\text{P}}\left( {X = 6} \right) + 3 \times {\text{P}}\left( {X = 7} \right) + \ldots$ M1

$= 1 \times 0.0416 \ldots + 2 \times 0.0145 \ldots + 3 \times 0.00437 \ldots + \ldots$ A1

= 0.083979... A1

expected no of occasions per year > 0.083979... × 365 = 30.7 A1

hence Steffi can expect not to be fed on at least 30 occasions AG

Note: Candidates may consider summing more than three terms in their calculation for ${\text{E}}\left( V \right)$ .

METHOD 2

${\text{E}}\left( X \right) - {\text{E}}\left( Y \right) = 0.0903 \ldots$ M1A1

0.0903… × 365 M1

= 33.0 > 30 A1AG

[4 marks]

Examiners report

[N/A]

Timmy owns a shop. His daily income from selling his goods can be modelled as a normal distribution, with a mean daily income of $820, and a standard deviation of $230. To make a profit, Timmy’s daily income needs to be greater than $1000.

Calculate the probability that, on a randomly selected day, Timmy makes a profit.

[2]

The shop is open for 24 days every month.

Calculate the probability that, in a randomly selected month, Timmy makes a profit on between 5 and 10 days (inclusive).

[3]

Markscheme

* This question is from an exam for a previous syllabus, and may contain minor differences in marking or structure.

X ~ N(820, 230²) (M1)

Note: Award M1 for an attempt to use normal distribution. Accept labelled normal graph.

⇒P(X > 1000) = 0.217 A1

[2 marks]

Y ~ B(24,0.217...) (M1)

Note: Award M1 for recognition of binomial distribution with parameters.

P(Y ≤ 10) − P(Y ≤ 4) (M1)

Note: Award M1 for an attempt to find P(5 ≤ Y ≤ 10) or P(Y ≤ 10) − P(Y ≤ 4).

= 0.613 A1

[3 marks]

Examiners report

[N/A]

The number of bananas that Lucca eats during any particular day follows a Poisson distribution with mean 0.2.

Find the probability that Lucca eats at least one banana in a particular day.

[2]

Find the expected number of weeks in the year in which Lucca eats no bananas.

[4]

Markscheme

* This question is from an exam for a previous syllabus, and may contain minor differences in marking or structure.

let $X$ be the number of bananas eaten in one day

$X \sim {\text{Po}}(0.2)$

${\text{P}}(X \geqslant 1) = 1 - {\text{P}}(X = 0)$ (M1)

$= 0.181{\text{ }}( = 1 - {{\text{e}}^{ - 0.2}})$ A1

[2 marks]

EITHER

let $Y$ be the number of bananas eaten in one week

${\text{Y}} \sim {\text{Po}}(1.4)$ (A1)

${\text{P}}(Y = 0) = 0.246596 \ldots {\text{ }}( = {{\text{e}}^{ - 1.4}})$ (A1)

let $Z$ be the number of days in one week at least one banana is eaten

$Z \sim {\text{B}}(7,{\text{ }}0.181 \ldots )$ (A1)

${\text{P}}(Z = 0) = 0.246596 \ldots$ (A1)

THEN

$52 \times 0.246596 \ldots$ (M1)

$= 12.8{\text{ }}( = 52{{\text{e}}^{ - 1.4}})$ A1

[4 marks]

Examiners report

[N/A]

Events $A$ and $B$ are such that ${\text{P}}(A \cup B) = 0.95,{\text{ P}}(A \cap B) = 0.6$ and ${\text{P}}(A|B) = 0.75$ .

Find ${\text{P}}(B)$ .

[2]

Find ${\text{P}}(A)$ .

[2]

Hence show that events $A^{'}$ and $B$ are independent.

[2]

Markscheme

* This question is from an exam for a previous syllabus, and may contain minor differences in marking or structure.

${\text{P}}(A|B) = \frac{{{\text{P}}(A \cap B)}}{{{\text{P}}(B)}}$

$\Rightarrow 0.75 = \frac{{0.6}}{{{\text{P}}(B)}}$ (M1)

$\Rightarrow {\text{P}}(B){\text{ }}\left( { = \frac{{0.6}}{{0.75}}} \right) = 0.8$ A1

[2 marks]

${\text{P}}(A \cup B) = {\text{P}}(A) + {\text{P}}(B) - {\text{P}}(A \cap B)$

$\Rightarrow 0.95 = {\text{P}}(A) + 0.8 - 0.6$ (M1)

$\Rightarrow {\text{P}}(A) = 0.75$ A1

[2 marks]

METHOD 1

${\text{P}}(A'|B) = \frac{{{\text{P}}(A' \cap B)}}{{{\text{P}}(B)}} = \frac{{0.2}}{{0.8}} = 0.25$ A1

${\text{P}}(A'|B) = {\text{P}}(A’)$ R1

hence $A^{'}$ and $B$ are independent AG

Note: If there is evidence that the student has calculated ${\text{P}}(A' \cap B) = 0.2$ by assuming independence in the first place, award A0R0.

METHOD 2

EITHER

${\text{P}}(A) = {\text{P}}(A|B)$ A1

${\text{P}}(A) \times {\text{P}}(B) = 0.75 \times 0.80 = 0.6 = {\text{P}}(A \cap B)$ A1

THEN

$A$ and $B$ are independent R1

hence $A^{'}$ and $B$ are independent AG

METHOD 3

${\text{P}}(A') \times {\text{P}}(B) = 0.25 \times 0.80 = 0.2$ A1

${\text{P}}(A') \times {\text{P}}(B) = {\text{P}}(A' \cap B)$ R1

hence $A^{'}$ and $B$ are independent AG

[2 marks]

Examiners report

[N/A]

A horse breeder records the number of births for each of 100 horses during the past eight years. The results are summarized in the following table:

Stating null and alternative hypotheses carry out an appropriate test at the 5% significance level to decide whether the results can be modelled by B (6, 0.5).

[10]

Without doing any further calculations, explain briefly how you would carry out a test, at the 5% significance level, to decide if the data can be modelled by B(6, $p$ ), where $p$ is unspecified.

[2]

A different horse breeder collected data on the time and outcome of births. The data are summarized in the following table:

Carry out an appropriate test at the 5% significance level to decide whether there is an association between time and outcome.

[8]

Markscheme

METHOD 1

H₀: distribution is B(6, 0.5); H₁: distribution is not B(6, 0.5) A1

$\left( {{E_0} = 100{{\left( {0.5} \right)}^6} = \frac{{25}}{{16}} = 0.015625} \right)$ A3

Combining the first two columns and the last two columns: A1

${\chi ^2} = \sum {\frac{{{O^2}}}{E}} - \sum E$

$= \frac{{{6^2}}}{{\left( {\frac{{175}}{{16}}} \right)}} + \frac{{{{26}^2}}}{{\left( {\frac{{375}}{{16}}} \right)}} + \frac{{{{37}^2}}}{{\left( {\frac{{500}}{{16}}} \right)}} + \frac{{{{18}^2}}}{{\left( {\frac{{375}}{{16}}} \right)}} + \frac{{{{13}^2}}}{{\left( {\frac{{175}}{{16}}} \right)}} - 100$ (M1)

= 5.22 A1

$v$ = 4, so critical value of $\chi _{5{\text{% }}}^2 = 9.488$ A1A1

Since 5.22 < 9.488 the result is not significant and we accept H₀ R1

METHOD 2

H₀: distribution is B(6, 0.5); H₁: distribution is not B(6, 0.5) A1

By GDC, $p = 0.266$ A8

Since 0.266 > 0.05 the result is not significant and we accept H₀ R1

[10 marks]

Estimate $p$ from the data which would entail the loss of one degree of freedom A1A1

[2 marks]

H₀: there is no association H₁: there is an association A1

${\chi ^2} = \frac{{{{68}^2}}}{{64.8}} + \frac{{{{42}^2}}}{{45.2}} + \frac{{{{103}^2}}}{{94.3}} + \ldots + \frac{{{6^2}}}{{10.6}} + \frac{{{{12}^2}}}{{7.4}} - 314$ (M1)

= 15.7

$v = 3{\text{,}}\,\,\chi _{5{\text{% }}}^2\left( 3 \right) = 7.815$ A1A1

Since 15.7 > 7.815 we reject H₀ R1

METHOD 2

H₀: there is no association H₁: there is an association A1

By GDC, $p$ = 0.00129 A6

Since 0.00129 < 0.05 we reject H₀. R1

[8 marks]

Examiners report

[N/A]

Each of the 25 students in a class are asked how many pets they own. Two students own three pets and no students own more than three pets. The mean and standard deviation of the number of pets owned by students in the class are $\frac{{18}}{{25}}$ and $\frac{{24}}{{25}}$ respectively.

Find the number of students in the class who do not own a pet.

Markscheme

* This question is from an exam for a previous syllabus, and may contain minor differences in marking or structure.

METHOD 1

let p have no pets, q have one pet and r have two pets (M1)

p + q + r + 2 = 25 (A1)

0p + 1q + 2r + 6 = 18 A1

Note: Accept a statement that there are a total of 12 pets.

attempt to use variance equation, or evidence of trial and error (M1)

$\frac{{0p + 1q + 4r + 18}}{{25}} - {\left( {\frac{{18}}{{25}}} \right)^2} = {\left( {\frac{{24}}{{25}}} \right)^2}$ (A1)

attempt to solve a system of linear equations (M1)

p = 14 A1

METHOD 2

(M1)

$p + q + r + \frac{2}{{25}} = 1$ (A1)

$q + 2r + \frac{6}{{25}} = \frac{{18}}{{25}}\left( { \Rightarrow q + 2r = \frac{{12}}{{25}}} \right)$ A1

$q + 4r + \frac{{18}}{{25}} - {\left( {\frac{{18}}{{25}}} \right)^2} = \frac{{576}}{{625}}\left( { \Rightarrow q + 4r = \frac{{18}}{{25}}} \right)$ (M1)(A1)

$q = \frac{6}{{25}},\,\,r = \frac{3}{{25}}$ (M1)

$p = \frac{{14}}{{25}}$ A1

so 14 have no pets

[7 marks]

Examiners report

[N/A]

Iqbal attempts three practice papers in mathematics. The probability that he passes the first paper is 0.6. Whenever he gains a pass in a paper, his confidence increases so that the probability of him passing the next paper increases by 0.1. Whenever he fails a paper the probability of him passing the next paper is 0.6.

Complete the given probability tree diagram for Iqbal’s three attempts, labelling each branch with the correct probability.

[3]

Calculate the probability that Iqbal passes at least two of the papers he attempts.

[2]

Find the probability that Iqbal passes his third paper, given that he passed only one previous paper.

[3]

Markscheme

* This question is from an exam for a previous syllabus, and may contain minor differences in marking or structure.

A1A1A1

Note: Award A1 for each correct column of probabilities.

[3 marks]

probability (at least twice) =

EITHER

$\left( {0.6 \times 0.7 \times 0.8} \right) + \left( {0.6 \times 0.7 \times 0.2} \right) + \left( {0.6 \times 0.3 \times 0.6} \right) + \left( {0.4 \times 0.6 \times 0.7} \right)$ (M1)

$\left( {0.6 \times 0.7} \right) + \left( {0.6 \times 0.3 \times 0.6} \right) + \left( {0.4 \times 0.6 \times 0.7} \right)$ (M1)

Note: Award M1 for summing all required probabilities.

THEN

= 0.696 A1

[2 marks]

P(passes third paper given only one paper passed before)

$= \frac{{{\text{P}}\,\left( {{\text{passes third AND only one paper passed before}}} \right)}}{{{\text{P}}\,\left( {{\text{passes once in first two papers}}} \right)}}$ (M1)

$= \frac{{\left( {0.6 \times 0.3 \times 0.6} \right) + \left( {0.4 \times 0.6 \times 0.7} \right)}}{{\left( {0.6 \times 0.3} \right) + \left( {0.4 \times 0.6} \right)}}$ A1

= 0.657 A1

[3 marks]

Examiners report

[N/A]

There are 75 players in a golf club who take part in a golf tournament. The scores obtained on the 18th hole are as shown in the following table.

M17/5/MATHL/HP2/ENG/TZ2/01

One of the players is chosen at random. Find the probability that this player’s score was 5 or more.

[2]

Calculate the mean score.

[2]

Markscheme

* This question is from an exam for a previous syllabus, and may contain minor differences in marking or structure.

${\text{P}}(5{\text{ or more}}) = \frac{{29}}{{75}}\,\,\,( = 0.387)$ (M1)A1

[2 marks]

${\text{mean score}} = \frac{{2 \times 3 + 3 \times 15 + 4 \times 28 + 5 \times 17 + 6 \times 9 + 7 \times 3}}{{75}}$ (M1)

$= \frac{{323}}{{75}}\,\,\,( = 4.31)$ A1

[2 marks]

Examiners report

[N/A]

Jorge is carefully observing the rise in sales of a new app he has created.

The number of sales in the first four months is shown in the table below.

Jorge believes that the increase is exponential and proposes to model the number of sales $N$ in month $t$ with the equation

$N = A e^{r t}, A, r \in ℝ$

Jorge plans to adapt Euler’s method to find an approximate value for $r$ .

With a step length of one month the solution to the differential equation can be approximated using Euler’s method where

$N (n + 1) \approx N (n) + 1 \times N' (n), n \in ℕ$

Jorge decides to take the mean of these values as the approximation of $r$ for his model. He also decides the graph of the model should pass through the point $(2, 52)$ .

The sum of the square residuals for these points for the least squares regression model is approximately $6.555$ .

Show that Jorge’s model satisfies the differential equation

$\frac{d N}{d t} = r N$

[2]

Show that $r \approx \frac{N (n + 1) - N (n)}{N (n)}$

[3]

Hence find three approximations for the value of $r$ .

[3]

Find the equation for Jorge’s model.

[3]

Find the sum of the square residuals for Jorge’s model using the values $t = 1, 2, 3, 4$ .

[2]

Comment how well Jorge’s model fits the data.

[1]

f.i.

Give two possible sources of error in the construction of his model.

[2]

f.ii.

Markscheme

$\frac{d N}{d t} = r A e^{r t}$ (M1)A1

Note: M1 is for an attempt to find $\frac{d N}{d t}$

$= r N$ AG

Note: Accept solution of the differential equation by separating variables

[2 marks]

$N (n + 1) \approx N (n) + 1 \times N' (n) \Rightarrow N' (n) \approx N (n + 1) - N (n)$ M1

$\Rightarrow r N (n) \approx N (n + 1) - N (n)$ M1A1

$\Rightarrow r \approx \frac{N (n + 1) - N (n)}{N (n)}$ AG

Note: Do not penalize the use of the $=$ sign.

[3 marks]

Correct method (M1)

$r \approx \frac{52 - 40}{40} = 0.3$

$r \approx \frac{70 - 52}{52} = 0.346$

$r \approx \frac{98 - 70}{70} = 0.4$ A2

Note: A1 for a single error A0 for two or more errors.

[3 marks]

$r = 0.349 (0.34871 \dots)$ or $\frac{68}{195}$ A1

$52 = A e^{0.34871 \dots \times 2}$ (M1)

$A = 25.8887 \dots$

$N = 25.9 e^{0.349 t}$ A1

[3 marks]

${(36.6904 \dots - 40)}^{2} + 0 + {(73.6951 \dots - 70)}^{2} + {(104.4435 \dots - 98)}^{2}$ (M1)

$= 66.1 (66.126 \dots)$ A1

[2 marks]

The sum of the square residuals is approximately $10$ times as large as the minimum possible, so Jorge’s model is unlikely to fit the data exactly R1

[1 mark]

f.i.

For example

Selecting a single point for the curve to pass through

Approximating the gradient of the curve by the gradient of a chord R1R1

[2 marks]

f.ii.

Examiners report

[N/A]

f.i.

[N/A]

f.ii.

A random variable $X$ has a probability distribution given in the following table.

N16/5/MATHL/HP2/ENG/TZ0/01

Determine the value of ${\text{E}}({X^2})$ .

[2]

Find the value of ${\text{Var}}(X)$ .

[3]

Markscheme

* This question is from an exam for a previous syllabus, and may contain minor differences in marking or structure.

${\text{E}}({X^2}) = \Sigma {x^2} \bullet {\text{P}}(X = x) = 10.37{\text{ }}( = 10.4{\text{ 3 sf)}}$ (M1)A1

[2 marks]

METHOD 1

${\text{sd}}(X) = 1.44069 \ldots$ (M1)(A1)

${\text{Var}}(X) = 2.08{\text{ }}( = 2.0756)$ A1

METHOD 2

${\text{E}}(X) = 2.88{\text{ }}( = 0.06 + 0.27 + 0.5 + 0.98 + 0.63 + 0.44)$ (A1)

use of ${\text{Var}}(X) = {\text{E}}({X^2}) - {\left( {{\text{E}}(X)} \right)^2}$ (M1)

Note: Award (M1) only if ${\left( {{\text{E}}(X)} \right)^2}$ is used correctly.

$\left( {{\text{Var}}(X) = 10.37 - 8.29} \right)$

${\text{Var}}(X) = 2.08{\text{ }}( = 2.0756)$ A1

Note: Accept 2.11.

METHOD 3

${\text{E}}(X) = 2.88{\text{ }}( = 0.06 + 0.27 + 0.5 + 0.98 + 0.63 + 0.44)$ (A1)

use of ${\text{Var}}(X) = {\text{E}}\left( {{{\left( {X - {\text{E}}(X)} \right)}^2}} \right)$ (M1)

$(0.679728 + \ldots + 0.549152)$

${\text{Var}}(E) = 2.08{\text{ }}( = 2.0756)$ A1

[3 marks]

Examiners report

[N/A]

When carpet is manufactured, small faults occur at random. The number of faults in Premium carpets can be modelled by a Poisson distribution with mean 0.5 faults per 20 $\,$ m². Mr Jones chooses Premium carpets to replace the carpets in his office building. The office building has 10 rooms, each with the area of 80 $\,$ m².

Find the probability that the carpet laid in the first room has fewer than three faults.

[3]

Find the probability that exactly seven rooms will have fewer than three faults in the carpet.

[3]

Markscheme

* This question is from an exam for a previous syllabus, and may contain minor differences in marking or structure.

$\lambda = 4 \times 0.5$ (M1)

$\lambda = 2$ (A1)

${\text{P}}(X \leqslant 2) = 0.677$ A1

[3 marks]

$Y \sim B(10,{\text{ }}0,677)$ (M1)(A1)

${\text{P}}(Y = 7) = 0.263$ A1

Note: Award M1 for clear recognition of binomial distribution.

[3 marks]

Examiners report

[N/A]

The hens on a farm lay either white or brown eggs. The eggs are put into boxes of six. The farmer claims that the number of brown eggs in a box can be modelled by the binomial distribution, B(6, $p$ ). By inspecting the contents of 150 boxes of eggs she obtains the following data.

Show that this data leads to an estimated value of $p = 0.4$ .

[1]

Stating null and alternative hypotheses, carry out an appropriate test at the 5 % level to decide whether the farmer’s claim can be justified.

[11]

Markscheme

from the sample, the probability of a brown egg is

$\frac{{0 \times 7 + 1 \times 32 + \ldots }}{{6 \times 150}} = \frac{{360}}{{900}} = 0.4$ A1

$p = 0.4$ AG

[1 mark]

if the data can be modelled by a binomial distribution with $p = 0.4$ , the expected frequencies of boxes are given in the table

Notes: Deduct one mark for each error or omission.
Accept any rounding to at least one decimal place.

null hypothesis: the distribution is binomial A1

alternative hypothesis: the distribution is not binomial A1

for a chi-squared test the last two columns should be combined R1

$\chi _{{\text{calc}}}^2 = \frac{{{{\left( {7 - 7} \right)}^2}}}{7} + \frac{{{{\left( {32 - 28} \right)}^2}}}{{28}} + \ldots = 6.05$ (Accept 6.06) (M1)A1

degrees of freedom = 4 A1

critical value = 9.488 A1

Or use of $p$ -value

we conclude that the farmer’s claim can be justified R1

[11 marks]

Examiners report

[N/A]

The number of taxis arriving at Cardiff Central railway station can be modelled by a Poisson distribution. During busy periods of the day, taxis arrive at a mean rate of 5.3 taxis every 10 minutes. Let T represent a random 10 minute busy period.

Find the probability that exactly 4 taxis arrive during T.

[2]

a.i.

Find the most likely number of taxis that would arrive during T.

[2]

a.ii.

Given that more than 5 taxis arrive during T, find the probability that exactly 7 taxis arrive during T.

[3]

a.iii.

During quiet periods of the day, taxis arrive at a mean rate of 1.3 taxis every 10 minutes.

Find the probability that during a period of 15 minutes, of which the first 10 minutes is busy and the next 5 minutes is quiet, that exactly 2 taxis arrive.

[6]

Markscheme

$X \sim {\text{Po}}\left( {5.3} \right)$

${\text{P}}\left( {X = 4} \right) = {{\text{e}}^{ - 5.3}}\frac{{{{5.3}^4}}}{{4{\text{!}}}}$ (M1)

= 0.164 A1

[2 marks]

a.i.

METHOD 1

listing probabilities (table or graph) M1

mode X = 5 (with probability 0.174) A1

Note: Award M0A0 for 5 (taxis) or mode = 5 with no justification.

METHOD 2

mode is the integer part of mean R1

E(X) = 5.3 ⇒ mode = 5 A1

Note: Do not allow R0A1.

[2 marks]

a.ii.

attempt at conditional probability (M1)

$\frac{{{\text{P}}\left( {X = 7} \right)}}{{{\text{P}}\left( {X \geqslant 6} \right)}}$ or equivalent $\left( { = \frac{{0.1163 \ldots }}{{0.4365 \ldots }}} \right)$ A1

= 0.267 A1

[3 marks]

a.iii.

METHOD 1

the possible arrivals are (2,0), (1,1), (0,2) (A1)

$Y \sim {\text{Po}}\left( {0.65} \right)$ A1

attempt to compute, using sum and product rule, (M1)

0.070106… × 0.52204… + 0.026455… × 0.33932… + 0.0049916… × 0.11028… (A1)(A1)

Note: Award A1 for one correct product and A1 for two other correct products.

= 0.0461 A1

[6 marks]

METHOD 2

recognising a sum of 2 independent Poisson variables eg Z = X + Y R1

$\lambda = 5.3 + \frac{{1.3}}{2}$

P(Z = 2) = 0.0461 (M1)A3

[6 marks]

Examiners report

[N/A]

a.i.

[N/A]

a.ii.

[N/A]

a.iii.

[N/A]

The age, L, in years, of a wolf can be modelled by the normal distribution L ~ N(8, 5).

Find the probability that a wolf selected at random is at least 5 years old.

[2]

Eight wolves are independently selected at random and their ages recorded.

Find the probability that more than six of these wolves are at least 5 years old.

[3]

Markscheme

* This question is from an exam for a previous syllabus, and may contain minor differences in marking or structure.

P(L ≥ 5) = 0.910 (M1)A1

[2 marks]

X is the number of wolves found to be at least 5 years old recognising binomial distribution M1

X ~ B(8, 0.910…)

P(X > 6) = 1 − P(X ≤ 6) (M1)

= 0.843 A1

Note: Award M1A0 for finding P(X ≥ 6).

[3 marks]

Examiners report

[N/A]

Consider two events $A$ and $B$ such that ${\text{P}}(A) = k,{\text{ P}}(B) = 3k,{\text{ P}}(A \cap B) = {k^2}$ and ${\text{P}}(A \cup B) = 0.5$ .

Calculate $k$ ;

[3]

Find ${\text{P}}(A' \cap B)$ .

[3]

Markscheme

* This question is from an exam for a previous syllabus, and may contain minor differences in marking or structure.

use of ${\text{P}}(A \cup B) = {\text{P}}(A) + {\text{P}}(B) - {\text{P}}(A \cap B)$ M1

$0.5 = k + 3k - {k^2}$ A1

${k^2} - 4k + 0.5 = 0$

$k = 0.129$ A1

Note: Do not award the final A1 if two solutions are given.

[3 marks]

use of ${\text{P}}(A' \cap B) = {\text{P}}(B) - {\text{P}}(A \cap B)$ or alternative (M1)

${\text{P}}(A' \cap B) = 3k - {k^2}$ (A1)

$= 0.371$ A1

[3 marks]

Examiners report

[N/A]

The mean number of squirrels in a certain area is known to be 3.2 squirrels per hectare of woodland. Within this area, there is a 56 hectare woodland nature reserve. It is known that there are currently at least 168 squirrels in this reserve.

Assuming the population of squirrels follow a Poisson distribution, calculate the probability that there are more than 190 squirrels in the reserve.

Markscheme

* This question is from an exam for a previous syllabus, and may contain minor differences in marking or structure.

X is number of squirrels in reserve
X ∼ Po(179.2) A1

Note: Award A1 if 179.2 or 56 × 3.2 seen or implicit in future calculations.

recognising conditional probability M1

P(X > 190 | X ≥ 168)

$= \frac{{{\text{P}}\left( {X > 190} \right)}}{{{\text{P}}\left( {X \geqslant 168} \right)}} = \left( {\frac{{0.19827 \ldots }}{{0.80817 \ldots }}} \right)$ (A1)(A1)

= 0.245 A1

[5 marks]

Examiners report

[N/A]