User interface language: English | Español

HL Paper 2

Charlotte decides to model the shape of a cupcake to calculate its volume.

From rotating a photograph of her cupcake she estimates that its cross-section passes through the points (0, 3.5), (4, 6), (6.5, 4), (7, 3) and (7.5, 0), where all units are in centimetres. The cross-section is symmetrical in the x-axis, as shown below:

She models the section from (0, 3.5) to (4, 6) as a straight line.

Charlotte models the section of the cupcake that passes through the points (4, 6), (6.5, 4), (7, 3) and (7.5, 0) with a quadratic curve.

Charlotte thinks that a quadratic with a maximum point at (4, 6) and that passes through the point (7.5, 0) would be a better fit.

Believing this to be a better model for her cupcake, Charlotte finds the volume of revolution about the x-axis to estimate the volume of the cupcake.

Find the equation of the line passing through these two points.

[2]
a.

Find the equation of the least squares regression quadratic curve for these four points.

[2]
b.i.

By considering the gradient of this curve when x=4, explain why it may not be a good model.

[1]
b.ii.

Find the equation of the new model.

[4]
c.

Write down an expression for her estimate of the volume as a sum of two integrals.

[4]
d.i.

Find the value of Charlotte’s estimate.

[1]
d.ii.

Markscheme

y=58x+72   y=0.625x+3.5                  A1A1


Note:
Award A1 for 0.625x, A1 for 3.5.
Award a maximum of A0A1 if not part of an equation.


[2 marks]

a.

y=-0.975x2+9.56x-16.7                  (M1)A1

y=-0.974630x2+9.55919x-16.6569


[2 marks]

b.i.

gradient of curve is positive at x=4                 R1


Note: Accept a sensible rationale that refers to the gradient.


[1 mark]

b.ii.

METHOD 1

let y=ax2+bx+c

differentiating or using x=-b2a                       (M1)

8a+b=0

substituting in the coordinates
7.52a+7.5b+c=0                       (A1)
42a+4b+c=6                       (A1)

solve to get
y=-2449x2+19249x-9049  OR  y=-0.490x2+3.92x-1.84                       A1


Note: Use of quadratic regression with points using the symmetry of the graph is a valid method.

 

METHOD 2

y=ax-42+6                       (M1)

0=a7.5-42+6                       (M1)

a=-2449                       (A1)

y=-2449x-42+6  OR  y=-0.490x-42+6                       A1

 

[4 marks]

c.

π0458x+3.52dx+π47.5-2449x-42+62dx                       (M1)(M1) (M1)A1


Note: Award (M1)(M1)(M1)A0 if π is omitted but response is otherwise correct. Award (M1) for an integral that indicates volume, (M1) for their part (a) within their volume integral, (M1) for their part (b)(i) within their volume integral, A1 for their correct two integrals with all correct limits.

 

[4 marks]

d.i.

501 cm3  501.189                      A1

 

[1 mark]

d.ii.

Examiners report

[N/A]
a.
[N/A]
b.i.
[N/A]
b.ii.
[N/A]
c.
[N/A]
d.i.
[N/A]
d.ii.



It is known that the weights of male Persian cats are normally distributed with mean 6.1kg and variance 0.52kg2.

A group of 80 male Persian cats are drawn from this population.

The male cats are now joined by 80 female Persian cats. The female cats are drawn from a population whose weights are normally distributed with mean 4.5kg and standard deviation 0.45kg.

Ten female cats are chosen at random.

Sketch a diagram showing the above information.

[2]
a.

Find the proportion of male Persian cats weighing between 5.5kg and 6.5kg.

[2]
b.

Determine the expected number of cats in this group that have a weight of less than 5.3kg.

[3]
c.

Find the probability that exactly one of them weighs over 4.62kg.

[4]
d.i.

Let N be the number of cats weighing over 4.62kg.

Find the variance of N.

[1]
d.ii.

A cat is selected at random from all 160 cats.

Find the probability that the cat was female, given that its weight was over 4.7kg.

[4]
e.

Markscheme

                A1A1


Note: Award A1 for a normal curve with mean labelled 6.1 or μ, A1 for indication of SD (0.5): marks on horizontal axis at 5.6 and/or 6.6 OR μ-0.5 and/or μ+0.5 on the correct side and approximately correct position.

[2 marks]

a.

X~N6.1, 0.52

P5.5<X<6.5  OR  labelled sketch of region                (M1)

=0.673  0.673074                A1


[2 marks]

b.

PX<5.3= 0.0547992                (A1)

0.0547992×80                (M1)

=4.38   4.38393                A1


[3 marks]

c.

Y~N4.5, 0.452,

PY>4.62= 0.394862                (A1)

use of binomial seen or implied                (M1)

using B10, 0.394862                (M1)

0.0430  0.0429664                A1


[4 marks]

d.i.

np1-p=2.39  2.38946                A1


[1 mark]

d.ii.

PFW>4.7=0.5×0.3284  =0.1642                    (A1)

attempt use of tree diagram OR use of PFW>4.7=PFW>4.7PW>4.7                    (M1)

0.5×0.32840.5×0.9974+0.5×0.3284                    (A1)

=0.248  0.247669                A1


[4 marks]

e.

Examiners report

[N/A]
a.
[N/A]
b.
[N/A]
c.
[N/A]
d.i.
[N/A]
d.ii.
[N/A]
e.



A city has two cable companies, X and Y. Each year 20 % of the customers using company X move to company Y and 10 % of the customers using company Y move to company X. All additional losses and gains of customers by the companies may be ignored.

Initially company X and company Y both have 1200 customers.

Write down a transition matrix T representing the movements between the two companies in a particular year.

[2]
a.

Find the eigenvalues and corresponding eigenvectors of T.

[4]
b.

Hence write down matrices P and D such that T = PDP−1.

[2]
c.

Find an expression for the number of customers company X has after n years, where n N .

[5]
d.

Hence write down the number of customers that company X can expect to have in the long term.

[1]
e.

Markscheme

( 0.8 0.1 0.2 0.9 )       M1A1

[2 marks]

a.

| 0.8 λ 0.1 0.2 0.9 λ | = 0       M1

λ = 1 and 0.7      A1

eigenvectors  ( 1 2 ) and  ( 1 1 )      (M1)A1

Note: Accept any scalar multiple of the eigenvectors.

[4 marks]

b.

EITHER

P ( 1 1 2 1 )   D ( 1 0 0 0.7 )        A1A1

OR

P = ( 1 1 1 2 )   D = ( 0.7 0 0 1 )       A1A1

[2 marks]

c.

P−1 = 1 3 ( 1 1 2 1 )        A1

1 3 ( 1 1 2 1 ) ( 1 0 0 0.7 n ) ( 1 1 2 1 ) ( 1200 1200 )        M1A1

attempt to multiply matrices         M1

so in company A, after n years,  400 ( 2 + 0.7 n )          A1

[5 marks]

d.

400 × 2 = 800        A1

[1 mark]

e.

Examiners report

[N/A]
a.
[N/A]
b.
[N/A]
c.
[N/A]
d.
[N/A]
e.



In a small village there are two doctors’ clinics, one owned by Doctor Black and the other owned by Doctor Green. It was noted after each year that 3.5% of Doctor Black’s patients moved to Doctor Green’s clinic and 5% of Doctor Green’s patients moved to Doctor Black’s clinic. All additional losses and gains of patients by the clinics may be ignored.

At the start of a particular year, it was noted that Doctor Black had 2100 patients on their register, compared to Doctor Green’s 3500 patients.

Write down a transition matrix T indicating the annual population movement between clinics.

[2]
a.

Find a prediction for the ratio of the number of patients Doctor Black will have, compared to Doctor Green, after two years.

[2]
b.

Find a matrix P, with integer elements, such that T=PDP1, where D is a diagonal matrix.

[6]
c.

Hence, show that the long-term transition matrix T is given by T=10171017717717.

[6]
d.

Hence, or otherwise, determine the expected ratio of the number of patients Doctor Black would have compared to Doctor Green in the long term.

[2]
e.

Markscheme

T=0.9650.050.0350.95        M1A1


Note: Award M1A1 for T=0.950.0350.050.965.
Award the A1 for a transposed T if used correctly in part (b) i.e. preceded by 1×2 matrix 2100    3500 rather than followed by a 2×1 matrix.


[2 marks]

a.

0.9650.050.0350.95221003500        (M1)

=22943306

so ratio is 2294:3306  =1147:1653, 0.693889        A1


[2 marks]

b.

to solve Ax=λx:

0.965-λ0.050.0350.95-λ=0        (M1)

0.965-λ0.95-λ-0.05×0.035=0

λ=0.915  OR  λ=1        (A1)

attempt to find eigenvectors for at least one eigenvalue        (M1)

when λ=0.915, x=1-1 (or any real multiple)        (A1)

when λ=1, x=107 (or any real multiple)        (A1)

therefore P=110-17 (accept integer valued multiples of their eigenvectors and columns in either order)        A1


[6 marks]

c.

P-1=110-17-1=1177-1011            (A1)


Note: This mark is independent, and may be seen anywhere in part (d).


D=0.915001            (A1)

Tn=PDnP-1=110-170.915n001n1177-1011            (M1)A1


Note:
Award (M1)A0 for finding P-1DnP correctly.


as n, Dn=0.915n001n0001                  R1

so Tn117110-1700017-1011                 A1

=10171017717717                   AG


Note: The AG line must be seen for the final A1 to be awarded.


[6 marks]

d.

METHOD ONE

1017101771771721003500=32942306            (M1)

so ratio is  3294:2306   1647:1153,  1.42844, 0.700060                A1

 

METHOD TWO

long term ratio is the eigenvector associated with the largest eigenvalue            (M1)

10:7                A1


[2 marks]

e.

Examiners report

[N/A]
a.
[N/A]
b.
[N/A]
c.
[N/A]
d.
[N/A]
e.



The masses in kilograms of melons produced by a farm can be modelled by a normal distribution with a mean of 2.6 kg and a standard deviation of 0.5 kg.

Find the probability that two melons picked at random and independently of each other will

One year due to favourable weather conditions it is thought that the mean mass of the melons has increased.

The owner of the farm decides to take a random sample of 16 melons to test this hypothesis at the 5% significance level, assuming the standard deviation of the masses of the melons has not changed.

Unknown to the farmer the favourable weather conditions have led to all the melons having 10% greater mass than the model described above.

Find the probability that a melon selected at random will have a mass greater than 3.0kg.

[2]
a.

both have a mass greater than 3.0kg.

[2]
b.i.

have a total mass greater than 6.0kg.

[2]
b.ii.

Write down the null and alternative hypotheses for the test.

[1]
c.

Find the critical region for this test.

[4]
d.

Find the mean and standard deviation of the mass of the melons for this year.

[3]
e.

Find the probability of a Type II error in the owner’s test.

[2]
f.

Markscheme

* This sample question was produced by experienced DP mathematics senior examiners to aid teachers in preparing for external assessment in the new MAA course. There may be minor differences in formatting compared to formal exam papers.

Let X represent the mass of a melon

PX>3.0=0.212  0.2118       (M1)A1

 

[2 marks]

a.

0.2118×0.2118       (M1)

=0.0449  0.04488         A1

 

[2 marks]

b.i.

Let T represent the total mass

ET=5.2         A1

VarT=0.52+0.52=0.5       (M1)A1

T~N5.2,0.5

PT>6.0=0.129  0.1289         A1

 

[4 marks]

b.ii.

Let μ be the mean mass of the melons produced by the farm.

H0: μ=2.6kg, H1: μ>2.6kg only         A1

 

Note: Accept H0: The mean mass of melons produced by the farm is equal to 2.6kg
H1: The mean mass of melons produced by the farm is greater than 2.6kg

Note: Award A0 if 2.6kg does not appear in the hypothesis.

 

[1 mark]

c.

Under H0  X¯~N2.6,0.5216         A1

PX¯>a=0.05          (M1)

a=2.81 2.805606          (A1)

Critical region is X¯>2.81         A1

 

[4 marks]

d.

Let W represent the new mass of the melons

EW=1.1×2.6=2.86         A1

Standard deviation of W=1.1×0.5          (M1)

=0.55         A1

 

Note: award M1A0 for VarW=1.12×0.52=0.3025

 

[3 marks]

e.

P(Type II error)=

PX¯<2.81  μ=2.86,σ=0.554          (M1)

=0.346  0.346204         A1

 

Note: Accept 0.358 from use of the three‐figure answer to part (d)

 

[2 marks]

f.

Examiners report

[N/A]
a.
[N/A]
b.i.
[N/A]
b.ii.
[N/A]
c.
[N/A]
d.
[N/A]
e.
[N/A]
f.



The random variable X is thought to follow a binomial distribution B (4, p ). In order to investigate this belief, a random sample of 100 observations on X was taken with the following results.

An automatic machine is used to fill bottles of water. The amount delivered, Y  ml, may be assumed to be normally distributed with mean μ  ml and standard deviation 8 ml. Initially, the machine is adjusted so that the value of μ is 500. In order to check that the value of μ remains equal to 500, a random sample of 10 bottles is selected at regular intervals, and the mean amount of water, y ¯ , in these bottles is calculated. The following hypotheses are set up.

H0: μ  = 500;  H1: μ  ≠ 500

The critical region is defined to be  ( y ¯ < 495 ) ( y ¯ > 505 ) .

State suitable hypotheses for testing this belief.

[1]
a.i.

Calculate the mean of these data and hence estimate the value of p .

[5]
a.ii.

Calculate an appropriate value of χ 2  and state your conclusion, using a 1% significance level.

[13]
a.iii.

Find the significance level of this procedure.

[5]
b.i.

Some time later, the actual value of μ  is 503. Find the probability of a Type II error.

[3]
b.ii.

Markscheme

H: The data are B (4, p ); H1 : The data are not B (4, p )       A1

[1 mark]

a.i.

Mean  = 1 × 32 + + 4 × 14 100        M1A1

= 1.8       A1

4 p ^ = 1.8 p ^ = 0.45        M1A1

[5 marks]

a.ii.

The expected frequencies are

   A1A1A1A1A1

The last two classes must be combined because the expected frequency for x = 4 is less than 5.       R1

χ 2 = 17 2 9.15 + 32 2 29.95 + 19 2 36.75 + 32 2 24.15 100        M2

= 18.0        A2

DF = 2        (A1)

Critical value = 9.21        A1         

We conclude, at the 1% significance level, that X does not fit a binomial model.           R1         

 

Special case: award the following marks to candidates who do not combine classes.

χ 2 = 17 2 9.15 + 32 2 29.95 + 19 2 36.75 + 18 2 20.05 + 14 2 4.1 100        M2

= 39.6        A0

DF = 3        (A1)

Critical value = 11.345        A1         

We conclude, at the 1% significance level, that X does not fit a binomial model.           R1

 

[13 marks]

a.iii.

Under H0, the distribution of y ¯ is N (500, 6.4).      (A1)

Significance level = P y ¯  < 495 or > 505 | H0       M2

                             = 2 × 0.02405      (A1)

                             = 0.0481       A1 N5

[5 marks]

b.i.

The distribution of y ¯  is now N (503, 6.4).      (A1)

P(Type ΙΙ error) = P(495 < y ¯  < 505)      (M1)

                       = 0.785       A1 N3

[3 marks]

b.ii.

Examiners report

[N/A]
a.i.
[N/A]
a.ii.
[N/A]
a.iii.
[N/A]
b.i.
[N/A]
b.ii.



A student investigating the relationship between chemical reactions and temperature finds the Arrhenius equation on the internet.

k=Ae-cT

This equation links a variable k with the temperature T, where A and c are positive constants and T>0.

The Arrhenius equation predicts that the graph of lnk against 1T is a straight line.

Write down

The following data are found for a particular reaction, where T is measured in Kelvin and k is measured in cm3mol1s1:

Find an estimate of

Show that dkdT is always positive.

[3]
a.

Given that limTk=A and limT0k=0, sketch the graph of k against T.

[3]
b.

(i)   the gradient of this line in terms of c;

(ii)  the y-intercept of this line in terms of A.

[4]
c.

Find the equation of the regression line for lnk on 1T.

[2]
d.

c.

It is not required to state units for this value.

[1]
e.i.

A.

It is not required to state units for this value.

[2]
e.ii.

Markscheme

attempt to use chain rule, including the differentiation of 1T          (M1)

dkdT=A×cT2×e-cT          A1

this is the product of positive quantities so must be positive          R1


Note: The R1 may be awarded for correct argument from their derivative. R1 is not possible if their derivative is not always positive.

 

[3 marks]

a.

         A1A1A1

 

Note: Award A1 for an increasing graph, entirely in first quadrant, becoming concave down for larger values of T, A1 for tending towards the origin and A1 for asymptote labelled at k=A.

 

[3 marks]

b.

taking ln of both sides   OR   substituting y=lnx  and  x=1T           (M1)

lnk=lnA-cT  OR  y=-cx+lnA           (A1)


(i)   so gradient is -c         A1


(ii)  y-intercept is lnA         A1

 

Note: The implied (M1) and (A1) can only be awarded if both correct answers are seen. Award zero if only one value is correct and no working is seen.

 

[4 marks]

c.

an attempt to convert data to 1T and lnk           (M1)

e.g. at least one correct row in the following table

line is lnk=-13400×1T+15.0   =-13383.1×1T+15.0107         A1

 

[2 marks]

d.

c=13400   13383.1         A1

 

[1 mark]

e.i.

attempt to rearrange or solve graphically lnA=15.0107          (M1)

A=3300000    3304258         A1

 Note: Accept an A value of 3269017… from use of 3sf value.

[2 marks]

e.ii.

Examiners report

This question caused significant difficulties for many candidates and many did not even attempt the question. Very few candidates were able to differentiate the expression in part (a) resulting in difficulties for part (b). Responses to parts (c) to (e) illustrated a lack of understanding of linearizing a set of data. Those candidates that were able to do part (d) frequently lost a mark as their answer was given in x and y.

a.
[N/A]
b.
[N/A]
c.
[N/A]
d.
[N/A]
e.i.
[N/A]
e.ii.



Willow finds that she receives approximately 70 emails per working day.

She decides to model the number of emails received per working day using the random variable X , where X follows a Poisson distribution with mean 70.

In order to test her model, Willow records the number of emails she receives per working day over a period of 6 months. The results are shown in the following table.

From the table, calculate

Archie works for a different company and knows that he receives emails according to a Poisson distribution, with a mean of λ emails per day.

Using this distribution model, find  P ( X < 60 ) .

[2]
a.i.

Using this distribution model, find the standard deviation of X .

[2]
a.ii.

an estimate for the mean number of emails received per working day.

[3]
b.i.

an estimate for the standard deviation of the number of emails received per working day.

[2]
b.ii.

Give one piece of evidence that suggests Willow’s Poisson distribution model is not a good fit.

[1]
c.

Suppose that the probability of Archie receiving more than 10 emails in total on any one day is 0.99. Find the value of λ.

[3]
d.

Now suppose that Archie received exactly 20 emails in total in a consecutive two day period. Show that the probability that he received exactly 10 of them on the first day is independent of λ.

[5]
e.

Markscheme

P ( X < 60 )

= P ( X 59 )      (M1)

= 0.102     A1

 

[2 marks]

a.i.

standard deviation =  70 (= 8.37)      (M1)A1

 

[2 marks]

a.ii.

use of midpoints (accept consistent use of 45, 55 etc.)     (M1)

44.5 × 2 + 54.5 × 15 + 64.5 × 40 + 74.5 × 53 + 94.5 + 104.5 × 3 + 114.5 × 6 2 + 15 + 40 + 53 + 0 + 1 + 3 + 6       (M1)

= 8530 120 ( = 71.1 )       A1

Note: If 45, 55, etc. are used consistently instead of midpoints (implied by the answer 71.58…) award M1M1A0.

 

[3 marks]

b.i.

13.9     (M1)A1

 

[2 marks]

b.ii.

valid reason given to include the examples below       R1

variance is 192 which is not close to the mean (accept not equal to) standard deviation too high (using parts (a)(ii) and (b)(ii))

relative frequency of X ≤ 59 is 0.142 which is too high (using part (a)(i))

Poisson would give a frequency of roughly 14 for 80 ≤ X  ≤ 89

Note: Reasons which do not use values found in previous parts must be backed up with numerical evidence.

 

[1 mark]

c.

 

P ( Y > 10 ) = 0.99

1 P ( Y 10 ) = 0.99 P ( Y 10 ) = 0.01       (M1)

attempt to solve a correct equation       (M1)

λ = 20.1       A1

 

[3 marks]

d.

in 1 day, no of emails is X ~ Po(λ)

in 2 days, no of emails is Y ~ Po(2λ)       (A1)

P(10 on first day | 20 in 2 days)        (M1)

P ( X = 10 ) × P ( X = 10 ) P ( Y = 20 )         (M1)

= ( λ 10 e λ 10 ! ) 2 ( 2 λ ) 20 e 2 λ 20 !      A1

= λ 20 e 2 λ 2 20 λ 20 e 2 λ × 20 ! ( 10 ! ) 2       A1

= 20 ! 2 20 ( 10 ! ) 2

which is independent of λ       AG

 

[5 marks]

e.

Examiners report

[N/A]
a.i.
[N/A]
a.ii.
[N/A]
b.i.
[N/A]
b.ii.
[N/A]
c.
[N/A]
d.
[N/A]
e.



Long term experience shows that if it is sunny on a particular day in Vokram, then the probability that it will be sunny the following day is 0.8. If it is not sunny, then the probability that it will be sunny the following day is 0.3.

The transition matrix T is used to model this information, where T=0.8 0.30.2 0.7.

The matrix T can be written as a product of three matrices, PDP-1 , where D is a diagonal matrix.

It is sunny today. Find the probability that it will be sunny in three days’ time.

[2]
a.

Find the eigenvalues and eigenvectors of T.

[5]
b.

Write down the matrix P.

[1]
c.i.

Write down the matrix D.

[1]
c.ii.

Hence find the long-term percentage of sunny days in Vokram.

[4]
d.

Markscheme

finding T3  OR  use of tree diagram                     (M1)

T3=0.65  0.5250.35  0.475

the probability of sunny in three days’ time is 0.65                     A1

 

[2 marks]

a.

attempt to find eigenvalues                    (M1)

 


Note:
Any indication that detT-λI=0 has been used is sufficient for the (M1).


0.8-λ  0.30.2  0.7-λ=0.8-λ0.7-λ-0.06=0

λ2-1.5λ+0.5=0

λ=1, λ=0.5                     A1

attempt to find either eigenvector                   (M1)

0.8x+0.3y=x-0.2x+0.3y=0 so an eigenvector is 32                     A1

0.8x+0.3y=0.5x0.3x+0.3y=0 so an eigenvector is 1-1                     A1


Note: Accept multiples of the stated eigenvectors.

 

[5 marks]

b.

P=3 12 -1  OR  P=1 3-1 2                    A1


Note: Examiners should be aware that different, correct, matrices P may be seen.

 

[1 mark]

c.i.

D=1 00 0.5  OR  D=0.5 00 1                    A1


Note: P and D must be consistent with each other.

 

[1 mark]

c.ii.

0.5n0                             (M1)

Dn=1 00 0  OR  Dn=0 00 1                   (A1)


Note: Award A1 only if their Dn corresponds to their P


PDnP-1=0.6 0.60.4 0.4                   (M1)

60%                      A1

 

[4 marks]

d.

Examiners report

[N/A]
a.
[N/A]
b.
[N/A]
c.i.
[N/A]
c.ii.
[N/A]
d.



A continuous random variable X has probability density function f given by

f ( x ) = { x 2 a + b , 0 x 4 0 otherwise where  a  and  b  are positive constants.

It is given that P ( X 2 ) = 0.75 .

Eight independent observations of X are now taken and the random variable Y is the number of observations such that X 2 .

Show that a = 32 and b = 1 12 .

[5]
a.

Find E ( X ) .

[2]
b.

Find Var ( X ) .

[2]
c.

Find the median of X .

[3]
d.

Find E ( Y ) .

[2]
e.

Find P ( Y 3 ) .

[1]
f.

Markscheme

* This question is from an exam for a previous syllabus, and may contain minor differences in marking or structure.

0 4 ( x 2 a + b ) d x = 1 [ x 3 3 a + b x ] 0 4 = 1 64 3 a + 4 b = 1     M1A1

2 4 ( x 2 a + b ) d x = 0.75 56 3 a + 2 b = 0.75     M1A1

 

Note:    0 2 ( x 2 a + b ) d x = 0.25 8 3 a + 2 b = 0.25 could be seen/used in place of either of the above equations.

 

evidence of an attempt to solve simultaneously (or check given a,b values are consistent)     M1

a = 32 ,   b = 1 12      AG

[5 marks]

a.

E ( X ) = 0 4 x ( x 2 32 + 1 12 ) d x     (M1)

E ( X ) = 8 3 ( = 2.67 )      A1

[2 marks]

b.

E ( X 2 ) = 0 4 x 2 ( x 2 32 + 1 12 ) d x      (M1)

Var ( X ) = E ( X 2 ) [ E ( X ) ] 2 = 16 15 ( = 1.07 )      A1

[2 marks]

c.

0 m ( x 2 32 + 1 12 ) d x = 0.5     (M1)

m 3 96 + m 12 = 0.5 ( m 3 + 8 m 48 = 0 )      (A1)

m = 2.91      A1

[3 marks]

d.

Y B ( 8 ,   0.75 )      (M1)

E ( Y ) = 8 × 0.75 = 6      A1

[2 marks]

e.

P ( Y 3 ) = 0.996      A1

[1 mark]

f.

Examiners report

[N/A]
a.
[N/A]
b.
[N/A]
c.
[N/A]
d.
[N/A]
e.
[N/A]
f.



Arianne plays a game of darts.

The distance that her darts land from the centre, O, of the board can be modelled by a normal distribution with mean 10cm and standard deviation 3cm.

Find the probability that

Each of Arianne’s throws is independent of her previous throws.

In a competition a player has three darts to throw on each turn. A point is scored if a player throws all three darts to land within a central area around O. When Arianne throws a dart the probability that it lands within this area is 0.8143.

In the competition Arianne has ten turns, each with three darts.

a dart lands less than 13cm from O.

[2]
a.i.

a dart lands more than 15cm from O.

[1]
a.ii.

Find the probability that Arianne throws two consecutive darts that land more than 15cm from O.

[2]
b.

Find the probability that Arianne does not score a point on a turn of three darts.

[2]
c.

Find Arianne’s expected score in the competition.

[4]
d.i.

Find the probability that Arianne scores at least 5 points in the competition.

[1]
d.ii.

Find the probability that Arianne scores at least 5 points and less than 8 points.

[2]
d.iii.

Given that Arianne scores at least 5 points, find the probability that Arianne scores less than 8 points.

[2]
d.iv.

Markscheme

Let X be the random variable “distance from O”.

X~N10, 32

PX<13=0.841  0.841344            (M1)(A1)

 

[2 marks]

a.i.

PX>15=  0.0478  0.0477903            A1

 

[1 mark]

a.ii.

PX>15×PX>15            (M1)

=0.00228  0.00228391            A1

 

[2 marks]

b.

1-0.81433            (M1)

0.460  0.460050            A1

 

[2 marks]

c.

let Y be the random variable “number of points scored”

evidence of use of binomial distribution           (M1)

Y~B10, 0.539949           (A1)

EY= 10×0.539949           (M1)

           =5.40            A1

 

[4 marks]

d.i.

PY5= 0.717  0.716650            A1

 

[1 mark]

d.ii.

P5Y<8           (M1)

=0.628  0.627788            A1


Note: Award M1 for a correct probability statement or indication of correct lower and upper bounds, 5 and 7.

[2 marks]

d.iii.

P5Y<8PY5 =0.6277880.716650           (M1)

=0.876  0.876003            A1

 

[2 marks]

d.iv.

Examiners report

Most candidates started this question well and applied the normal distribution correctly in part (a). The same goes for part (b) where the candidates were able to combine probabilities correctly. Part (c) was not very well done, and there were a surprising number of incorrect approaches on a seemingly straightforward problem. This suggests that candidates were not interpreting the problem correctly and there was a lack of careful reading to be sure of the scenario being described. In part (d) most candidates recognized the need to model the situation with the binomial distribution. However, many candidates did not choose a correct probability for the Bernoulli trial in this question and oversimplified the problem. This again seems to be a problem of “interpretation” rather than “conceptual understanding”.

a.i.

Most candidates started this question well and applied the normal distribution correctly in part (a). The same goes for part (b) where the candidates were able to combine probabilities correctly. Part (c) was not very well done, and there were a surprising number of incorrect approaches on a seemingly straightforward problem. This suggests that candidates were not interpreting the problem correctly and there was a lack of careful reading to be sure of the scenario being described. In part (d) most candidates recognized the need to model the situation with the binomial distribution. However, many candidates did not choose a correct probability for the Bernoulli trial in this question and oversimplified the problem. This again seems to be a problem of “interpretation” rather than “conceptual understanding”.

a.ii.

Most candidates started this question well and applied the normal distribution correctly in part (a). The same goes for part (b) where the candidates were able to combine probabilities correctly. Part (c) was not very well done, and there were a surprising number of incorrect approaches on a seemingly straightforward problem. This suggests that candidates were not interpreting the problem correctly and there was a lack of careful reading to be sure of the scenario being described. In part (d) most candidates recognized the need to model the situation with the binomial distribution. However, many candidates did not choose a correct probability for the Bernoulli trial in this question and oversimplified the problem. This again seems to be a problem of “interpretation” rather than “conceptual understanding”.

b.

Most candidates started this question well and applied the normal distribution correctly in part (a). The same goes for part (b) where the candidates were able to combine probabilities correctly. Part (c) was not very well done, and there were a surprising number of incorrect approaches on a seemingly straightforward problem. This suggests that candidates were not interpreting the problem correctly and there was a lack of careful reading to be sure of the scenario being described. In part (d) most candidates recognized the need to model the situation with the binomial distribution. However, many candidates did not choose a correct probability for the Bernoulli trial in this question and oversimplified the problem. This again seems to be a problem of “interpretation” rather than “conceptual understanding”.

c.

Most candidates started this question well and applied the normal distribution correctly in part (a). The same goes for part (b) where the candidates were able to combine probabilities correctly. Part (c) was not very well done, and there were a surprising number of incorrect approaches on a seemingly straightforward problem. This suggests that candidates were not interpreting the problem correctly and there was a lack of careful reading to be sure of the scenario being described. In part (d) most candidates recognized the need to model the situation with the binomial distribution. However, many candidates did not choose a correct probability for the Bernoulli trial in this question and oversimplified the problem. This again seems to be a problem of “interpretation” rather than “conceptual understanding”.

d.i.

Most candidates started this question well and applied the normal distribution correctly in part (a). The same goes for part (b) where the candidates were able to combine probabilities correctly. Part (c) was not very well done, and there were a surprising number of incorrect approaches on a seemingly straightforward problem. This suggests that candidates were not interpreting the problem correctly and there was a lack of careful reading to be sure of the scenario being described. In part (d) most candidates recognized the need to model the situation with the binomial distribution. However, many candidates did not choose a correct probability for the Bernoulli trial in this question and oversimplified the problem. This again seems to be a problem of “interpretation” rather than “conceptual understanding”.

d.ii.

Most candidates started this question well and applied the normal distribution correctly in part (a). The same goes for part (b) where the candidates were able to combine probabilities correctly. Part (c) was not very well done, and there were a surprising number of incorrect approaches on a seemingly straightforward problem. This suggests that candidates were not interpreting the problem correctly and there was a lack of careful reading to be sure of the scenario being described. In part (d) most candidates recognized the need to model the situation with the binomial distribution. However, many candidates did not choose a correct probability for the Bernoulli trial in this question and oversimplified the problem. This again seems to be a problem of “interpretation” rather than “conceptual understanding”.

d.iii.

Most candidates started this question well and applied the normal distribution correctly in part (a). The same goes for part (b) where the candidates were able to combine probabilities correctly. Part (c) was not very well done, and there were a surprising number of incorrect approaches on a seemingly straightforward problem. This suggests that candidates were not interpreting the problem correctly and there was a lack of careful reading to be sure of the scenario being described. In part (d) most candidates recognized the need to model the situation with the binomial distribution. However, many candidates did not choose a correct probability for the Bernoulli trial in this question and oversimplified the problem. This again seems to be a problem of “interpretation” rather than “conceptual understanding”.

d.iv.



A Chocolate Shop advertises free gifts to customers that collect three vouchers. The vouchers are placed at random into 10% of all chocolate bars sold at this shop. Kati buys some of these bars and she opens them one at a time to see if they contain a voucher. Let P ( X = n ) be the probability that Kati obtains her third voucher on the n th  bar opened.

(It is assumed that the probability that a chocolate bar contains a voucher stays at 10% throughout the question.)

It is given that P ( X = n ) = n 2 + a n + b 2000 × 0.9 n 3 for n 3 ,   n N .

Kati’s mother goes to the shop and buys x  chocolate bars. She takes the bars home for Kati to open.

Show that P ( X = 3 ) = 0.001 and P ( X = 4 ) = 0.0027 .

[3]
a.

Find the values of the constants a and b .

[5]
b.

Deduce that P ( X = n ) P ( X = n 1 ) = 0.9 ( n 1 ) n 3 for n > 3 .

[4]
c.

(i)     Hence show that X has two modes m 1 and m 2 .

(ii)     State the values of m 1 and m 2 .

[5]
d.

Determine the minimum value of x such that the probability Kati receives at least one free gift is greater than 0.5.

[3]
e.

Markscheme

* This question is from an exam for a previous syllabus, and may contain minor differences in marking or structure.

P ( X = 3 ) = ( 0.1 ) 3    A1

= 0.001    AG

P ( X = 4 ) = P ( V V V ¯ V ) + P ( V V ¯ V V ) + P ( V ¯ V V V )    (M1)

= 3 × ( 0.1 ) 3 × 0.9 (or equivalent)     A1

= 0.0027    AG

[3 marks]

a.

METHOD 1

attempting to form equations in a and b     M1

9 + 3 a + b 2000 = 1 1000   ( 3 a + b = 7 )    A1

16 + 4 a + b 2000 × 9 10 = 27 10 000   ( 4 a + b = 10 )    A1

attempting to solve simultaneously     (M1)

a = 3 ,   b = 2    A1

METHOD 2

P ( X = n ) = ( n 1 2 ) × 0.1 3 × 0.9 n 3    M1

= ( n 1 ) ( n 2 ) 2000 × 0.9 n 3    (M1)A1

= n 2 3 n + 2 2000 × 0.9 n 3    A1

a = 3 , b = 2    A1

 

Note: Condone the absence of 0.9 n 3 in the determination of the values of a and b .

 

[5 marks]

b.

METHOD 1

EITHER

P ( X = n ) = n 2 3 n + 2 2000 × 0.9 n 3    (M1)

OR

P ( X = n ) = ( n 1 2 ) × 0.1 3 × 0.9 n 3    (M1)

THEN

= ( n 1 ) ( n 2 ) 2000 × 0.9 n 3    A1

P ( X = n 1 ) = ( n 2 ) ( n 3 ) 2000 × 0.9 n 4    A1

P ( X = n ) P ( X = n 1 ) = ( n 1 ) ( n 2 ) ( n 2 ) ( n 3 ) × 0.9    A1

= 0.9 ( n 1 ) n 3    AG

METHOD 2

P ( X = n ) P ( X = n 1 ) = n 2 3 n + 2 2000 × 0.9 n 3 ( n 1 ) 2 3 ( n 1 ) + 2 2000 × 0.9 n 4    (M1)

= 0.9 ( n 2 3 n + 2 ) ( n 2 5 n + 6 )    A1A1

 

Note: Award A1 for a correct numerator and A1 for a correct denominator.

 

= 0.9 ( n 1 ) ( n 2 ) ( n 2 ) ( n 3 )    A1

= 0.9 ( n 1 ) n 3    AG

[4 marks]

c.

(i)     attempting to solve 0.9 ( n 1 ) n 3 = 1 for n     M1

n = 21    A1

0.9 ( n 1 ) n 3 < 1 n > 21    R1

0.9 ( n 1 ) n 3 > 1 n < 21    R1

X has two modes     AG

 

Note: Award R1R1 for a clearly labelled graphical representation of the two inequalities (using P ( X = n ) P ( X = n 1 ) ).

 

(ii)     the modes are 20 and 21     A1

[5 marks]

d.

METHOD 1

Y B ( x ,   0.1 )    (A1)

attempting to solve P ( Y 3 ) > 0.5 (or equivalent eg  1 P ( Y 2 ) > 0.5 ) for x     (M1)

 

Note: Award (M1) for attempting to solve an equality (obtaining x = 26.4 ).

 

x = 27    A1

METHOD 2

n = 0 x P ( X = n ) > 0.5    (A1)

attempting to solve for x     (M1)

x = 27    A1

[3 marks]

e.

Examiners report

[N/A]
a.
[N/A]
b.
[N/A]
c.
[N/A]
d.
[N/A]
e.



The weights, in grams, of individual packets of coffee can be modelled by a normal distribution, with mean 102g and standard deviation 8g.

Find the probability that a randomly selected packet has a weight less than 100g.

[2]
a.

The probability that a randomly selected packet has a weight greater than w grams is 0.444. Find the value of w.

[2]
b.

A packet is randomly selected. Given that the packet has a weight greater than 105g, find the probability that it has a weight greater than 110g.

[3]
c.

From a random sample of 500 packets, determine the number of packets that would be expected to have a weight lying within 1.5 standard deviations of the mean.

[3]
d.

Packets are delivered to supermarkets in batches of 80. Determine the probability that at least 20 packets from a randomly selected batch have a weight less than 95g.

[4]
e.

Markscheme

* This question is from an exam for a previous syllabus, and may contain minor differences in marking or structure.

Χ~N102, 82

PΧ<100=0.401        (M1)A1


[2 marks]

a.

PΧ>w=0.444       (M1)

w=103 g        A1


[2 marks]

b.

PΧ>100Χ>105=PΧ>100Χ>105PΧ>105       (M1)

=PΧ>100PΧ>105        (A1)

=0.158650.35383

=0.448       A1


[3 marks]

c.

EITHER


P90<Χ<114=0.866        (A1)


OR


P-1.5<Z<1.5=0.866        (A1)


THEN


0.866×500       (M1)

=433         A1


[3 marks]

d.

p=PΧ<95=0.19078         (A1)

recognising  Y~B80, p        (M1)

now using  Y~B80, 0.19078        (M1)

PY20=0.116          A1


[4 marks]

e.

Examiners report

[N/A]
a.
[N/A]
b.
[N/A]
c.
[N/A]
d.
[N/A]
e.



Hank sets up a bird table in his garden to provide the local birds with some food. Hank notices that a specific bird, a large magpie, visits several times per month and he names him Bill. Hank models the number of times per month that Bill visits his garden as a Poisson distribution with mean 3.1.

Over the course of 3 consecutive months, find the probability that Bill visits the garden:

Using Hank’s model, find the probability that Bill visits the garden on exactly four occasions during one particular month.

[1]
a.

on exactly 12 occasions.

[2]
b.i.

during the first and third month only.

[3]
b.ii.

Find the probability that over a 12-month period, there will be exactly 3 months when Bill does not visit the garden.

[4]
c.

After the first year, a number of baby magpies start to visit Hank’s garden. It may be assumed that each of these baby magpies visits the garden randomly and independently, and that the number of times each baby magpie visits the garden per month is modelled by a Poisson distribution with mean 2.1.

Determine the least number of magpies required, including Bill, in order that the probability of Hank’s garden having at least 30 magpie visits per month is greater than 0.2.

[4]
d.

Markscheme

X1~Po3.1

PX1=4=0.173  0.173349                A1


[1 mark]

a.

X2~Po3×3.1=Po9.3               (M1)

PX2=12=0.0799  0.0798950                A1


[2 marks]

b.i.

PX1>02×PX1=0               (M1)

0.954952×0.04505               (A1)

=0.0411  0.0410817                A1


[3 marks]

b.ii.

PX1=0=0.04505               (A1)

X1~B12, 0.04505               (M1)(A1)


Note:
Award M1 for recognizing binomial probability, and A1 for correct parameters.


=0.0133  0.013283                A1


[4 marks]

c.

METHOD ONE

               (M1)(A1)(A1)


Note: Award M1 for evidence of a cumulative Poisson with λ=3.1+2.1n, A1 for 0.136705 and A1 for 0.253384.


so require 12 magpies (including Bill)               A1

 

METHOD TWO

evidence of a cumulative Poisson with λ=3.1+2.1n               (M1)

sketch of curve and y=0.2               (A1)

(intersect at) 10.5810               (A1)

rounding up gives n=11

so require 12 magpies (including Bill)               A1

 

[4 marks]

d.

Examiners report

[N/A]
a.
[N/A]
b.i.
[N/A]
b.ii.
[N/A]
c.
[N/A]
d.



It is given that one in five cups of coffee contain more than 120 mg of caffeine.
It is also known that three in five cups contain more than 110 mg of caffeine.

Assume that the caffeine content of coffee is modelled by a normal distribution.
Find the mean and standard deviation of the caffeine content of coffee.

Markscheme

* This question is from an exam for a previous syllabus, and may contain minor differences in marking or structure.

let X be the random variable “amount of caffeine content in coffee”

P ( X > 120 ) = 0.2 ,  P ( X > 110 ) = 0.6     (M1)

( P ( X < 120 ) = 0.8 ,  P ( X < 110 ) = 0.4 )

 

Note:     Award M1 for at least one correct probability statement.

 

120 μ σ = 0.84162 ,   110 μ σ = 0.253347     (M1)(A1)(A1)

 

Note:     Award M1 for attempt to find at least one appropriate z -value.

 

120 μ = 0.84162 σ ,   110 μ = 0.253347 σ

attempt to solve simultaneous equations     (M1)

μ = 112 ,   σ = 9.13     A1

[6 marks]

Examiners report

[N/A]



John likes to go sailing every day in July. To help him make a decision on whether it is safe to go sailing he classifies each day in July as windy or calm. Given that a day in July is calm, the probability that the next day is calm is 0.9. Given that a day in July is windy, the probability that the next day is calm is 0.3. The weather forecast for the 1st July predicts that the probability that it will be calm is 0.8.

Draw a tree diagram to represent this information for the first three days of July.

[3]
a.

Find the probability that the 3rd July is calm.

[2]
b.

Find the probability that the 1st July was calm given that the 3rd July is windy.

[4]
c.

Markscheme

* This question is from an exam for a previous syllabus, and may contain minor differences in marking or structure.

M17/5/MATHL/HP2/ENG/TZ2/05.a/M     M1A2

 

Note:     Award M1 for 3 stage tree-diagram, A2 for 0.8, 0.9, 0.3 probabilities correctly placed.

 

[3 marks]

a.

0.2 × 0.7 × 0.3 + 0.2 × 0.3 × 0.9 + 0.8 × 0.1 × 0.3 + 0.8 × 0.9 × 0.9 = 0.768      (M1)A1

[2 marks]

b.

P ( 1st July is calm | 3rd July is windy) = P ( 1st July is calm and 3rd July is windy) P ( 3rd July is windy)      (M1)

= 0.8 × 0.1 × 0.7 + 0.8 × 0.9 × 0.1 1 0.768

OR 0.8 × 0.1 × 0.7 + 0.8 × 0.9 × 0.1 0.2 × 0.7 × 0.7 + 0.2 × 0.3 × 0.1 + 0.8 × 0.1 × 0.7 + 0.8 × 0.9 × 0.1

OR 0.128 0.232      (A1)(A1)

 

Note:     Award A1 for correct numerator, A1 for correct denominator.

 

= 0.552      A1

[4 marks]

c.

Examiners report

[N/A]
a.
[N/A]
b.
[N/A]
c.



The curve  y = f ( x )  is shown in the graph, for  0 x 10 .

The curve  y = f ( x )  passes through the following points.

It is required to find the area bounded by the curve, the x -axis, the y -axis and the line x = 10 .

One possible model for the curve  y = f ( x )  is a cubic function.

Use the trapezoidal rule to find an estimate for the area.

[3]
a.

Use all the coordinates in the table to find the equation of the least squares cubic regression curve.

[3]
b.i.

Write down the coefficient of determination.

[1]
b.ii.

Write down an expression for the area enclosed by the cubic regression curve, the x -axis, the y -axis and the line x = 10 .

[1]
c.i.

Find the value of this area.

[2]
c.ii.

Markscheme

Area =  2 2 ( 2 + 2 ( 4.5 + 4.2 + 3.3 + 4.5 ) + 8 )         M1A1

Area = 43        A1

[3 marks]

a.

y = 0.0389 x 3 0.534 x 2 + 2.06 x + 2.06       M1A2

[3 marks]

b.i.

R 2 = 0.991      A1

[1 mark]

b.ii.

Area =  0 10 y d x      A1

[1 mark]

c.i.

42.5     A2

[2 marks]

c.ii.

Examiners report

[N/A]
a.
[N/A]
b.i.
[N/A]
b.ii.
[N/A]
c.i.
[N/A]
c.ii.



A café serves sandwiches and cakes. Each customer will choose one of the following three options; buy only a sandwich, buy only a cake or buy both a sandwich and a cake.

The probability that a customer buys a sandwich is 0.72 and the probability that a customer buys a cake is 0.45.

Find the probability that a customer chosen at random will buy

On a typical day 200 customers come to the café.

It is known that 46 % of the customers who come to the café are male, and that 80 % of these buy a sandwich.

both a sandwich and a cake.

[3]
a.i.

only a sandwich.

[1]
a.ii.

Find the expected number of cakes sold on a typical day.

[1]
b.i.

Find the probability that more than 100 cakes will be sold on a typical day.

[3]
b.ii.

A customer is selected at random. Find the probability that the customer is male and buys a sandwich.

[1]
c.i.

A female customer is selected at random. Find the probability that she buys a sandwich.

[4]
c.ii.

Markscheme

use of formula or Venn diagram       (M1)

0.72 + 0.45 − 1       (A1)

= 0.17       A1

[3 marks]

a.i.

0.72 − 0.17 = 0.55      A1

[1 mark]

a.ii.

200 × 0.45 = 90      A1

[1 mark]

b.i.

let X be the number of customers who order cake

X ~ B(200,0.45)        (M1)

P(X > 100) = P(X ≥ 101)(= 1 − P(X ≤ 100))    (M1)

= 0.0681      A1

 

[3 marks]

b.ii.

0.46 × 0.8 = 0.368    A1

[1 mark]

c.i.

METHOD 1

0.368 + 0.54 × P ( S | F ) = 0.72        M1A1A1 

Note: Award M1 for an appropriate tree diagram. Award M1 for LHS, M1 for RHS.

P ( S | F ) = 0.652      A1  

 

METHOD 2

P ( S | F ) = P ( S F ) P ( F )        (M1)

= 0.72 0.368 0.54        A1A1

Note: Award A1 for numerator, A1 for denominator.

P ( S | F ) = 0.652      A1 

 

[4 marks]

c.ii.

Examiners report

[N/A]
a.i.
[N/A]
a.ii.
[N/A]
b.i.
[N/A]
b.ii.
[N/A]
c.i.
[N/A]
c.ii.



Dana has collected some data regarding the heights h (metres) of waves against a pier at 50 randomly chosen times in a single day. This data is shown in the table below.

She wishes to perform a χ2-test at the 5% significance level to see if the height of waves could be modelled by a normal distribution. Her null hypothesis is

H0: The data can be modelled by a normal distribution.

From the table she calculates the mean of the heights in her sample to be 0.828m and the standard deviation of the heights sn to be 0.257m.

She calculates the expected values for each interval under this null hypothesis, and some of these values are shown in the table below.

Use the given value of sn to find the value of sn-1.

[2]
a.

Find the value of a and the value of b, giving your answers correct to one decimal place.

[3]
b.

Find the value of the χ2 test statistic χcalc2 for this test.

[2]
c.

Determine the degrees of freedom for Dana’s test.

[2]
d.

It is given that the critical value for this test is 9.49.

State the conclusion of the test in context. Use your answer to part (c) to justify your conclusion.

[2]
e.

Markscheme

* This sample question was produced by experienced DP mathematics senior examiners to aid teachers in preparing for external assessment in the new MAA course. There may be minor differences in formatting compared to formal exam papers.

sn-1=5049×0.257        (M1)

 

Note: M1 is for the use of the correct formula

 

=0.260        A1

 

[2 marks]

a.

Using x¯=0.828 and sn-1=0.260        (M1)

a=7.3, b=7.6        A1A1

 

[3 marks]

b.

χcalc2=3.35        (M1)A1

 

[2 marks]

c.

Combining columns with expected values less than 5 leaves 7 cells       (M1)

7-1-2=4        A1

 

[2 marks]

d.

3.35<9.49       R1

hence insufficient evidence to reject H0 that the heights of the waves are normally distributed.       A1

 

Note: The A1 can be awarded independently of the R1.

 

[2 marks]

e.

Examiners report

[N/A]
a.
[N/A]
b.
[N/A]
c.
[N/A]
d.
[N/A]
e.



The times taken for male runners to complete a marathon can be modelled by a normal distribution with a mean 196 minutes and a standard deviation 24 minutes.

It is found that 5% of the male runners complete the marathon in less than T 1 minutes.

The times taken for female runners to complete the marathon can be modelled by a normal distribution with a mean 210 minutes. It is found that 58% of female runners complete the marathon between 185 and 235 minutes.

Find the probability that a runner selected at random will complete the marathon in less than 3 hours.

[2]
a.

Calculate T 1 .

[2]
b.

Find the standard deviation of the times taken by female runners.

[4]
c.

Markscheme

* This question is from an exam for a previous syllabus, and may contain minor differences in marking or structure.

T N ( 196 ,   24 2 )

P ( T < 180 ) = 0.252      (M1)A1

[2 marks]

a.

P ( T < T 1 ) = 0.05      (M1)

T 1 = 157      A1

[2 marks]

b.

F N ( 210 ,   σ 2 )

P ( F < 235 ) = 0.79      (M1)

235 210 σ = 0.806421 or equivalent     (M1)(A1)

σ = 31.0      A1

[4 marks]

c.

Examiners report

[N/A]
a.
[N/A]
b.
[N/A]
c.



A Principal would like to compare the students in his school with a national standard. He decides to give a test to eight students made up of four boys and four girls. One of the teachers offers to find the volunteers from his class.

The marks out of 40, for the students who took the test, are:

25,   29,   38,   37,   12,   18,   27,   31.

For the eight students find

The national standard mark is 25.2 out of 40.

Two additional students take the test at a later date and the mean mark for all ten students is 28.1 and the standard deviation is 8.4.

For further analysis, a standardized score out of 100 for the ten students is obtained by multiplying the scores by 2 and adding 20.

For the ten students, find

Name the type of sampling that best describes the method used by the Principal.

[1]
a.

the mean mark.

[2]
b.i.

the standard deviation of the marks.

[1]
b.ii.

Perform an appropriate test at the 5% significance level to see if the mean marks achieved by the students in the school are higher than the national standard. It can be assumed that the marks come from a normal population.

[5]
c.

State one reason why the test might not be valid.

[1]
d.

their mean standardized score.

[1]
e.i.

the standard deviation of their standardized score.

[2]
e.ii.

Markscheme

quota      A1 

 

[1 mark]

a.

27.12527.1          (M1)A1

 

[2 marks]

b.i.

8.298158.30           A1

 

[1 mark]

b.ii.

(let μ be the national mean)

H0: μ=25.2

H1: μ>25.2           A1


Note: Accept hypotheses in words if they are clearly expressed and ‘population mean’ or ‘school mean’ is referred to. Do not accept H0: μ=μ0 unless μ0 is explicitly defined as “national standard mark” or given as 25.2.


recognizing t-test             (M1)

p-value =0.279391           A1

0.279391>0.05           R1


Note: The R1 mark is for the comparison of their p-value with 0.05.


insufficient evidence to reject the null hypothesis (that the mean for the school is 25.2)           A1


Note:
Award the final A1 only if the null hypothesis is also correct (e.g. μ0=25.2 or (population) mean =25.2) and the conclusion is consistent with both the direction of the inequality and the alternative hypothesis.

 

[5 marks]

c.

EITHER

the sampling process is not random         R1

For example:

the school asked for volunteers

the students were selected from a single class


OR

the quota might not be representative of the student population         R1

For example:

the school may have only 4 boys and 400 girls.


Note: Do not accept ‘the sample is too small’.

 

[1 mark]

d.

(28.1×2+20=)  76.2         A1

 

[1 mark]

e.i.

8.4×2          (A1)

=16.8         A1

 

[2 marks]

e.ii.

Examiners report

The most common answer to this question was ‘convenience sampling’. Though it is a convenience sample because four boys and four girls were required the most appropriate response was ‘quota sampling’.

a.

Some candidates still try to calculate a mean and standard deviation by hand. This is not expected.

b.i.
[N/A]
b.ii.

This was surprisingly poorly answered given that statistical testing forms a large part of the course. Candidates need to give the null and alternative hypotheses, find a p-value, compare this to the significance level and write their conclusion, in context of the question; examination questions may ask for each element individually or the question may say “Perform the test” wherein it is expected that each individual element will be clearly stated (as the test is incomplete if any are omitted). Many candidates had the null hypothesis as an inequality. The easiest way to write the null hypothesis is H0: μ=25.2, but it could also be stated in words so long as it is clear that the population mean is being referred to rather than the sample mean. For example, H0: The mean score of the whole school is equal to 25.2.

c.

The answer that the sample was self-selecting or unrepresentative was the expected response. The sample being small was also accepted if the additional reason of therefore ‘not able to assume a normal population’ was also given. In general, a small sample can be valid (though will probably not be reliable).

d.

Some candidates missed the point of this question, that it was concerned with transformations of the mean and standard deviation, and instead tried to work out the actual values for the extra two candidates.

e.i.
[N/A]
e.ii.



The number of marathons that Audrey runs in any given year can be modelled by a Poisson distribution with mean 1.3 .

Calculate the probability that Audrey will run at least two marathons in a particular year.

[2]
a.

Find the probability that she will run at least two marathons in exactly four out of the following five years.

[4]
b.

Markscheme

X Po ( 1.3 )

P ( X 2 ) = 0.373        (M1)A1

[2 marks]

a.

V B ( 5 0.373 )        (M1)A1

Note: Award (M1) for recognition of binomial or equivalent, A1 for correct parameters.

P ( V = 4 ) = 0.0608        (M1)A1

[4 marks]

b.

Examiners report

[N/A]
a.
[N/A]
b.



Packets of biscuits are produced by a machine. The weights X , in grams, of packets of biscuits can be modelled by a normal distribution where X N ( μ ,   σ 2 ) . A packet of biscuits is considered to be underweight if it weighs less than 250 grams.

The manufacturer makes the decision that the probability that a packet is underweight should be 0.002. To do this μ is increased and σ remains unchanged.

The manufacturer is happy with the decision that the probability that a packet is underweight should be 0.002, but is unhappy with the way in which this was achieved. The machine is now adjusted to reduce σ and return μ to 253.

Given that μ = 253 and σ = 1.5 find the probability that a randomly chosen packet of biscuits is underweight.

[2]
a.

Calculate the new value of μ giving your answer correct to two decimal places.

[3]
b.

Calculate the new value of σ .

[2]
c.

Markscheme

* This question is from an exam for a previous syllabus, and may contain minor differences in marking or structure.

P ( X < 250 ) = 0.0228      (M1)A1

[2 marks]

a.

250 μ 1.5 = 2.878      (M1)(A1)

μ = 254.32      A1

 

Notes:     Only award A1 here if the correct 2dp answer is seen. Award M0 for use of 1.5 2 .

 

[3 marks]

b.

250 253 σ = 2.878      (A1)

σ = 1.04      A1

[2 marks]

c.

Examiners report

[N/A]
a.
[N/A]
b.
[N/A]
c.



In a reforested area of pine trees, heights of trees planted in a specific year seem to follow a normal distribution. A sample of 100 such trees is selected to test the validity of this hypothesis. The results of measuring tree heights, to the nearest centimetre, are recorded in the first two columns of the table below.

Describe what is meant by

a goodness of fit test (a complete explanation required);

[2]
a.i.

the level of significance of a hypothesis test.

[1]
a.ii.

Find the mean and standard deviation of the sample data in the table above. Show how you arrived at your answers.

[4]
b.

Most of the expected frequencies have been calculated in the third column. (Frequencies have been rounded to the nearest integer, and frequencies in the first and last classes have been extended to include the rest of the data beyond 15 and 225. Find the values of a , b  and c and show how you arrived at your answers.

[4]
c.

In order to test for the goodness of fit, the test statistic was calculated to be 1.0847. Show how this was done.

[3]
d.

State your hypotheses, critical number, decision rule and conclusion (using a 5% level of significance).

[5]
e.

Markscheme

A goodness of fit test is a statistical test of the hypothesis that a set of observed counts of k cells of a certain large population is consistent with a set of theoretical counts.                (R1)

The test statistic has a χ 2  distribution with k n degrees of freedom. One degree of freedom is lost for every parameter that has to be estimated from the sample.            (R1)

[2 marks]

a.i.

The level of significance of a hypothesis test is the maximal probability that we reject a true null hypothesis.      (R1)

[1 mark]

a.ii.

We use the class midpoints in the calculation of the mean and standard deviation.

x ¯ = x i f ( x i ) f ( x i ) = 30 × 6 + 60 × 11 + 90 × 15 + 100 = 13350 100                 (M1)

= 133.5                (A1)

s = x i 2 f ( x i ) f ( x i ) ( x ¯ ) 2 = 900 × 6 + 3600 × 11 + 100 133.5 2                 (M1)

= 56.345  (= 56.3 to 3 sf)                (A1)

[4 marks]

b.

Every frequency is the product of the number of observations and the probability of a number in each class. Since by hypothesis we have a normal distribution, the probabilities can be read from a normal table with mean 133.5 and standard deviation 56.345                 (M1)

E1 = 100 × P(45 ≤ x  ≤ 75) ≈ 9          so a  = 9              (A1)

E2 = 100 × P(135 ≤ x  ≤ 165) ≈ 20    so b = 20              (A1)

E3 = 100 × P(195 ≤ x  ≤ 225) ≈ 9      so c  = 9              (A1)

[4 marks]

c.

The test statistic is a χ 2  variable. Hence                 (M1)

χ 2 = ( f e f o ) 2 f e = ( 6 6 ) 2 6 + ( 9 11 ) 2 9 + ( 5 6 ) 2 5                  (M1)

= 1.0847              (A1)

[3 marks]

d.

H0: The distribution of tree heights is normally distributed

H1: The distribution is not normal            (M1)

Since the mean and standard deviation were estimated from the sample, the number of degrees of freedom is 8 – 1 – 2 = 5            (A1)

The critical number is χ 5 , 0.05 2 = 110705

If χ 2  > 11.0705 we reject H0            (A1)

Since χ 2  = 1.0847 < 11.0705, we fail to reject H0            (R1)

Conclusion: we do not have enough evidence to claim that the distribution of tree heights is not normal            (R1) 

[5 marks]

e.

Examiners report

[N/A]
a.i.
[N/A]
a.ii.
[N/A]
b.
[N/A]
c.
[N/A]
d.
[N/A]
e.



The random variable X has a normal distribution with mean μ = 50 and variance σ 2 = 16 .

Sketch the probability density function for X, and shade the region representing P(μ − 2σ < X < μ + σ).

[2]
a.

Find the value of P(μ − 2σ < X < μ + σ).

[2]
b.

Find the value of k for which P(μkσ < X < μ + kσ) = 0.5.

[2]
c.

Markscheme

* This question is from an exam for a previous syllabus, and may contain minor differences in marking or structure.

normal curve centred on 50      A1

vertical lines at x = 42 and x = 54, with shading in between       A1

[2 marks]

a.

P(42 X < 54) (= P(− 2 Z < 1))     (M1)

= 0.819       A1

[2 marks]

b.

P(μ − kσ < X < μ + kσ) = 0.5 ⇒ P(X < μ + kσ) = 0.75      (M1)

k = 0.674       A1

Note: Award M1A0 for k = −0.674.

[2 marks]

c.

Examiners report

[N/A]
a.
[N/A]
b.
[N/A]
c.



Loreto is a manager at the Da Vinci health centre. If the mean rate of patients arriving at the health centre exceeds 1.5 per minute then Loreto will employ extra staff. It is assumed that the number of patients arriving in any given time period follows a Poisson distribution.

Loreto performs a hypothesis test to determine whether she should employ extra staff. She finds that 320 patients arrived during a randomly selected 3-hour clinic.

Loreto is also concerned about the average waiting time for patients to see a nurse. The health centre aims for at least 95% of patients to see a nurse in under 20 minutes.

Loreto assumes that the waiting times for patients are independent of each other and decides to perform a hypothesis test at a 10% significance level to determine whether the health centre is meeting its target.

Loreto surveys 150 patients and finds that 11 of them waited more than 20 minutes.

Write down null and alternative hypotheses for Loreto’s test.

[2]
a.i.

Using the data from Loreto’s sample, perform the hypothesis test at a 5% significance level to determine if Loreto should employ extra staff.

[5]
a.ii.

Write down null and alternative hypotheses for this test.

[2]
b.i.

Perform the test, clearly stating the conclusion in context.

[5]
b.ii.

Markscheme

let X be the random variable “number of patients arriving in a minute”, such that X~Pom

H0 : m=1.5           A1

H1 : m>1.5           A1

Note: Allow a value of 270 for m. Award at most A0A1 if it is not clear that it is the population mean being referred to e.g
        H0 : The number of patients is equal to 1.5 every minute
        H1 : The number of patients exceeds 1.5 every minute.
Referring to the “expected” number of patients or the use of μ or λ is sufficient for A1A1.

 

[2 marks]

a.i.

under H0 let Y be the number of patients in 3 hours

Y~Po270             (A1)

PY320 =1-PY319=0.00166  0.00165874             (M1)A1

since 0.00166<0.05             R1

(reject H0)

Loreto should employ more staff             A1

 

[5 marks]

a.ii.

H0 : The probability of a patient waiting less than 20 minutes is 0.95             A1

H1 : The probability of a patient waiting less than 20 minutes is less than 0.95             A1

 

[2 marks]

b.i.

Under H0 let W be the number of patients waiting more than 20 minutes

W~B150, 0.05             (A1)

PW11=0.132  0.132215             (M1)A1

since 0.132>0.1             R1

(fail to reject H0)

insufficient evidence to suggest they are not meeting their target             A1


Note: Do not accept “they are meeting target” for the A1. Accept use of B(150, 0.95) and PW139 and any consistent use of a random variable, appropriate p-value and significance level.

[5 marks]

b.ii.

Examiners report

In part (a) there was a general lack of consistency in how candidates wrote down their null and alternative hypotheses. It was surprising how many candidates solved a Poisson PDF rather than CDF to find their p-value. This suggests a lack of understanding of the nature of distributions or more specifically the concepts of hypothesis testing. In part (b), which was challenging, there were issues for many candidates in interpreting the situation. This is understandable since it was difficult, but as previously mentioned interpretation is a general issue in the paper. When writing down the conclusion of the tests, there was often very loose use of the terms accept/reject and candidates seemed unclear of the significance and importance of the correct use of these terms.

a.i.

In part (a) there was a general lack of consistency in how candidates wrote down their null and alternative hypotheses. It was surprising how many candidates solved a Poisson PDF rather than CDF to find their p-value. This suggests a lack of understanding of the nature of distributions or more specifically the concepts of hypothesis testing. In part (b), which was challenging, there were issues for many candidates in interpreting the situation. This is understandable since it was difficult, but as previously mentioned interpretation is a general issue in the paper. When writing down the conclusion of the tests, there was often very loose use of the terms accept/reject and candidates seemed unclear of the significance and importance of the correct use of these terms.

a.ii.

In part (a) there was a general lack of consistency in how candidates wrote down their null and alternative hypotheses. It was surprising how many candidates solved a Poisson PDF rather than CDF to find their p-value. This suggests a lack of understanding of the nature of distributions or more specifically the concepts of hypothesis testing. In part (b), which was challenging, there were issues for many candidates in interpreting the situation. This is understandable since it was difficult, but as previously mentioned interpretation is a general issue in the paper. When writing down the conclusion of the tests, there was often very loose use of the terms accept/reject and candidates seemed unclear of the significance and importance of the correct use of these terms.

b.i.

In part (a) there was a general lack of consistency in how candidates wrote down their null and alternative hypotheses. It was surprising how many candidates solved a Poisson PDF rather than CDF to find their p-value. This suggests a lack of understanding of the nature of distributions or more specifically the concepts of hypothesis testing. In part (b), which was challenging, there were issues for many candidates in interpreting the situation. This is understandable since it was difficult, but as previously mentioned interpretation is a general issue in the paper. When writing down the conclusion of the tests, there was often very loose use of the terms accept/reject and candidates seemed unclear of the significance and importance of the correct use of these terms.

b.ii.



The marks achieved by eight students in a class test are given in the following list.

The teacher increases all the marks by 2. Write down the new value for

the standard deviation.

Markscheme

2.22      A1

[1 mark]

Examiners report

[N/A]



A random variable X is normally distributed with mean μ  and standard deviation σ , such that P ( X < 30.31 ) = 0.1180 and P ( X > 42.52 ) = 0.3060 .

Find μ and σ .

[6]
a.

Find P ( | X μ | < 1.2 σ ) .

[2]
b.

Markscheme

* This question is from an exam for a previous syllabus, and may contain minor differences in marking or structure.

P ( X < 42.52 ) = 0.6940    (M1)

either P ( Z < 30.31 μ σ ) = 0.1180  or P ( Z < 42.52 μ σ ) = 0.6940      (M1)

30.31 μ σ = Φ 1 ( 0.1180 ) 1.1850    (A1)

42.52 μ σ = Φ 1 ( 0.6940 ) 0.5072    (A1)

attempting to solve simultaneously     (M1)

μ = 38.9 and σ = 7.22      A1

[6 marks]

a.

P ( μ 1.2 σ < X < μ + 1.2 σ ) (or equivalent eg. 2 P ( μ < X < μ + 1.2 σ ) )     (M1)

= 0.770    A1

 

Note: Award (M1)A1 for P ( 1.2 < Z < 1.2 ) = 0.770 .

[2 marks]

b.

Examiners report

[N/A]
a.
[N/A]
b.



A geneticist uses a Markov chain model to investigate changes in a specific gene in a cell as it divides. Every time the cell divides, the gene may mutate between its normal state and other states.

The model is of the form

Xn+1Zn+1=MXnZn

where Xn is the probability of the gene being in its normal state after dividing for the nth time, and Zn is the probability of it being in another state after dividing for the nth time, where n.

Matrix M is found to be 0.94  b0.06  0.98.

The gene is in its normal state when n=0. Calculate the probability of it being in its normal state

Write down the value of b.

[1]
a.i.

What does b represent in this context?

[1]
a.ii.

Find the eigenvalues of M.

[3]
b.

Find the eigenvectors of M.

[3]
c.

when n=5.

[2]
d.i.

in the long term.

[2]
d.ii.

Markscheme

0.02         A1

 

[1 mark]

a.i.

the probability of mutating from ‘not normal state’ to ‘normal state’         A1


Note: The A1 can only be awarded if it is clear that transformation is from the mutated state.

 

[1 mark]

a.ii.

det0.94-λ  0.020.06  0.98-λ=0         (M1)


Note: Award M1 for an attempt to find eigenvalues. Any indication that detM-λI=0 has been used is sufficient for the (M1).


0.94-λ0.98-λ-0.0012=0  OR  λ2-1.92λ+0.92=0         (A1)

λ=1, 0.92  2325         A1

 

[3 marks]

b.

0.94  0.020.06  0.98xy=xy  OR  0.94  0.020.06  0.98xy=0.92xy         (M1)


Note: Award M1 can be awarded for attempting to find either eigenvector.


0.02y-0.06x=0  OR  0.02y+0.02x=0

13  and  1-1         A1A1


Note: Accept any multiple of the given eigenvectors.

 

[3 marks]

c.

0.94  0.020.06  0.98510  OR  0.744  0.08520.256  0.91510         (M1)


Note: Condone omission of the initial state vector for the M1.


0.744   0.744311            A1

 

[2 marks]

d.i.

0.250.75         (A1)


Note: Award A1 for 0.250.75  OR  0.25  0.250.75  0.75 seen.


0.25          A1

 

[2 marks]

d.ii.

Examiners report

There was some difficulty in interpreting the meaning of the values in the transition matrix, but most candidates did well with the rest of the question. In part (d) there was frequently evidence of a correct method, but a failure to identify the correct probabilities. It was surprising to see a significant number of candidates diagonalizing the matrix in part (d) and this often led to errors. Clearly this was not necessary.

a.i.
[N/A]
a.ii.
[N/A]
b.
[N/A]
c.
[N/A]
d.i.
[N/A]
d.ii.



Scientists have developed a type of corn whose protein quality may help chickens gain weight faster than the present type used. To test this new type, 20 one-day-old chicks were fed a ration that contained the new corn while another control group of 20 chicks was fed the ordinary corn. The data below gives the weight gains in grams, for each group after three weeks.

The scientists wish to investigate the claim that Group B gain weight faster than Group A. Test this claim at the 5% level of significance, noting which hypothesis test you are using. You may assume that the weight gain for each group is normally distributed, with the same variance, and independent from each other.

[6]
a.

The data from the two samples above are combined to form a single set of data. The following frequency table gives the observed frequencies for the combined sample. The data has been divided into five intervals.

Test, at the 5% level, whether the combined data can be considered to be a sample from a normal population with a mean of 380.

[10]
b.

Markscheme

This is a t-test of the difference of two means. Our assumptions are that the two populations are approximately normal, samples are random, and they are independent from each other.          (R1)

H0: μ1μ2 = 0

H1: μ1 − μ2 < 0          (A1)                

t = −2.460,          (A1)

degrees of freedom = 38          (A1)

Since the value of critical t = −1.686 we reject H0.          (A1)

Hence group B grows faster.          (R1)

[6 marks]

a.

This is a χ 2  goodness-of-fit test.

To finish the table, the frequencies of the respective cells have to be calculated. Since the standard deviation is not given, it has to be estimated using the data itself. s = 49.59, eg the third expected frequency is 40 × 0.308 = 12.32, since P(350.5 < W < 390.5) = 0.3078...

The table of observed and expected frequencies is:

      (M1)(A2)

Since the first expected frequency is 3.22, we combine the two cells, so that the first two rows become one row, that is,

      (M1)

Number of degrees of freedom is 4 – 1 – 1 = 2           (C1)          

H0: The distribution is normal with mean 380

H1: The distribution is not normal with mean 380         (A1)

The test statistic is

χ c a l c 2 = ( f e f 0 ) 2 f e = ( 11 11.04 ) 2 11.04 + ( 8 12.32 ) 2 12.32 + ( 15 10.48 ) 2 10.48 + ( 6 6.17 ) 2 6.17

= 3.469          (A1)

With 2 degrees of freedom, the critical number is χ 2  = 5.99           (A2)

So, we do not have enough evidence to reject the null hypothesis. Therefore, there is no evidence to say that the distribution is not normal with mean 380.           (R1)

[10 marks]

b.

Examiners report

[N/A]
a.
[N/A]
b.



The random variable X has a binomial distribution with parameters n and p.
It is given that E(X) = 3.5.

Find the least possible value of n.

[2]
a.

It is further given that P(X ≤ 1) = 0.09478 correct to 4 significant figures.

Determine the value of n and the value of p.

[5]
b.

Markscheme

* This question is from an exam for a previous syllabus, and may contain minor differences in marking or structure.

np = 3.5      (A1)

p ≤ 1 ⇒ least n = 4       A1

[2 marks]

a.

(1 − p)n + np(1 − p)n−1 = 0.09478     M1A1

attempt to solve above equation with np = 3.5     (M1)

n = 12,  p 7 24 (=0.292)     A1A1

Note: Do not accept n as a decimal.

[5 marks]

b.

Examiners report

[N/A]
a.
[N/A]
b.



A survey of British holidaymakers found that 15% of those surveyed took a holiday in the Lake District in 2019.

A random sample of 16 British holidaymakers was taken. The number of people in the sample who took a holiday in the Lake District in 2019 can be modelled by a binomial distribution.

State two assumptions made in order for this model to be valid.

[2]
a.i.

Find the probability that at least three people from the sample took a holiday in the Lake District in 2019.

[2]
a.ii.

From a random sample of n holidaymakers, the probability that at least one of them took a holiday in the Lake District in 2019 is greater than 0.999.

Determine the least possible value of n.

[3]
b.

Markscheme

people’s holidays are independent of each other      R1

the proportion is constant (at 0.15)      R1


[2 marks]

a.i.

Χ~B16, 0.15

PΧ3=0.439      (M1)A1


[2 marks]

a.ii.

probability of at least one =1- probability of none

1-0.85n>0.999  OR  0.85n<0.001      (A1)

attempt to solve inequality      (M1)

n42.503

so least possible n=43       A1


[3 marks]

b.

Examiners report

[N/A]
a.i.
[N/A]
a.ii.
[N/A]
b.



A discrete random variable X follows a Poisson distribution Po ( μ ) .

Show that P ( X = x + 1 ) = μ x + 1 × P ( X = x ) ,   x N .

[3]
a.

Given that P ( X = 2 ) = 0.241667  and P ( X = 3 ) = 0.112777 , use part (a) to find the value of μ .

[3]
b.

Markscheme

* This question is from an exam for a previous syllabus, and may contain minor differences in marking or structure.

METHOD 1

P ( X = x + 1 ) = μ x + 1 ( x + 1 ) ! e μ    A1

= μ x + 1 × μ x x ! e μ    M1A1

= μ x + 1 × P ( X = x )    AG

METHOD 2

μ x + 1 × P ( X = x ) = μ x + 1 × μ x x ! e μ    A1

= μ x + 1 ( x + 1 ) ! e μ    M1A1

= P ( X = x + 1 )    AG

METHOD 3

P ( X = x + 1 ) P ( X = x ) = μ x + 1 ( x + 1 ) ! e μ μ x x ! e μ    (M1)

= μ x + 1 μ x × x ! ( x + 1 ) !    A1

= μ x + 1    A1

and so P ( X = x + 1 ) = μ x + 1 × P ( X = x )      AG

[3 marks]

a.

P ( X = 3 ) = μ 3 P ( X = 2 )   ( 0.112777 = μ 3 0.241667 )    A1

attempting to solve for μ      (M1)

μ = 1.40    A1

[3 marks]

b.

Examiners report

[N/A]
a.
[N/A]
b.



Jenna is a keen book reader. The number of books she reads during one week can be modelled by a Poisson distribution with mean 2.6.

Determine the expected number of weeks in one year, of 52 weeks, during which Jenna reads at least four books.

Markscheme

* This question is from an exam for a previous syllabus, and may contain minor differences in marking or structure.

let Χ be the random variable “number of books Jenna reads per week.”

then Χ~Po 2.6

PΧ4=0.264 0.263998       (M1)(A1)

0.263998×52       (M1)

=13.7       A1


Note:
Accept 14 weeks.


[4 marks]

Examiners report

[N/A]



It is known that 56 % of Infiglow batteries have a life of less than 16 hours, and 94 % have a life less than 17 hours. It can be assumed that battery life is modelled by the normal distribution N ( μ , σ 2 ) .

Find the value of μ and the value of σ .

[6]
a.

Find the probability that a randomly selected Infiglow battery will have a life of at least 15 hours.

[2]
b.

Markscheme

* This question is from an exam for a previous syllabus, and may contain minor differences in marking or structure.

use of inverse normal (implied by ±0.1509… or ±1.554…)       (M1)

P(X < 16) = 0.56

16 μ σ = 0.1509       (A1)

P(X < 17) = 0.94

17 μ σ = 1.554       (A1)

attempt to solve a pair of simultaneous equations       (M1)

μ = 15.9,   σ = 0.712      A1A1

 

[6 marks]

a.

correctly shaded diagram or intent to find P(X ≥ 15)       (M1)

= 0.895       A1

Note: Accept answers rounding to 0.89 or 0.90. Award M1A0 for the answer 0.9.

 

[2 marks]

b.

Examiners report

[N/A]
a.
[N/A]
b.



The continuous random variable X has probability density function  f given by

f ( x ) = { 3 a x , 0 x < 0.5 a ( 2 x ) , 0.5 x < 2 0 , otherwise

 

Show that  a = 2 3 .

[3]
a.

Find  P ( X < 1 ) .

[3]
b.

Given that P ( s < X < 0.8 ) = 2 × P ( 2 s < X < 0.8 ) , and that 0.25 < s < 0.4 , find the value of s.

[7]
c.

Markscheme

* This question is from an exam for a previous syllabus, and may contain minor differences in marking or structure.

 

a [ 0 0.5 3 x d x + 0.5 2 ( 2 x ) d x ] = 1      M1

Note: Award the M1 for the total integral equalling 1, or equivalent.

a ( 3 2 ) = 1      (M1)A1

a = 2 3      AG

[3 marks]

a.

EITHER

0 0.5 2 x d x + 2 3 0.5 1 ( 2 x ) d x      (M1)(A1)

= 2 3      A1

OR

2 3 1 2 ( 2 x ) d x = 1 3      (M1)

so  P ( X < 1 ) = 2 3       (M1)A1

[3 marks]

b.

P ( s < X < 0.8 ) = s 0.5 2 x d x + 2 3 0.5 0.8 ( 2 x ) d x      M1A1

= [ x 2 ] s 0.5 + 0.27

0.25 s 2 + 0.27      (A1)

P ( 2 s < X < 0.8 ) = 2 3 2 s 0.8 ( 2 x ) d x      A1

= 2 3 [ 2 x x 2 2 ] 2 s 0.8

2 3 ( 1.28 ( 4 s 2 s 2 ) )

equating

0.25 s 2 + 0.27 = 4 3 ( 1.28 ( 4 s 2 s 2 ) )      (A1)

attempt to solve for s      (M1)

s = 0.274      A1

[7 marks]

c.

Examiners report

[N/A]
a.
[N/A]
b.
[N/A]
c.



Steffi the stray cat often visits Will’s house in search of food. Let X be the discrete random variable “the number of times per day that Steffi visits Will’s house”.

The random variable X can be modelled by a Poisson distribution with mean 2.1.

Let Y be the discrete random variable “the number of times per day that Steffi is fed at Will’s house”. Steffi is only fed on the first four occasions that she visits each day.

Find the probability that on a randomly selected day, Steffi does not visit Will’s house.

[2]
a.

Copy and complete the probability distribution table for Y.

[4]
b.

Hence find the expected number of times per day that Steffi is fed at Will’s house.

[3]
c.

In any given year of 365 days, the probability that Steffi does not visit Will for at most n days in total is 0.5 (to one decimal place). Find the value of n .

[3]
d.

Show that the expected number of occasions per year on which Steffi visits Will’s house and is not fed is at least 30.

[4]
e.

Markscheme

* This question is from an exam for a previous syllabus, and may contain minor differences in marking or structure.

X Po ( 2.1 )

P ( X = 0 ) = 0.122 ( = e 2.1 )        (M1)A1

[2 marks]

a.

      A1A1A1A1

Note: Award A1 for each correct probability for Y = 1, 2, 3, 4. Accept 0.162 for P(Y = 4).

[4 marks]

b.

E ( Y ) = y P ( Y = y )       (M1)

= 1 × 0.257 + 2 × 0.270 + 3 × 0.189 + 4 × 0.161       (A1)

= 2.01       A1

[3 marks]

c.

let T be the no of days per year that Steffi does not visit

T B ( 365 , 0.122 )       (M1)

require  0.45 P ( T n ) < 0.55       (M1)

P ( T 44 ) = 0.51

n = 44       A1

[3 marks]

d.

METHOD 1

let V be the discrete random variable “number of times Steffi is not fed per day”

E ( V ) = 1 × P ( X = 5 ) + 2 × P ( X = 6 ) + 3 × P ( X = 7 ) +         M1

= 1 × 0.0416 + 2 × 0.0145 + 3 × 0.00437 +       A1

= 0.083979...      A1

expected no of occasions per year > 0.083979... × 365 = 30.7      A1

hence Steffi can expect not to be fed on at least 30 occasions       AG

Note: Candidates may consider summing more than three terms in their calculation for E ( V ) .

 

METHOD 2

E ( X ) E ( Y ) = 0.0903        M1A1

0.0903… × 365       M1

= 33.0 > 30       A1AG

  

[4 marks]

e.

Examiners report

[N/A]
a.
[N/A]
b.
[N/A]
c.
[N/A]
d.
[N/A]
e.



Timmy owns a shop. His daily income from selling his goods can be modelled as a normal distribution, with a mean daily income of $820, and a standard deviation of $230. To make a profit, Timmy’s daily income needs to be greater than $1000.

Calculate the probability that, on a randomly selected day, Timmy makes a profit.

[2]
a.

The shop is open for 24 days every month.

Calculate the probability that, in a randomly selected month, Timmy makes a profit on between 5 and 10 days (inclusive).

[3]
b.

Markscheme

* This question is from an exam for a previous syllabus, and may contain minor differences in marking or structure.

X ~ N(820, 2302)       (M1)

Note: Award M1 for an attempt to use normal distribution. Accept labelled normal graph.

⇒P(X > 1000) = 0.217       A1

[2 marks]

a.

Y ~ B(24,0.217...)       (M1)

Note: Award M1 for recognition of binomial distribution with parameters.

P(Y ≤ 10) − P(Y ≤ 4)         (M1)

Note: Award M1 for an attempt to find P(5 ≤ Y ≤ 10) or P(Y ≤ 10) − P(Y ≤ 4).

= 0.613       A1

[3 marks]

b.

Examiners report

[N/A]
a.
[N/A]
b.



The number of bananas that Lucca eats during any particular day follows a Poisson distribution with mean 0.2.

Find the probability that Lucca eats at least one banana in a particular day.

[2]
a.

Find the expected number of weeks in the year in which Lucca eats no bananas.

[4]
b.

Markscheme

* This question is from an exam for a previous syllabus, and may contain minor differences in marking or structure.

let X be the number of bananas eaten in one day

X Po ( 0.2 )

P ( X 1 ) = 1 P ( X = 0 )     (M1)

= 0.181   ( = 1 e 0.2 )     A1

[2 marks]

a.

EITHER

let Y be the number of bananas eaten in one week

Y Po ( 1.4 )     (A1)

P ( Y = 0 ) = 0.246596   ( = e 1.4 )     (A1)

OR

let Z be the number of days in one week at least one banana is eaten

Z B ( 7 ,   0.181 )     (A1)

P ( Z = 0 ) = 0.246596     (A1)

THEN

52 × 0.246596     (M1)

= 12.8   ( = 52 e 1.4 )     A1

[4 marks]

b.

Examiners report

[N/A]
a.
[N/A]
b.



Events A and B are such that P ( A B ) = 0.95 ,  P ( A B ) = 0.6 and P ( A | B ) = 0.75 .

Find  P ( B ) .

[2]
a.

Find P ( A ) .

[2]
b.

Hence show that events A and B are independent.

[2]
c.

Markscheme

* This question is from an exam for a previous syllabus, and may contain minor differences in marking or structure.

P ( A | B ) = P ( A B ) P ( B )

0.75 = 0.6 P ( B )     (M1)

P ( B )   ( = 0.6 0.75 ) = 0.8     A1

[2 marks]

a.

P ( A B ) = P ( A ) + P ( B ) P ( A B )

0.95 = P ( A ) + 0.8 0.6     (M1)

P ( A ) = 0.75     A1

[2 marks]

b.

METHOD 1

P ( A | B ) = P ( A B ) P ( B ) = 0.2 0.8 = 0.25     A1

P ( A | B ) = P ( A )     R1

hence A and B are independent     AG

 

Note:     If there is evidence that the student has calculated P ( A B ) = 0.2 by assuming independence in the first place, award A0R0.

 

METHOD 2

EITHER

P ( A ) = P ( A | B )     A1

OR

P ( A ) × P ( B ) = 0.75 × 0.80 = 0.6 = P ( A B )     A1

THEN

A and B are independent     R1

hence A and B are independent     AG

METHOD 3

P ( A ) × P ( B ) = 0.25 × 0.80 = 0.2     A1

P ( A ) × P ( B ) = P ( A B )     R1

hence A and B are independent     AG

[2 marks]

c.

Examiners report

[N/A]
a.
[N/A]
b.
[N/A]
c.



A horse breeder records the number of births for each of 100 horses during the past eight years. The results are summarized in the following table:

Stating null and alternative hypotheses carry out an appropriate test at the 5% significance level to decide whether the results can be modelled by B (6, 0.5).

[10]
a.

Without doing any further calculations, explain briefly how you would carry out a test, at the 5% significance level, to decide if the data can be modelled by B(6, p ), where p  is unspecified.

[2]
b.

A different horse breeder collected data on the time and outcome of births. The data are summarized in the following table:

Carry out an appropriate test at the 5% significance level to decide whether there is an association between time and outcome.

[8]
c.

Markscheme

METHOD 1

H0: distribution is B(6, 0.5); H1: distribution is not B(6, 0.5)      A1

( E 0 = 100 ( 0.5 ) 6 = 25 16 = 0.015625 )       A3

Combining the first two columns and the last two columns:       A1

χ 2 = O 2 E E

= 6 2 ( 175 16 ) + 26 2 ( 375 16 ) + 37 2 ( 500 16 ) + 18 2 ( 375 16 ) + 13 2 ( 175 16 ) 100         (M1)

= 5.22       A1

v = 4, so critical value of χ 5 2 = 9.488        A1A1

Since 5.22 < 9.488 the result is not significant and we accept H0       R1

 

METHOD 2

H0: distribution is B(6, 0.5); H1: distribution is not B(6, 0.5)      A1

By GDC, p = 0.266       A8

Since 0.266 > 0.05 the result is not significant and we accept H0       R1

 

[10 marks]

a.

Estimate p  from the data which would entail the loss of one degree of freedom      A1A1

[2 marks]

b.

H0: there is no association H1: there is an association       A1

       A2

χ 2 = 68 2 64.8 + 42 2 45.2 + 103 2 94.3 + + 6 2 10.6 + 12 2 7.4 314         (M1)

     = 15.7

v = 3 , χ 5 2 ( 3 ) = 7.815        A1A1

Since 15.7 > 7.815 we reject H0           R1

 

METHOD 2

H0: there is no association H1: there is an association       A1

By GDC, p = 0.00129         A6

Since 0.00129 < 0.05 we reject H0.           R1

 

[8 marks]

c.

Examiners report

[N/A]
a.
[N/A]
b.
[N/A]
c.



Each of the 25 students in a class are asked how many pets they own. Two students own three pets and no students own more than three pets. The mean and standard deviation of the number of pets owned by students in the class are  18 25 and  24 25  respectively.

Find the number of students in the class who do not own a pet.

Markscheme

* This question is from an exam for a previous syllabus, and may contain minor differences in marking or structure.

METHOD 1

let p have no pets, q have one pet and r have two pets      (M1)

p + q + r + 2 = 25      (A1)

0p + 1q + 2r + 6 = 18      A1

Note: Accept a statement that there are a total of 12 pets.

attempt to use variance equation, or evidence of trial and error       (M1)

0 p + 1 q + 4 r + 18 25 ( 18 25 ) 2 = ( 24 25 ) 2      (A1)

attempt to solve a system of linear equations (M1)

p = 14      A1

 

METHOD 2

     (M1)

p + q + r + 2 25 = 1       (A1)

q + 2 r + 6 25 = 18 25 ( q + 2 r = 12 25 )       A1

q + 4 r + 18 25 ( 18 25 ) 2 = 576 625 ( q + 4 r = 18 25 )       (M1)(A1)

q = 6 25 , r = 3 25       (M1)

p = 14 25        A1

so 14 have no pets

[7 marks]

Examiners report

[N/A]



Iqbal attempts three practice papers in mathematics. The probability that he passes the first paper is 0.6. Whenever he gains a pass in a paper, his confidence increases so that the probability of him passing the next paper increases by 0.1. Whenever he fails a paper the probability of him passing the next paper is 0.6.

Complete the given probability tree diagram for Iqbal’s three attempts, labelling each branch with the correct probability.

[3]
a.

Calculate the probability that Iqbal passes at least two of the papers he attempts.

[2]
b.

Find the probability that Iqbal passes his third paper, given that he passed only one previous paper.

[3]
c.

Markscheme

* This question is from an exam for a previous syllabus, and may contain minor differences in marking or structure.

     A1A1A1

Note: Award A1 for each correct column of probabilities.

[3 marks]

a.

probability (at least twice) =

EITHER

( 0.6 × 0.7 × 0.8 ) + ( 0.6 × 0.7 × 0.2 ) + ( 0.6 × 0.3 × 0.6 ) + ( 0.4 × 0.6 × 0.7 )        (M1)

OR

( 0.6 × 0.7 ) + ( 0.6 × 0.3 × 0.6 ) + ( 0.4 × 0.6 × 0.7 )        (M1)

Note: Award M1 for summing all required probabilities.

THEN

= 0.696     A1

[2 marks]

b.

P(passes third paper given only one paper passed before)

= P ( passes third AND only one paper passed before ) P ( passes once in first two papers )       (M1)

= ( 0.6 × 0.3 × 0.6 ) + ( 0.4 × 0.6 × 0.7 ) ( 0.6 × 0.3 ) + ( 0.4 × 0.6 )       A1

= 0.657     A1

[3 marks]

c.

Examiners report

[N/A]
a.
[N/A]
b.
[N/A]
c.



There are 75 players in a golf club who take part in a golf tournament. The scores obtained on the 18th hole are as shown in the following table.

M17/5/MATHL/HP2/ENG/TZ2/01

One of the players is chosen at random. Find the probability that this player’s score was 5 or more.

[2]
a.

Calculate the mean score.

[2]
b.

Markscheme

* This question is from an exam for a previous syllabus, and may contain minor differences in marking or structure.

P ( 5  or more ) = 29 75 ( = 0.387 )      (M1)A1

[2 marks]

a.

mean score = 2 × 3 + 3 × 15 + 4 × 28 + 5 × 17 + 6 × 9 + 7 × 3 75      (M1)

= 323 75 ( = 4.31 )      A1

[2 marks]

b.

Examiners report

[N/A]
a.
[N/A]
b.



Jorge is carefully observing the rise in sales of a new app he has created.

The number of sales in the first four months is shown in the table below.

Jorge believes that the increase is exponential and proposes to model the number of sales N in month t with the equation

N=Aert, A,r

Jorge plans to adapt Euler’s method to find an approximate value for r.

With a step length of one month the solution to the differential equation can be approximated using Euler’s method where

Nn+1Nn+1×N'n, n

Jorge decides to take the mean of these values as the approximation of r for his model. He also decides the graph of the model should pass through the point (2, 52).

The sum of the square residuals for these points for the least squares regression model is approximately 6.555.

Show that Jorge’s model satisfies the differential equation

dNdt=rN

[2]
a.

Show that rNn+1-NnNn

[3]
b.

Hence find three approximations for the value of r.

[3]
c.

Find the equation for Jorge’s model.

[3]
d.

Find the sum of the square residuals for Jorge’s model using the values t=1, 2, 3, 4.

[2]
e.

Comment how well Jorge’s model fits the data.

[1]
f.i.

Give two possible sources of error in the construction of his model.

[2]
f.ii.

Markscheme

* This sample question was produced by experienced DP mathematics senior examiners to aid teachers in preparing for external assessment in the new MAA course. There may be minor differences in formatting compared to formal exam papers.

dNdt=rAert        (M1)A1

 

Note: M1 is for an attempt to find dNdt

 

=rN        AG

 

Note: Accept solution of the differential equation by separating variables

 

[2 marks]

a.

Nn+1Nn+1×N'nN'nNn+1-Nn        M1

rNnNn+1-Nn        M1A1

rNn+1-NnNn        AG

 

Note: Do not penalize the use of the = sign.

 

[3 marks]

b.

Correct method         (M1)

r52-4040=0.3

r70-5252=0.346

r98-7070=0.4        A2

 

Note: A1 for a single error A0 for two or more errors.

 

[3 marks]

c.

r=0.349 0.34871 or 68195        A1

52=Ae0.34871×2        (M1)

A=25.8887

N=25.9e0.349t        A1

 

[3 marks]

d.

36.6904-402+0+73.6951-702+104.4435-982        (M1)

=66.1 66.126        A1

 

[2 marks]

e.

The sum of the square residuals is approximately 10 times as large as the minimum possible, so Jorge’s model is unlikely to fit the data exactly     R1

 

[1 mark]

f.i.

For example

Selecting a single point for the curve to pass through

Approximating the gradient of the curve by the gradient of a chord       R1R1

 

[2 marks]

f.ii.

Examiners report

[N/A]
a.
[N/A]
b.
[N/A]
c.
[N/A]
d.
[N/A]
e.
[N/A]
f.i.
[N/A]
f.ii.



A random variable X has a probability distribution given in the following table.

N16/5/MATHL/HP2/ENG/TZ0/01

Determine the value of E ( X 2 ) .

[2]
a.

Find the value of Var ( X ) .

[3]
b.

Markscheme

* This question is from an exam for a previous syllabus, and may contain minor differences in marking or structure.

E ( X 2 ) = Σ x 2 P ( X = x ) = 10.37   ( = 10.4  3 sf)    (M1)A1

[2 marks]

a.

METHOD 1

sd ( X ) = 1.44069    (M1)(A1)

Var ( X ) = 2.08   ( = 2.0756 )    A1

METHOD 2

E ( X ) = 2.88   ( = 0.06 + 0.27 + 0.5 + 0.98 + 0.63 + 0.44 )    (A1)

use of Var ( X ) = E ( X 2 ) ( E ( X ) ) 2      (M1)

 

Note: Award (M1) only if ( E ( X ) ) 2 is used correctly.

 

( Var ( X ) = 10.37 8.29 )

Var ( X ) = 2.08   ( = 2.0756 )    A1

 

Note: Accept 2.11.

 

METHOD 3

E ( X ) = 2.88   ( = 0.06 + 0.27 + 0.5 + 0.98 + 0.63 + 0.44 )    (A1)

use of Var ( X ) = E ( ( X E ( X ) ) 2 )      (M1)

( 0.679728 + + 0.549152 )

Var ( E ) = 2.08   ( = 2.0756 )    A1

[3 marks]

b.

Examiners report

[N/A]
a.
[N/A]
b.



When carpet is manufactured, small faults occur at random. The number of faults in Premium carpets can be modelled by a Poisson distribution with mean 0.5 faults per 20 m2. Mr Jones chooses Premium carpets to replace the carpets in his office building. The office building has 10 rooms, each with the area of 80 m2.

Find the probability that the carpet laid in the first room has fewer than three faults.

[3]
a.

Find the probability that exactly seven rooms will have fewer than three faults in the carpet.

[3]
b.

Markscheme

* This question is from an exam for a previous syllabus, and may contain minor differences in marking or structure.

λ = 4 × 0.5      (M1)

λ = 2      (A1)

P ( X 2 ) = 0.677      A1

[3 marks]

a.

Y B ( 10 ,   0 , 677 )      (M1)(A1)

P ( Y = 7 ) = 0.263      A1

 

Note:     Award M1 for clear recognition of binomial distribution.

 

[3 marks]

b.

Examiners report

[N/A]
a.
[N/A]
b.



The hens on a farm lay either white or brown eggs. The eggs are put into boxes of six. The farmer claims that the number of brown eggs in a box can be modelled by the binomial distribution, B(6, p ). By inspecting the contents of 150 boxes of eggs she obtains the following data.

Show that this data leads to an estimated value of p = 0.4 .

[1]
a.

Stating null and alternative hypotheses, carry out an appropriate test at the 5 % level to decide whether the farmer’s claim can be justified.

[11]
b.

Markscheme

from the sample, the probability of a brown egg is

0 × 7 + 1 × 32 + 6 × 150 = 360 900 = 0.4        A1

p = 0.4        AG

[1 mark]

a.

if the data can be modelled by a binomial distribution with p = 0.4 , the expected frequencies of boxes are given in the table

          A3

Notes: Deduct one mark for each error or omission.
Accept any rounding to at least one decimal place.

null hypothesis: the distribution is binomial          A1

alternative hypothesis: the distribution is not binomial          A1

for a chi-squared test the last two columns should be combined           R1

χ calc 2 = ( 7 7 ) 2 7 + ( 32 28 ) 2 28 + = 6.05   (Accept 6.06)          (M1)A1

degrees of freedom = 4          A1

critical value = 9.488           A1

Or use of p -value

we conclude that the farmer’s claim can be justified          R1

[11 marks]

b.

Examiners report

[N/A]
a.
[N/A]
b.



The number of taxis arriving at Cardiff Central railway station can be modelled by a Poisson distribution. During busy periods of the day, taxis arrive at a mean rate of 5.3 taxis every 10 minutes. Let T represent a random 10 minute busy period.

Find the probability that exactly 4 taxis arrive during T.

[2]
a.i.

Find the most likely number of taxis that would arrive during T.

[2]
a.ii.

Given that more than 5 taxis arrive during T, find the probability that exactly 7 taxis arrive during T.

[3]
a.iii.

During quiet periods of the day, taxis arrive at a mean rate of 1.3 taxis every 10 minutes.

Find the probability that during a period of 15 minutes, of which the first 10 minutes is busy and the next 5 minutes is quiet, that exactly 2 taxis arrive.

[6]
b.

Markscheme

X Po ( 5.3 )

P ( X = 4 ) = e 5.3 5.3 4 4 !      (M1)

= 0.164      A1

[2 marks]

a.i.

METHOD 1

listing probabilities (table or graph)      M1

mode X = 5 (with probability 0.174)     A1

Note: Award M0A0 for 5 (taxis) or mode = 5 with no justification.

 

METHOD 2

mode is the integer part of mean      R1

E(X) = 5.3 ⇒ mode = 5      A1

Note: Do not allow R0A1.

[2 marks]

a.ii.

attempt at conditional probability       (M1)

P ( X = 7 ) P ( X 6 ) or equivalent  ( = 0.1163 0.4365 )       A1

= 0.267       A1

[3 marks]

a.iii.

METHOD 1

the possible arrivals are (2,0), (1,1), (0,2)       (A1)

Y Po ( 0.65 )      A1

attempt to compute, using sum and product rule,      (M1)

0.070106… × 0.52204… + 0.026455… × 0.33932… + 0.0049916… × 0.11028…      (A1)(A1)

Note: Award A1 for one correct product and A1 for two other correct products.

= 0.0461       A1

[6 marks]

 

METHOD 2

recognising a sum of 2 independent Poisson variables eg Z = X + Y      R1

λ = 5.3 + 1.3 2

P(Z = 2) = 0.0461     (M1)A3

[6 marks]

 

b.

Examiners report

[N/A]
a.i.
[N/A]
a.ii.
[N/A]
a.iii.
[N/A]
b.



The age, L, in years, of a wolf can be modelled by the normal distribution L ~ N(8, 5).

Find the probability that a wolf selected at random is at least 5 years old.

[2]
a.

Eight wolves are independently selected at random and their ages recorded.

Find the probability that more than six of these wolves are at least 5 years old.

[3]
b.

Markscheme

* This question is from an exam for a previous syllabus, and may contain minor differences in marking or structure.

P(L ≥ 5) = 0.910      (M1)A1

[2 marks]

a.

X is the number of wolves found to be at least 5 years old recognising binomial distribution      M1

X ~ B(8, 0.910…)

P(X > 6) = 1 − P(X ≤ 6)      (M1)

= 0.843       A1

Note: Award M1A0 for finding P(X ≥ 6).

[3 marks]

b.

Examiners report

[N/A]
a.
[N/A]
b.



Consider two events A and B such that P ( A ) = k ,  P ( B ) = 3 k ,  P ( A B ) = k 2 and P ( A B ) = 0.5 .

Calculate k ;

[3]
a.

Find P ( A B ) .

[3]
b.

Markscheme

* This question is from an exam for a previous syllabus, and may contain minor differences in marking or structure.

use of P ( A B ) = P ( A ) + P ( B ) P ( A B )      M1

0.5 = k + 3 k k 2      A1

k 2 4 k + 0.5 = 0

k = 0.129      A1

 

Note:     Do not award the final A1 if two solutions are given.

 

[3 marks]

a.

use of P ( A B ) = P ( B ) P ( A B ) or alternative     (M1)

P ( A B ) = 3 k k 2      (A1)

= 0.371      A1

[3 marks]

b.

Examiners report

[N/A]
a.
[N/A]
b.



The mean number of squirrels in a certain area is known to be 3.2 squirrels per hectare of woodland. Within this area, there is a 56 hectare woodland nature reserve. It is known that there are currently at least 168 squirrels in this reserve.

Assuming the population of squirrels follow a Poisson distribution, calculate the probability that there are more than 190 squirrels in the reserve.

Markscheme

* This question is from an exam for a previous syllabus, and may contain minor differences in marking or structure.

X is number of squirrels in reserve
X ∼ Po(179.2)      A1

Note: Award A1 if 179.2 or 56 × 3.2 seen or implicit in future calculations.

recognising conditional probability     M1

P(X > 190 | X ≥ 168)

= P ( X > 190 ) P ( X 168 ) = ( 0.19827 0.80817 )        (A1)(A1)

= 0.245      A1

[5 marks]

Examiners report

[N/A]