
HL Paper 2
Charlotte decides to model the shape of a cupcake to calculate its volume.
From rotating a photograph of her cupcake she estimates that its cross-section passes through the points and , where all units are in centimetres. The cross-section is symmetrical in the -axis, as shown below:
She models the section from to as a straight line.
Charlotte models the section of the cupcake that passes through the points and with a quadratic curve.
Charlotte thinks that a quadratic with a maximum point at and that passes through the point would be a better fit.
Believing this to be a better model for her cupcake, Charlotte finds the volume of revolution about the -axis to estimate the volume of the cupcake.
Find the equation of the line passing through these two points.
Find the equation of the least squares regression quadratic curve for these four points.
By considering the gradient of this curve when , explain why it may not be a good model.
Find the equation of the new model.
Write down an expression for her estimate of the volume as a sum of two integrals.
Find the value of Charlotte’s estimate.
Markscheme
A1A1
Note: Award A1 for , A1 for .
Award a maximum of A0A1 if not part of an equation.
[2 marks]
(M1)A1
[2 marks]
gradient of curve is positive at R1
Note: Accept a sensible rationale that refers to the gradient.
[1 mark]
METHOD 1
let
differentiating or using (M1)
substituting in the coordinates
(A1)
(A1)
solve to get
OR A1
Note: Use of quadratic regression with points using the symmetry of the graph is a valid method.
METHOD 2
(M1)
(M1)
(A1)
OR A1
[4 marks]
(M1)(M1) (M1)A1
Note: Award (M1)(M1)(M1)A0 if is omitted but response is otherwise correct. Award (M1) for an integral that indicates volume, (M1) for their part (a) within their volume integral, (M1) for their part (b)(i) within their volume integral, A1 for their correct two integrals with all correct limits.
[4 marks]
A1
[1 mark]
Examiners report
It is known that the weights of male Persian cats are normally distributed with mean and variance .
A group of male Persian cats are drawn from this population.
The male cats are now joined by female Persian cats. The female cats are drawn from a population whose weights are normally distributed with mean and standard deviation .
Ten female cats are chosen at random.
Sketch a diagram showing the above information.
Find the proportion of male Persian cats weighing between and .
Determine the expected number of cats in this group that have a weight of less than .
Find the probability that exactly one of them weighs over .
Let be the number of cats weighing over .
Find the variance of .
A cat is selected at random from all cats.
Find the probability that the cat was female, given that its weight was over .
Markscheme
A1A1
Note: Award A1 for a normal curve with mean labelled or , A1 for indication of SD marks on horizontal axis at and/or OR and/or on the correct side and approximately correct position.
[2 marks]
OR labelled sketch of region (M1)
A1
[2 marks]
(A1)
(M1)
A1
[3 marks]
,
(A1)
use of binomial seen or implied (M1)
using (M1)
A1
[4 marks]
A1
[1 mark]
(A1)
attempt use of tree diagram OR use of (M1)
(A1)
A1
[4 marks]
Examiners report
A city has two cable companies, X and Y. Each year 20 % of the customers using company X move to company Y and 10 % of the customers using company Y move to company X. All additional losses and gains of customers by the companies may be ignored.
Initially company X and company Y both have 1200 customers.
Write down a transition matrix T representing the movements between the two companies in a particular year.
Find the eigenvalues and corresponding eigenvectors of T.
Hence write down matrices P and D such that T = PDP−1.
Find an expression for the number of customers company X has after years, where .
Hence write down the number of customers that company X can expect to have in the long term.
Markscheme
M1A1
[2 marks]
M1
and 0.7 A1
eigenvectors and (M1)A1
Note: Accept any scalar multiple of the eigenvectors.
[4 marks]
EITHER
P = D = A1A1
OR
P = D = A1A1
[2 marks]
P−1 = A1
M1A1
attempt to multiply matrices M1
so in company A, after years, A1
[5 marks]
400 × 2 = 800 A1
[1 mark]
Examiners report
In a small village there are two doctors’ clinics, one owned by Doctor Black and the other owned by Doctor Green. It was noted after each year that of Doctor Black’s patients moved to Doctor Green’s clinic and of Doctor Green’s patients moved to Doctor Black’s clinic. All additional losses and gains of patients by the clinics may be ignored.
At the start of a particular year, it was noted that Doctor Black had patients on their register, compared to Doctor Green’s patients.
Write down a transition matrix indicating the annual population movement between clinics.
Find a prediction for the ratio of the number of patients Doctor Black will have, compared to Doctor Green, after two years.
Find a matrix , with integer elements, such that , where is a diagonal matrix.
Hence, show that the long-term transition matrix is given by .
Hence, or otherwise, determine the expected ratio of the number of patients Doctor Black would have compared to Doctor Green in the long term.
Markscheme
M1A1
Note: Award M1A1 for .
Award the A1 for a transposed if used correctly in part (b) i.e. preceded by matrix rather than followed by a matrix.
[2 marks]
(M1)
so ratio is A1
[2 marks]
to solve
(M1)
OR (A1)
attempt to find eigenvectors for at least one eigenvalue (M1)
when (or any real multiple) (A1)
when (or any real multiple) (A1)
therefore (accept integer valued multiples of their eigenvectors and columns in either order) A1
[6 marks]
(A1)
Note: This mark is independent, and may be seen anywhere in part (d).
(A1)
(M1)A1
Note: Award (M1)A0 for finding correctly.
as R1
so A1
AG
Note: The AG line must be seen for the final A1 to be awarded.
[6 marks]
METHOD ONE
(M1)
so ratio is A1
METHOD TWO
long term ratio is the eigenvector associated with the largest eigenvalue (M1)
A1
[2 marks]
Examiners report
The masses in kilograms of melons produced by a farm can be modelled by a normal distribution with a mean of and a standard deviation of .
Find the probability that two melons picked at random and independently of each other will
One year due to favourable weather conditions it is thought that the mean mass of the melons has increased.
The owner of the farm decides to take a random sample of melons to test this hypothesis at the significance level, assuming the standard deviation of the masses of the melons has not changed.
Unknown to the farmer the favourable weather conditions have led to all the melons having greater mass than the model described above.
Find the probability that a melon selected at random will have a mass greater than .
both have a mass greater than .
have a total mass greater than .
Write down the null and alternative hypotheses for the test.
Find the critical region for this test.
Find the mean and standard deviation of the mass of the melons for this year.
Find the probability of a Type II error in the owner’s test.
Markscheme
* This sample question was produced by experienced DP mathematics senior examiners to aid teachers in preparing for external assessment in the new MAA course. There may be minor differences in formatting compared to formal exam papers.
Let represent the mass of a melon
(M1)A1
[2 marks]
(M1)
A1
[2 marks]
Let represent the total mass
A1
(M1)A1
A1
[4 marks]
Let be the mean mass of the melons produced by the farm.
, only A1
Note: Accept The mean mass of melons produced by the farm is equal to
The mean mass of melons produced by the farm is greater than
Note: Award A0 if does not appear in the hypothesis.
[1 mark]
Under A1
(M1)
(A1)
Critical region is A1
[4 marks]
Let represent the new mass of the melons
A1
Standard deviation of (M1)
A1
Note: award M1A0 for
[3 marks]
(M1)
A1
Note: Accept from use of the three‐figure answer to part (d)
[2 marks]
Examiners report
The random variable X is thought to follow a binomial distribution B (4, ). In order to investigate this belief, a random sample of 100 observations on X was taken with the following results.
An automatic machine is used to fill bottles of water. The amount delivered, ml, may be assumed to be normally distributed with mean ml and standard deviation 8 ml. Initially, the machine is adjusted so that the value of is 500. In order to check that the value of remains equal to 500, a random sample of 10 bottles is selected at regular intervals, and the mean amount of water, , in these bottles is calculated. The following hypotheses are set up.
H0: = 500; H1: ≠ 500
The critical region is defined to be .
State suitable hypotheses for testing this belief.
Calculate the mean of these data and hence estimate the value of .
Calculate an appropriate value of and state your conclusion, using a 1% significance level.
Find the significance level of this procedure.
Some time later, the actual value of is 503. Find the probability of a Type II error.
Markscheme
H0 : The data are B (4, ); H1 : The data are not B (4, ) A1
[1 mark]
Mean M1A1
= 1.8 A1
M1A1
[5 marks]
The expected frequencies are
A1A1A1A1A1
The last two classes must be combined because the expected frequency for = 4 is less than 5. R1
M2
= 18.0 A2
DF = 2 (A1)
Critical value = 9.21 A1
We conclude, at the 1% significance level, that X does not fit a binomial model. R1
Special case: award the following marks to candidates who do not combine classes.
M2
= 39.6 A0
DF = 3 (A1)
Critical value = 11.345 A1
We conclude, at the 1% significance level, that X does not fit a binomial model. R1
[13 marks]
Under H0, the distribution of is N (500, 6.4). (A1)
Significance level = P < 495 or > 505 | H0 M2
= 2 × 0.02405 (A1)
= 0.0481 A1 N5
[5 marks]
The distribution of is now N (503, 6.4). (A1)
P(Type ΙΙ error) = P(495 < < 505) (M1)
= 0.785 A1 N3
[3 marks]
Examiners report
A student investigating the relationship between chemical reactions and temperature finds the Arrhenius equation on the internet.
This equation links a variable with the temperature , where and are positive constants and .
The Arrhenius equation predicts that the graph of against is a straight line.
Write down
The following data are found for a particular reaction, where is measured in Kelvin and is measured in :
Find an estimate of
Show that is always positive.
Given that and , sketch the graph of against .
(i) the gradient of this line in terms of ;
(ii) the -intercept of this line in terms of .
Find the equation of the regression line for on .
.
It is not required to state units for this value.
.
It is not required to state units for this value.
Markscheme
attempt to use chain rule, including the differentiation of (M1)
A1
this is the product of positive quantities so must be positive R1
Note: The R1 may be awarded for correct argument from their derivative. R1 is not possible if their derivative is not always positive.
[3 marks]
A1A1A1
Note: Award A1 for an increasing graph, entirely in first quadrant, becoming concave down for larger values of , A1 for tending towards the origin and A1 for asymptote labelled at .
[3 marks]
taking of both sides OR substituting and (M1)
OR (A1)
(i) so gradient is A1
(ii) -intercept is A1
Note: The implied (M1) and (A1) can only be awarded if both correct answers are seen. Award zero if only one value is correct and no working is seen.
[4 marks]
an attempt to convert data to and (M1)
e.g. at least one correct row in the following table
line is A1
[2 marks]
A1
[1 mark]
attempt to rearrange or solve graphically (M1)
A1
Note: Accept an value of … from use of value.
[2 marks]
Examiners report
This question caused significant difficulties for many candidates and many did not even attempt the question. Very few candidates were able to differentiate the expression in part (a) resulting in difficulties for part (b). Responses to parts (c) to (e) illustrated a lack of understanding of linearizing a set of data. Those candidates that were able to do part (d) frequently lost a mark as their answer was given in x and y.
Willow finds that she receives approximately 70 emails per working day.
She decides to model the number of emails received per working day using the random variable , where follows a Poisson distribution with mean 70.
In order to test her model, Willow records the number of emails she receives per working day over a period of 6 months. The results are shown in the following table.
From the table, calculate
Archie works for a different company and knows that he receives emails according to a Poisson distribution, with a mean of emails per day.
Using this distribution model, find .
Using this distribution model, find the standard deviation of .
an estimate for the mean number of emails received per working day.
an estimate for the standard deviation of the number of emails received per working day.
Give one piece of evidence that suggests Willow’s Poisson distribution model is not a good fit.
Suppose that the probability of Archie receiving more than 10 emails in total on any one day is 0.99. Find the value of λ.
Now suppose that Archie received exactly 20 emails in total in a consecutive two day period. Show that the probability that he received exactly 10 of them on the first day is independent of λ.
Markscheme
(M1)
= 0.102 A1
[2 marks]
standard deviation = (= 8.37) (M1)A1
[2 marks]
use of midpoints (accept consistent use of 45, 55 etc.) (M1)
(M1)
A1
Note: If 45, 55, etc. are used consistently instead of midpoints (implied by the answer 71.58…) award M1M1A0.
[3 marks]
13.9 (M1)A1
[2 marks]
valid reason given to include the examples below R1
variance is 192 which is not close to the mean (accept not equal to) standard deviation too high (using parts (a)(ii) and (b)(ii))
relative frequency of ≤ 59 is 0.142 which is too high (using part (a)(i))
Poisson would give a frequency of roughly 14 for 80 ≤ ≤ 89
Note: Reasons which do not use values found in previous parts must be backed up with numerical evidence.
[1 mark]
(M1)
attempt to solve a correct equation (M1)
λ = 20.1 A1
[3 marks]
in 1 day, no of emails is X ~ Po(λ)
in 2 days, no of emails is Y ~ Po(2λ) (A1)
P(10 on first day | 20 in 2 days) (M1)
(M1)
A1
A1
which is independent of λ AG
[5 marks]
Examiners report
Long term experience shows that if it is sunny on a particular day in Vokram, then the probability that it will be sunny the following day is . If it is not sunny, then the probability that it will be sunny the following day is .
The transition matrix is used to model this information, where .
The matrix can be written as a product of three matrices, , where is a diagonal matrix.
It is sunny today. Find the probability that it will be sunny in three days’ time.
Find the eigenvalues and eigenvectors of .
Write down the matrix .
Write down the matrix .
Hence find the long-term percentage of sunny days in Vokram.
Markscheme
finding OR use of tree diagram (M1)
the probability of sunny in three days’ time is A1
[2 marks]
attempt to find eigenvalues (M1)
Note: Any indication that has been used is sufficient for the (M1).
A1
attempt to find either eigenvector (M1)
so an eigenvector is A1
so an eigenvector is A1
Note: Accept multiples of the stated eigenvectors.
[5 marks]
OR A1
Note: Examiners should be aware that different, correct, matrices may be seen.
[1 mark]
OR A1
Note: and must be consistent with each other.
[1 mark]
(M1)
OR (A1)
Note: Award A1 only if their corresponds to their
(M1)
A1
[4 marks]
Examiners report
A continuous random variable has probability density function given by
It is given that .
Eight independent observations of are now taken and the random variable is the number of observations such that .
Show that and .
Find .
Find .
Find the median of .
Find .
Find .
Markscheme
* This question is from an exam for a previous syllabus, and may contain minor differences in marking or structure.
M1A1
M1A1
Note: could be seen/used in place of either of the above equations.
evidence of an attempt to solve simultaneously (or check given a,b values are consistent) M1
AG
[5 marks]
(M1)
A1
[2 marks]
(M1)
A1
[2 marks]
(M1)
(A1)
A1
[3 marks]
(M1)
A1
[2 marks]
A1
[1 mark]
Examiners report
Arianne plays a game of darts.
The distance that her darts land from the centre, , of the board can be modelled by a normal distribution with mean and standard deviation .
Find the probability that
Each of Arianne’s throws is independent of her previous throws.
In a competition a player has three darts to throw on each turn. A point is scored if a player throws all three darts to land within a central area around . When Arianne throws a dart the probability that it lands within this area is .
In the competition Arianne has ten turns, each with three darts.
a dart lands less than from .
a dart lands more than from .
Find the probability that Arianne throws two consecutive darts that land more than from .
Find the probability that Arianne does not score a point on a turn of three darts.
Find Arianne’s expected score in the competition.
Find the probability that Arianne scores at least points in the competition.
Find the probability that Arianne scores at least points and less than points.
Given that Arianne scores at least points, find the probability that Arianne scores less than points.
Markscheme
Let be the random variable “distance from ”.
(M1)(A1)
[2 marks]
A1
[1 mark]
(M1)
A1
[2 marks]
(M1)
A1
[2 marks]
let be the random variable “number of points scored”
evidence of use of binomial distribution (M1)
(A1)
(M1)
A1
[4 marks]
A1
[1 mark]
(M1)
A1
Note: Award M1 for a correct probability statement or indication of correct lower and upper bounds, and .
[2 marks]
(M1)
A1
[2 marks]
Examiners report
Most candidates started this question well and applied the normal distribution correctly in part (a). The same goes for part (b) where the candidates were able to combine probabilities correctly. Part (c) was not very well done, and there were a surprising number of incorrect approaches on a seemingly straightforward problem. This suggests that candidates were not interpreting the problem correctly and there was a lack of careful reading to be sure of the scenario being described. In part (d) most candidates recognized the need to model the situation with the binomial distribution. However, many candidates did not choose a correct probability for the Bernoulli trial in this question and oversimplified the problem. This again seems to be a problem of “interpretation” rather than “conceptual understanding”.
Most candidates started this question well and applied the normal distribution correctly in part (a). The same goes for part (b) where the candidates were able to combine probabilities correctly. Part (c) was not very well done, and there were a surprising number of incorrect approaches on a seemingly straightforward problem. This suggests that candidates were not interpreting the problem correctly and there was a lack of careful reading to be sure of the scenario being described. In part (d) most candidates recognized the need to model the situation with the binomial distribution. However, many candidates did not choose a correct probability for the Bernoulli trial in this question and oversimplified the problem. This again seems to be a problem of “interpretation” rather than “conceptual understanding”.
Most candidates started this question well and applied the normal distribution correctly in part (a). The same goes for part (b) where the candidates were able to combine probabilities correctly. Part (c) was not very well done, and there were a surprising number of incorrect approaches on a seemingly straightforward problem. This suggests that candidates were not interpreting the problem correctly and there was a lack of careful reading to be sure of the scenario being described. In part (d) most candidates recognized the need to model the situation with the binomial distribution. However, many candidates did not choose a correct probability for the Bernoulli trial in this question and oversimplified the problem. This again seems to be a problem of “interpretation” rather than “conceptual understanding”.
Most candidates started this question well and applied the normal distribution correctly in part (a). The same goes for part (b) where the candidates were able to combine probabilities correctly. Part (c) was not very well done, and there were a surprising number of incorrect approaches on a seemingly straightforward problem. This suggests that candidates were not interpreting the problem correctly and there was a lack of careful reading to be sure of the scenario being described. In part (d) most candidates recognized the need to model the situation with the binomial distribution. However, many candidates did not choose a correct probability for the Bernoulli trial in this question and oversimplified the problem. This again seems to be a problem of “interpretation” rather than “conceptual understanding”.
Most candidates started this question well and applied the normal distribution correctly in part (a). The same goes for part (b) where the candidates were able to combine probabilities correctly. Part (c) was not very well done, and there were a surprising number of incorrect approaches on a seemingly straightforward problem. This suggests that candidates were not interpreting the problem correctly and there was a lack of careful reading to be sure of the scenario being described. In part (d) most candidates recognized the need to model the situation with the binomial distribution. However, many candidates did not choose a correct probability for the Bernoulli trial in this question and oversimplified the problem. This again seems to be a problem of “interpretation” rather than “conceptual understanding”.
Most candidates started this question well and applied the normal distribution correctly in part (a). The same goes for part (b) where the candidates were able to combine probabilities correctly. Part (c) was not very well done, and there were a surprising number of incorrect approaches on a seemingly straightforward problem. This suggests that candidates were not interpreting the problem correctly and there was a lack of careful reading to be sure of the scenario being described. In part (d) most candidates recognized the need to model the situation with the binomial distribution. However, many candidates did not choose a correct probability for the Bernoulli trial in this question and oversimplified the problem. This again seems to be a problem of “interpretation” rather than “conceptual understanding”.
Most candidates started this question well and applied the normal distribution correctly in part (a). The same goes for part (b) where the candidates were able to combine probabilities correctly. Part (c) was not very well done, and there were a surprising number of incorrect approaches on a seemingly straightforward problem. This suggests that candidates were not interpreting the problem correctly and there was a lack of careful reading to be sure of the scenario being described. In part (d) most candidates recognized the need to model the situation with the binomial distribution. However, many candidates did not choose a correct probability for the Bernoulli trial in this question and oversimplified the problem. This again seems to be a problem of “interpretation” rather than “conceptual understanding”.
Most candidates started this question well and applied the normal distribution correctly in part (a). The same goes for part (b) where the candidates were able to combine probabilities correctly. Part (c) was not very well done, and there were a surprising number of incorrect approaches on a seemingly straightforward problem. This suggests that candidates were not interpreting the problem correctly and there was a lack of careful reading to be sure of the scenario being described. In part (d) most candidates recognized the need to model the situation with the binomial distribution. However, many candidates did not choose a correct probability for the Bernoulli trial in this question and oversimplified the problem. This again seems to be a problem of “interpretation” rather than “conceptual understanding”.
A Chocolate Shop advertises free gifts to customers that collect three vouchers. The vouchers are placed at random into 10% of all chocolate bars sold at this shop. Kati buys some of these bars and she opens them one at a time to see if they contain a voucher. Let be the probability that Kati obtains her third voucher on the bar opened.
(It is assumed that the probability that a chocolate bar contains a voucher stays at 10% throughout the question.)
It is given that for .
Kati’s mother goes to the shop and buys chocolate bars. She takes the bars home for Kati to open.
Show that and .
Find the values of the constants and .
Deduce that for .
(i) Hence show that has two modes and .
(ii) State the values of and .
Determine the minimum value of such that the probability Kati receives at least one free gift is greater than 0.5.
Markscheme
* This question is from an exam for a previous syllabus, and may contain minor differences in marking or structure.
A1
AG
(M1)
(or equivalent) A1
AG
[3 marks]
METHOD 1
attempting to form equations in and M1
A1
A1
attempting to solve simultaneously (M1)
A1
METHOD 2
M1
(M1)A1
A1
A1
Note: Condone the absence of in the determination of the values of and .
[5 marks]
METHOD 1
EITHER
(M1)
OR
(M1)
THEN
A1
A1
A1
AG
METHOD 2
(M1)
A1A1
Note: Award A1 for a correct numerator and A1 for a correct denominator.
A1
AG
[4 marks]
(i) attempting to solve for M1
A1
R1
R1
has two modes AG
Note: Award R1R1 for a clearly labelled graphical representation of the two inequalities (using ).
(ii) the modes are 20 and 21 A1
[5 marks]
METHOD 1
(A1)
attempting to solve (or equivalent eg ) for (M1)
Note: Award (M1) for attempting to solve an equality (obtaining ).
A1
METHOD 2
(A1)
attempting to solve for (M1)
A1
[3 marks]
Examiners report
The weights, in grams, of individual packets of coffee can be modelled by a normal distribution, with mean and standard deviation .
Find the probability that a randomly selected packet has a weight less than .
The probability that a randomly selected packet has a weight greater than grams is . Find the value of .
A packet is randomly selected. Given that the packet has a weight greater than , find the probability that it has a weight greater than .
From a random sample of packets, determine the number of packets that would be expected to have a weight lying within standard deviations of the mean.
Packets are delivered to supermarkets in batches of . Determine the probability that at least packets from a randomly selected batch have a weight less than .
Markscheme
* This question is from an exam for a previous syllabus, and may contain minor differences in marking or structure.
(M1)A1
[2 marks]
(M1)
A1
[2 marks]
(M1)
(A1)
A1
[3 marks]
EITHER
(A1)
OR
(A1)
THEN
(M1)
A1
[3 marks]
(A1)
recognising (M1)
now using (M1)
A1
[4 marks]
Examiners report
Hank sets up a bird table in his garden to provide the local birds with some food. Hank notices that a specific bird, a large magpie, visits several times per month and he names him Bill. Hank models the number of times per month that Bill visits his garden as a Poisson distribution with mean .
Over the course of consecutive months, find the probability that Bill visits the garden:
Using Hank’s model, find the probability that Bill visits the garden on exactly four occasions during one particular month.
on exactly occasions.
during the first and third month only.
Find the probability that over a -month period, there will be exactly months when Bill does not visit the garden.
After the first year, a number of baby magpies start to visit Hank’s garden. It may be assumed that each of these baby magpies visits the garden randomly and independently, and that the number of times each baby magpie visits the garden per month is modelled by a Poisson distribution with mean .
Determine the least number of magpies required, including Bill, in order that the probability of Hank’s garden having at least magpie visits per month is greater than .
Markscheme
A1
[1 mark]
(M1)
A1
[2 marks]
(M1)
(A1)
A1
[3 marks]
(A1)
(M1)(A1)
Note: Award M1 for recognizing binomial probability, and A1 for correct parameters.
A1
[4 marks]
METHOD ONE
(M1)(A1)(A1)
Note: Award M1 for evidence of a cumulative Poisson with , A1 for and A1 for .
so require magpies (including Bill) A1
METHOD TWO
evidence of a cumulative Poisson with (M1)
sketch of curve and (A1)
(intersect at) (A1)
rounding up gives
so require magpies (including Bill) A1
[4 marks]
Examiners report
It is given that one in five cups of coffee contain more than 120 mg of caffeine.
It is also known that three in five cups contain more than 110 mg of caffeine.
Assume that the caffeine content of coffee is modelled by a normal distribution.
Find the mean and standard deviation of the caffeine content of coffee.
Markscheme
* This question is from an exam for a previous syllabus, and may contain minor differences in marking or structure.
let be the random variable “amount of caffeine content in coffee”
(M1)
Note: Award M1 for at least one correct probability statement.
(M1)(A1)(A1)
Note: Award M1 for attempt to find at least one appropriate -value.
attempt to solve simultaneous equations (M1)
A1
[6 marks]
Examiners report
John likes to go sailing every day in July. To help him make a decision on whether it is safe to go sailing he classifies each day in July as windy or calm. Given that a day in July is calm, the probability that the next day is calm is 0.9. Given that a day in July is windy, the probability that the next day is calm is 0.3. The weather forecast for the 1st July predicts that the probability that it will be calm is 0.8.
Draw a tree diagram to represent this information for the first three days of July.
Find the probability that the 3rd July is calm.
Find the probability that the 1st July was calm given that the 3rd July is windy.
Markscheme
* This question is from an exam for a previous syllabus, and may contain minor differences in marking or structure.
M1A2
Note: Award M1 for 3 stage tree-diagram, A2 for 0.8, 0.9, 0.3 probabilities correctly placed.
[3 marks]
(M1)A1
[2 marks]
(M1)
OR
OR (A1)(A1)
Note: Award A1 for correct numerator, A1 for correct denominator.
A1
[4 marks]
Examiners report
The curve is shown in the graph, for .
The curve passes through the following points.
It is required to find the area bounded by the curve, the -axis, the -axis and the line .
One possible model for the curve is a cubic function.
Use the trapezoidal rule to find an estimate for the area.
Use all the coordinates in the table to find the equation of the least squares cubic regression curve.
Write down the coefficient of determination.
Write down an expression for the area enclosed by the cubic regression curve, the -axis, the -axis and the line .
Find the value of this area.
Markscheme
Area = M1A1
Area = 43 A1
[3 marks]
M1A2
[3 marks]
A1
[1 mark]
Area = A1
[1 mark]
42.5 A2
[2 marks]
Examiners report
A café serves sandwiches and cakes. Each customer will choose one of the following three options; buy only a sandwich, buy only a cake or buy both a sandwich and a cake.
The probability that a customer buys a sandwich is 0.72 and the probability that a customer buys a cake is 0.45.
Find the probability that a customer chosen at random will buy
On a typical day 200 customers come to the café.
It is known that 46 % of the customers who come to the café are male, and that 80 % of these buy a sandwich.
both a sandwich and a cake.
only a sandwich.
Find the expected number of cakes sold on a typical day.
Find the probability that more than 100 cakes will be sold on a typical day.
A customer is selected at random. Find the probability that the customer is male and buys a sandwich.
A female customer is selected at random. Find the probability that she buys a sandwich.
Markscheme
use of formula or Venn diagram (M1)
0.72 + 0.45 − 1 (A1)
= 0.17 A1
[3 marks]
0.72 − 0.17 = 0.55 A1
[1 mark]
200 × 0.45 = 90 A1
[1 mark]
let X be the number of customers who order cake
X ~ B(200,0.45) (M1)
P(X > 100) = P(X ≥ 101)(= 1 − P(X ≤ 100)) (M1)
= 0.0681 A1
[3 marks]
0.46 × 0.8 = 0.368 A1
[1 mark]
METHOD 1
M1A1A1
Note: Award M1 for an appropriate tree diagram. Award M1 for LHS, M1 for RHS.
A1
METHOD 2
(M1)
A1A1
Note: Award A1 for numerator, A1 for denominator.
A1
[4 marks]
Examiners report
Dana has collected some data regarding the heights (metres) of waves against a pier at randomly chosen times in a single day. This data is shown in the table below.
She wishes to perform a -test at the significance level to see if the height of waves could be modelled by a normal distribution. Her null hypothesis is
The data can be modelled by a normal distribution.
From the table she calculates the mean of the heights in her sample to be and the standard deviation of the heights to be .
She calculates the expected values for each interval under this null hypothesis, and some of these values are shown in the table below.
Use the given value of to find the value of .
Find the value of and the value of , giving your answers correct to one decimal place.
Find the value of the test statistic for this test.
Determine the degrees of freedom for Dana’s test.
It is given that the critical value for this test is .
State the conclusion of the test in context. Use your answer to part (c) to justify your conclusion.
Markscheme
* This sample question was produced by experienced DP mathematics senior examiners to aid teachers in preparing for external assessment in the new MAA course. There may be minor differences in formatting compared to formal exam papers.
(M1)
Note: M1 is for the use of the correct formula
A1
[2 marks]
Using and (M1)
A1A1
[3 marks]
(M1)A1
[2 marks]
Combining columns with expected values less than leaves cells (M1)
A1
[2 marks]
R1
hence insufficient evidence to reject that the heights of the waves are normally distributed. A1
Note: The A1 can be awarded independently of the R1.
[2 marks]
Examiners report
The times taken for male runners to complete a marathon can be modelled by a normal distribution with a mean 196 minutes and a standard deviation 24 minutes.
It is found that 5% of the male runners complete the marathon in less than minutes.
The times taken for female runners to complete the marathon can be modelled by a normal distribution with a mean 210 minutes. It is found that 58% of female runners complete the marathon between 185 and 235 minutes.
Find the probability that a runner selected at random will complete the marathon in less than 3 hours.
Calculate .
Find the standard deviation of the times taken by female runners.
Markscheme
* This question is from an exam for a previous syllabus, and may contain minor differences in marking or structure.
(M1)A1
[2 marks]
(M1)
A1
[2 marks]
(M1)
or equivalent (M1)(A1)
A1
[4 marks]
Examiners report
A Principal would like to compare the students in his school with a national standard. He decides to give a test to eight students made up of four boys and four girls. One of the teachers offers to find the volunteers from his class.
The marks out of , for the students who took the test, are:
For the eight students find
The national standard mark is out of .
Two additional students take the test at a later date and the mean mark for all ten students is and the standard deviation is .
For further analysis, a standardized score out of for the ten students is obtained by multiplying the scores by and adding .
For the ten students, find
Name the type of sampling that best describes the method used by the Principal.
the mean mark.
the standard deviation of the marks.
Perform an appropriate test at the significance level to see if the mean marks achieved by the students in the school are higher than the national standard. It can be assumed that the marks come from a normal population.
State one reason why the test might not be valid.
their mean standardized score.
the standard deviation of their standardized score.
Markscheme
quota A1
[1 mark]
(M1)A1
[2 marks]
A1
[1 mark]
(let be the national mean)
A1
Note: Accept hypotheses in words if they are clearly expressed and ‘population mean’ or ‘school mean’ is referred to. Do not accept unless is explicitly defined as “national standard mark” or given as .
recognizing -test (M1)
-value A1
R1
Note: The R1 mark is for the comparison of their -value with .
insufficient evidence to reject the null hypothesis (that the mean for the school is ) A1
Note: Award the final A1 only if the null hypothesis is also correct (e.g. or (population) mean ) and the conclusion is consistent with both the direction of the inequality and the alternative hypothesis.
[5 marks]
EITHER
the sampling process is not random R1
For example:
the school asked for volunteers
the students were selected from a single class
OR
the quota might not be representative of the student population R1
For example:
the school may have only boys and girls.
Note: Do not accept ‘the sample is too small’.
[1 mark]
A1
[1 mark]
(A1)
A1
[2 marks]
Examiners report
The most common answer to this question was ‘convenience sampling’. Though it is a convenience sample because four boys and four girls were required the most appropriate response was ‘quota sampling’.
Some candidates still try to calculate a mean and standard deviation by hand. This is not expected.
This was surprisingly poorly answered given that statistical testing forms a large part of the course. Candidates need to give the null and alternative hypotheses, find a p-value, compare this to the significance level and write their conclusion, in context of the question; examination questions may ask for each element individually or the question may say “Perform the test” wherein it is expected that each individual element will be clearly stated (as the test is incomplete if any are omitted). Many candidates had the null hypothesis as an inequality. The easiest way to write the null hypothesis is , but it could also be stated in words so long as it is clear that the population mean is being referred to rather than the sample mean. For example, H0: The mean score of the whole school is equal to 25.2.
The answer that the sample was self-selecting or unrepresentative was the expected response. The sample being small was also accepted if the additional reason of therefore ‘not able to assume a normal population’ was also given. In general, a small sample can be valid (though will probably not be reliable).
Some candidates missed the point of this question, that it was concerned with transformations of the mean and standard deviation, and instead tried to work out the actual values for the extra two candidates.
The number of marathons that Audrey runs in any given year can be modelled by a Poisson distribution with mean 1.3 .
Calculate the probability that Audrey will run at least two marathons in a particular year.
Find the probability that she will run at least two marathons in exactly four out of the following five years.
Markscheme
(M1)A1
[2 marks]
(M1)A1
Note: Award (M1) for recognition of binomial or equivalent, A1 for correct parameters.
(M1)A1
[4 marks]
Examiners report
Packets of biscuits are produced by a machine. The weights , in grams, of packets of biscuits can be modelled by a normal distribution where . A packet of biscuits is considered to be underweight if it weighs less than 250 grams.
The manufacturer makes the decision that the probability that a packet is underweight should be 0.002. To do this is increased and remains unchanged.
The manufacturer is happy with the decision that the probability that a packet is underweight should be 0.002, but is unhappy with the way in which this was achieved. The machine is now adjusted to reduce and return to 253.
Given that and find the probability that a randomly chosen packet of biscuits is underweight.
Calculate the new value of giving your answer correct to two decimal places.
Calculate the new value of .
Markscheme
* This question is from an exam for a previous syllabus, and may contain minor differences in marking or structure.
(M1)A1
[2 marks]
(M1)(A1)
A1
Notes: Only award A1 here if the correct 2dp answer is seen. Award M0 for use of .
[3 marks]
(A1)
A1
[2 marks]
Examiners report
In a reforested area of pine trees, heights of trees planted in a specific year seem to follow a normal distribution. A sample of 100 such trees is selected to test the validity of this hypothesis. The results of measuring tree heights, to the nearest centimetre, are recorded in the first two columns of the table below.
Describe what is meant by
a goodness of fit test (a complete explanation required);
the level of significance of a hypothesis test.
Find the mean and standard deviation of the sample data in the table above. Show how you arrived at your answers.
Most of the expected frequencies have been calculated in the third column. (Frequencies have been rounded to the nearest integer, and frequencies in the first and last classes have been extended to include the rest of the data beyond 15 and 225. Find the values of , and and show how you arrived at your answers.
In order to test for the goodness of fit, the test statistic was calculated to be 1.0847. Show how this was done.
State your hypotheses, critical number, decision rule and conclusion (using a 5% level of significance).
Markscheme
A goodness of fit test is a statistical test of the hypothesis that a set of observed counts of cells of a certain large population is consistent with a set of theoretical counts. (R1)
The test statistic has a distribution with degrees of freedom. One degree of freedom is lost for every parameter that has to be estimated from the sample. (R1)
[2 marks]
The level of significance of a hypothesis test is the maximal probability that we reject a true null hypothesis. (R1)
[1 mark]
We use the class midpoints in the calculation of the mean and standard deviation.
(M1)
= 133.5 (A1)
(M1)
= 56.345 (= 56.3 to 3 sf) (A1)
[4 marks]
Every frequency is the product of the number of observations and the probability of a number in each class. Since by hypothesis we have a normal distribution, the probabilities can be read from a normal table with mean 133.5 and standard deviation 56.345 (M1)
E1 = 100 × P(45 ≤ ≤ 75) ≈ 9 so = 9 (A1)
E2 = 100 × P(135 ≤ ≤ 165) ≈ 20 so b = 20 (A1)
E3 = 100 × P(195 ≤ ≤ 225) ≈ 9 so = 9 (A1)
[4 marks]
The test statistic is a variable. Hence (M1)
(M1)
= 1.0847 (A1)
[3 marks]
H0: The distribution of tree heights is normally distributed
H1: The distribution is not normal (M1)
Since the mean and standard deviation were estimated from the sample, the number of degrees of freedom is 8 – 1 – 2 = 5 (A1)
The critical number is = 110705
If > 11.0705 we reject H0 (A1)
Since = 1.0847 < 11.0705, we fail to reject H0 (R1)
Conclusion: we do not have enough evidence to claim that the distribution of tree heights is not normal (R1)
[5 marks]
Examiners report
The random variable X has a normal distribution with mean μ = 50 and variance σ 2 = 16 .
Sketch the probability density function for X, and shade the region representing P(μ − 2σ < X < μ + σ).
Find the value of P(μ − 2σ < X < μ + σ).
Find the value of k for which P(μ − kσ < X < μ + kσ) = 0.5.
Markscheme
* This question is from an exam for a previous syllabus, and may contain minor differences in marking or structure.
normal curve centred on 50 A1
vertical lines at = 42 and = 54, with shading in between A1
[2 marks]
P(42 < X < 54) (= P(− 2 < Z < 1)) (M1)
= 0.819 A1
[2 marks]
P(μ − kσ < X < μ + kσ) = 0.5 ⇒ P(X < μ + kσ) = 0.75 (M1)
k = 0.674 A1
Note: Award M1A0 for k = −0.674.
[2 marks]
Examiners report
Loreto is a manager at the Da Vinci health centre. If the mean rate of patients arriving at the health centre exceeds per minute then Loreto will employ extra staff. It is assumed that the number of patients arriving in any given time period follows a Poisson distribution.
Loreto performs a hypothesis test to determine whether she should employ extra staff. She finds that patients arrived during a randomly selected -hour clinic.
Loreto is also concerned about the average waiting time for patients to see a nurse. The health centre aims for at least of patients to see a nurse in under minutes.
Loreto assumes that the waiting times for patients are independent of each other and decides to perform a hypothesis test at a significance level to determine whether the health centre is meeting its target.
Loreto surveys patients and finds that of them waited more than minutes.
Write down null and alternative hypotheses for Loreto’s test.
Using the data from Loreto’s sample, perform the hypothesis test at a significance level to determine if Loreto should employ extra staff.
Write down null and alternative hypotheses for this test.
Perform the test, clearly stating the conclusion in context.
Markscheme
let be the random variable “number of patients arriving in a minute”, such that
A1
A1
Note: Allow a value of for . Award at most A0A1 if it is not clear that it is the population mean being referred to e.g
The number of patients is equal to 1.5 every minute
The number of patients exceeds 1.5 every minute.
Referring to the “expected” number of patients or the use of or is sufficient for A1A1.
[2 marks]
under let be the number of patients in hours
(A1)
(M1)A1
since R1
(reject )
Loreto should employ more staff A1
[5 marks]
: The probability of a patient waiting less than minutes is A1
: The probability of a patient waiting less than minutes is less than A1
[2 marks]
Under let be the number of patients waiting more than minutes
(A1)
(M1)A1
since R1
(fail to reject )
insufficient evidence to suggest they are not meeting their target A1
Note: Do not accept “they are meeting target” for the A1. Accept use of and and any consistent use of a random variable, appropriate -value and significance level.
[5 marks]
Examiners report
In part (a) there was a general lack of consistency in how candidates wrote down their null and alternative hypotheses. It was surprising how many candidates solved a Poisson rather than to find their -value. This suggests a lack of understanding of the nature of distributions or more specifically the concepts of hypothesis testing. In part (b), which was challenging, there were issues for many candidates in interpreting the situation. This is understandable since it was difficult, but as previously mentioned interpretation is a general issue in the paper. When writing down the conclusion of the tests, there was often very loose use of the terms accept/reject and candidates seemed unclear of the significance and importance of the correct use of these terms.
In part (a) there was a general lack of consistency in how candidates wrote down their null and alternative hypotheses. It was surprising how many candidates solved a Poisson rather than to find their -value. This suggests a lack of understanding of the nature of distributions or more specifically the concepts of hypothesis testing. In part (b), which was challenging, there were issues for many candidates in interpreting the situation. This is understandable since it was difficult, but as previously mentioned interpretation is a general issue in the paper. When writing down the conclusion of the tests, there was often very loose use of the terms accept/reject and candidates seemed unclear of the significance and importance of the correct use of these terms.
In part (a) there was a general lack of consistency in how candidates wrote down their null and alternative hypotheses. It was surprising how many candidates solved a Poisson rather than to find their -value. This suggests a lack of understanding of the nature of distributions or more specifically the concepts of hypothesis testing. In part (b), which was challenging, there were issues for many candidates in interpreting the situation. This is understandable since it was difficult, but as previously mentioned interpretation is a general issue in the paper. When writing down the conclusion of the tests, there was often very loose use of the terms accept/reject and candidates seemed unclear of the significance and importance of the correct use of these terms.
In part (a) there was a general lack of consistency in how candidates wrote down their null and alternative hypotheses. It was surprising how many candidates solved a Poisson rather than to find their -value. This suggests a lack of understanding of the nature of distributions or more specifically the concepts of hypothesis testing. In part (b), which was challenging, there were issues for many candidates in interpreting the situation. This is understandable since it was difficult, but as previously mentioned interpretation is a general issue in the paper. When writing down the conclusion of the tests, there was often very loose use of the terms accept/reject and candidates seemed unclear of the significance and importance of the correct use of these terms.
The marks achieved by eight students in a class test are given in the following list.
The teacher increases all the marks by 2. Write down the new value for
the standard deviation.
Markscheme
2.22 A1
[1 mark]
Examiners report
A random variable is normally distributed with mean and standard deviation , such that and .
Find and .
Find .
Markscheme
* This question is from an exam for a previous syllabus, and may contain minor differences in marking or structure.
(M1)
either (M1)
(A1)
(A1)
attempting to solve simultaneously (M1)
and A1
[6 marks]
(or equivalent eg. ) (M1)
A1
Note: Award (M1)A1 for .
[2 marks]
Examiners report
A geneticist uses a Markov chain model to investigate changes in a specific gene in a cell as it divides. Every time the cell divides, the gene may mutate between its normal state and other states.
The model is of the form
where is the probability of the gene being in its normal state after dividing for the time, and is the probability of it being in another state after dividing for the time, where .
Matrix is found to be .
The gene is in its normal state when . Calculate the probability of it being in its normal state
Write down the value of .
What does represent in this context?
Find the eigenvalues of .
Find the eigenvectors of .
when .
in the long term.
Markscheme
A1
[1 mark]
the probability of mutating from ‘not normal state’ to ‘normal state’ A1
Note: The A1 can only be awarded if it is clear that transformation is from the mutated state.
[1 mark]
(M1)
Note: Award M1 for an attempt to find eigenvalues. Any indication that has been used is sufficient for the (M1).
OR (A1)
A1
[3 marks]
OR (M1)
Note: Award M1 can be awarded for attempting to find either eigenvector.
OR
and A1A1
Note: Accept any multiple of the given eigenvectors.
[3 marks]
OR (M1)
Note: Condone omission of the initial state vector for the M1.
A1
[2 marks]
(A1)
Note: Award A1 for OR seen.
A1
[2 marks]
Examiners report
There was some difficulty in interpreting the meaning of the values in the transition matrix, but most candidates did well with the rest of the question. In part (d) there was frequently evidence of a correct method, but a failure to identify the correct probabilities. It was surprising to see a significant number of candidates diagonalizing the matrix in part (d) and this often led to errors. Clearly this was not necessary.
Scientists have developed a type of corn whose protein quality may help chickens gain weight faster than the present type used. To test this new type, 20 one-day-old chicks were fed a ration that contained the new corn while another control group of 20 chicks was fed the ordinary corn. The data below gives the weight gains in grams, for each group after three weeks.
The scientists wish to investigate the claim that Group B gain weight faster than Group A. Test this claim at the 5% level of significance, noting which hypothesis test you are using. You may assume that the weight gain for each group is normally distributed, with the same variance, and independent from each other.
The data from the two samples above are combined to form a single set of data. The following frequency table gives the observed frequencies for the combined sample. The data has been divided into five intervals.
Test, at the 5% level, whether the combined data can be considered to be a sample from a normal population with a mean of 380.
Markscheme
This is a t-test of the difference of two means. Our assumptions are that the two populations are approximately normal, samples are random, and they are independent from each other. (R1)
H0: μ1 − μ2 = 0
H1: μ1 − μ2 < 0 (A1)
t = −2.460, (A1)
degrees of freedom = 38 (A1)
Since the value of critical t = −1.686 we reject H0. (A1)
Hence group B grows faster. (R1)
[6 marks]
This is a goodness-of-fit test.
To finish the table, the frequencies of the respective cells have to be calculated. Since the standard deviation is not given, it has to be estimated using the data itself. s = 49.59, eg the third expected frequency is 40 × 0.308 = 12.32, since P(350.5 < W < 390.5) = 0.3078...
The table of observed and expected frequencies is:
(M1)(A2)
Since the first expected frequency is 3.22, we combine the two cells, so that the first two rows become one row, that is,
(M1)
Number of degrees of freedom is 4 – 1 – 1 = 2 (C1)
H0: The distribution is normal with mean 380
H1: The distribution is not normal with mean 380 (A1)
The test statistic is
= 3.469 (A1)
With 2 degrees of freedom, the critical number is = 5.99 (A2)
So, we do not have enough evidence to reject the null hypothesis. Therefore, there is no evidence to say that the distribution is not normal with mean 380. (R1)
[10 marks]
Examiners report
The random variable X has a binomial distribution with parameters n and p.
It is given that E(X) = 3.5.
Find the least possible value of n.
It is further given that P(X ≤ 1) = 0.09478 correct to 4 significant figures.
Determine the value of n and the value of p.
Markscheme
* This question is from an exam for a previous syllabus, and may contain minor differences in marking or structure.
np = 3.5 (A1)
p ≤ 1 ⇒ least n = 4 A1
[2 marks]
(1 − p)n + np(1 − p)n−1 = 0.09478 M1A1
attempt to solve above equation with np = 3.5 (M1)
n = 12, p = (=0.292) A1A1
Note: Do not accept n as a decimal.
[5 marks]
Examiners report
A survey of British holidaymakers found that of those surveyed took a holiday in the Lake District in .
A random sample of British holidaymakers was taken. The number of people in the sample who took a holiday in the Lake District in can be modelled by a binomial distribution.
State two assumptions made in order for this model to be valid.
Find the probability that at least three people from the sample took a holiday in the Lake District in .
From a random sample of holidaymakers, the probability that at least one of them took a holiday in the Lake District in is greater than .
Determine the least possible value of .
Markscheme
people’s holidays are independent of each other R1
the proportion is constant (at ) R1
[2 marks]
(M1)A1
[2 marks]
probability of at least one probability of none
OR (A1)
attempt to solve inequality (M1)
so least possible A1
[3 marks]
Examiners report
A discrete random variable follows a Poisson distribution .
Show that .
Given that and , use part (a) to find the value of .
Markscheme
* This question is from an exam for a previous syllabus, and may contain minor differences in marking or structure.
METHOD 1
A1
M1A1
AG
METHOD 2
A1
M1A1
AG
METHOD 3
(M1)
A1
A1
and so AG
[3 marks]
A1
attempting to solve for (M1)
A1
[3 marks]
Examiners report
Jenna is a keen book reader. The number of books she reads during one week can be modelled by a Poisson distribution with mean .
Determine the expected number of weeks in one year, of weeks, during which Jenna reads at least four books.
Markscheme
* This question is from an exam for a previous syllabus, and may contain minor differences in marking or structure.
let be the random variable “number of books Jenna reads per week.”
then
(M1)(A1)
(M1)
A1
Note: Accept weeks.
[4 marks]
Examiners report
It is known that 56 % of Infiglow batteries have a life of less than 16 hours, and 94 % have a life less than 17 hours. It can be assumed that battery life is modelled by the normal distribution .
Find the value of and the value of .
Find the probability that a randomly selected Infiglow battery will have a life of at least 15 hours.
Markscheme
* This question is from an exam for a previous syllabus, and may contain minor differences in marking or structure.
use of inverse normal (implied by ±0.1509… or ±1.554…) (M1)
P(X < 16) = 0.56
(A1)
P(X < 17) = 0.94
(A1)
attempt to solve a pair of simultaneous equations (M1)
= 15.9, = 0.712 A1A1
[6 marks]
correctly shaded diagram or intent to find P(X ≥ 15) (M1)
= 0.895 A1
Note: Accept answers rounding to 0.89 or 0.90. Award M1A0 for the answer 0.9.
[2 marks]
Examiners report
The continuous random variable X has probability density function given by
Show that .
Find .
Given that , and that 0.25 < s < 0.4 , find the value of s.
Markscheme
* This question is from an exam for a previous syllabus, and may contain minor differences in marking or structure.
M1
Note: Award the M1 for the total integral equalling 1, or equivalent.
(M1)A1
AG
[3 marks]
EITHER
(M1)(A1)
A1
OR
(M1)
so (M1)A1
[3 marks]
M1A1
(A1)
A1
equating
(A1)
attempt to solve for s (M1)
s = 0.274 A1
[7 marks]
Examiners report
Steffi the stray cat often visits Will’s house in search of food. Let be the discrete random variable “the number of times per day that Steffi visits Will’s house”.
The random variable can be modelled by a Poisson distribution with mean 2.1.
Let Y be the discrete random variable “the number of times per day that Steffi is fed at Will’s house”. Steffi is only fed on the first four occasions that she visits each day.
Find the probability that on a randomly selected day, Steffi does not visit Will’s house.
Copy and complete the probability distribution table for Y.
Hence find the expected number of times per day that Steffi is fed at Will’s house.
In any given year of 365 days, the probability that Steffi does not visit Will for at most days in total is 0.5 (to one decimal place). Find the value of .
Show that the expected number of occasions per year on which Steffi visits Will’s house and is not fed is at least 30.
Markscheme
* This question is from an exam for a previous syllabus, and may contain minor differences in marking or structure.
(M1)A1
[2 marks]
A1A1A1A1
Note: Award A1 for each correct probability for Y = 1, 2, 3, 4. Accept 0.162 for P(Y = 4).
[4 marks]
(M1)
(A1)
A1
[3 marks]
let be the no of days per year that Steffi does not visit
(M1)
require (M1)
A1
[3 marks]
METHOD 1
let be the discrete random variable “number of times Steffi is not fed per day”
M1
A1
= 0.083979... A1
expected no of occasions per year > 0.083979... × 365 = 30.7 A1
hence Steffi can expect not to be fed on at least 30 occasions AG
Note: Candidates may consider summing more than three terms in their calculation for .
METHOD 2
M1A1
0.0903… × 365 M1
= 33.0 > 30 A1AG
[4 marks]
Examiners report
Timmy owns a shop. His daily income from selling his goods can be modelled as a normal distribution, with a mean daily income of $820, and a standard deviation of $230. To make a profit, Timmy’s daily income needs to be greater than $1000.
Calculate the probability that, on a randomly selected day, Timmy makes a profit.
The shop is open for 24 days every month.
Calculate the probability that, in a randomly selected month, Timmy makes a profit on between 5 and 10 days (inclusive).
Markscheme
* This question is from an exam for a previous syllabus, and may contain minor differences in marking or structure.
X ~ N(820, 2302) (M1)
Note: Award M1 for an attempt to use normal distribution. Accept labelled normal graph.
⇒P(X > 1000) = 0.217 A1
[2 marks]
Y ~ B(24,0.217...) (M1)
Note: Award M1 for recognition of binomial distribution with parameters.
P(Y ≤ 10) − P(Y ≤ 4) (M1)
Note: Award M1 for an attempt to find P(5 ≤ Y ≤ 10) or P(Y ≤ 10) − P(Y ≤ 4).
= 0.613 A1
[3 marks]
Examiners report
The number of bananas that Lucca eats during any particular day follows a Poisson distribution with mean 0.2.
Find the probability that Lucca eats at least one banana in a particular day.
Find the expected number of weeks in the year in which Lucca eats no bananas.
Markscheme
* This question is from an exam for a previous syllabus, and may contain minor differences in marking or structure.
let be the number of bananas eaten in one day
(M1)
A1
[2 marks]
EITHER
let be the number of bananas eaten in one week
(A1)
(A1)
OR
let be the number of days in one week at least one banana is eaten
(A1)
(A1)
THEN
(M1)
A1
[4 marks]
Examiners report
Events and are such that and .
Find .
Find .
Hence show that events and are independent.
Markscheme
* This question is from an exam for a previous syllabus, and may contain minor differences in marking or structure.
(M1)
A1
[2 marks]
(M1)
A1
[2 marks]
METHOD 1
A1
R1
hence and are independent AG
Note: If there is evidence that the student has calculated by assuming independence in the first place, award A0R0.
METHOD 2
EITHER
A1
OR
A1
THEN
and are independent R1
hence and are independent AG
METHOD 3
A1
R1
hence and are independent AG
[2 marks]
Examiners report
A horse breeder records the number of births for each of 100 horses during the past eight years. The results are summarized in the following table:
Stating null and alternative hypotheses carry out an appropriate test at the 5% significance level to decide whether the results can be modelled by B (6, 0.5).
Without doing any further calculations, explain briefly how you would carry out a test, at the 5% significance level, to decide if the data can be modelled by B(6, ), where is unspecified.
A different horse breeder collected data on the time and outcome of births. The data are summarized in the following table:
Carry out an appropriate test at the 5% significance level to decide whether there is an association between time and outcome.
Markscheme
METHOD 1
H0: distribution is B(6, 0.5); H1: distribution is not B(6, 0.5) A1
A3
Combining the first two columns and the last two columns: A1
(M1)
= 5.22 A1
= 4, so critical value of A1A1
Since 5.22 < 9.488 the result is not significant and we accept H0 R1
METHOD 2
H0: distribution is B(6, 0.5); H1: distribution is not B(6, 0.5) A1
By GDC, A8
Since 0.266 > 0.05 the result is not significant and we accept H0 R1
[10 marks]
Estimate from the data which would entail the loss of one degree of freedom A1A1
[2 marks]
H0: there is no association H1: there is an association A1
A2
(M1)
= 15.7
A1A1
Since 15.7 > 7.815 we reject H0 R1
METHOD 2
H0: there is no association H1: there is an association A1
By GDC, = 0.00129 A6
Since 0.00129 < 0.05 we reject H0. R1
[8 marks]
Examiners report
Each of the 25 students in a class are asked how many pets they own. Two students own three pets and no students own more than three pets. The mean and standard deviation of the number of pets owned by students in the class are and respectively.
Find the number of students in the class who do not own a pet.
Markscheme
* This question is from an exam for a previous syllabus, and may contain minor differences in marking or structure.
METHOD 1
let p have no pets, q have one pet and r have two pets (M1)
p + q + r + 2 = 25 (A1)
0p + 1q + 2r + 6 = 18 A1
Note: Accept a statement that there are a total of 12 pets.
attempt to use variance equation, or evidence of trial and error (M1)
(A1)
attempt to solve a system of linear equations (M1)
p = 14 A1
METHOD 2
(M1)
(A1)
A1
(M1)(A1)
(M1)
A1
so 14 have no pets
[7 marks]
Examiners report
Iqbal attempts three practice papers in mathematics. The probability that he passes the first paper is 0.6. Whenever he gains a pass in a paper, his confidence increases so that the probability of him passing the next paper increases by 0.1. Whenever he fails a paper the probability of him passing the next paper is 0.6.
Complete the given probability tree diagram for Iqbal’s three attempts, labelling each branch with the correct probability.
Calculate the probability that Iqbal passes at least two of the papers he attempts.
Find the probability that Iqbal passes his third paper, given that he passed only one previous paper.
Markscheme
* This question is from an exam for a previous syllabus, and may contain minor differences in marking or structure.
A1A1A1
Note: Award A1 for each correct column of probabilities.
[3 marks]
probability (at least twice) =
EITHER
(M1)
OR
(M1)
Note: Award M1 for summing all required probabilities.
THEN
= 0.696 A1
[2 marks]
P(passes third paper given only one paper passed before)
(M1)
A1
= 0.657 A1
[3 marks]
Examiners report
There are 75 players in a golf club who take part in a golf tournament. The scores obtained on the 18th hole are as shown in the following table.
One of the players is chosen at random. Find the probability that this player’s score was 5 or more.
Calculate the mean score.
Markscheme
* This question is from an exam for a previous syllabus, and may contain minor differences in marking or structure.
(M1)A1
[2 marks]
(M1)
A1
[2 marks]
Examiners report
Jorge is carefully observing the rise in sales of a new app he has created.
The number of sales in the first four months is shown in the table below.
Jorge believes that the increase is exponential and proposes to model the number of sales in month with the equation
Jorge plans to adapt Euler’s method to find an approximate value for .
With a step length of one month the solution to the differential equation can be approximated using Euler’s method where
Jorge decides to take the mean of these values as the approximation of for his model. He also decides the graph of the model should pass through the point .
The sum of the square residuals for these points for the least squares regression model is approximately .
Show that Jorge’s model satisfies the differential equation
Show that
Hence find three approximations for the value of .
Find the equation for Jorge’s model.
Find the sum of the square residuals for Jorge’s model using the values .
Comment how well Jorge’s model fits the data.
Give two possible sources of error in the construction of his model.
Markscheme
* This sample question was produced by experienced DP mathematics senior examiners to aid teachers in preparing for external assessment in the new MAA course. There may be minor differences in formatting compared to formal exam papers.
(M1)A1
Note: M1 is for an attempt to find
AG
Note: Accept solution of the differential equation by separating variables
[2 marks]
M1
M1A1
AG
Note: Do not penalize the use of the sign.
[3 marks]
Correct method (M1)
A2
Note: A1 for a single error A0 for two or more errors.
[3 marks]
or A1
(M1)
A1
[3 marks]
(M1)
A1
[2 marks]
The sum of the square residuals is approximately times as large as the minimum possible, so Jorge’s model is unlikely to fit the data exactly R1
[1 mark]
For example
Selecting a single point for the curve to pass through
Approximating the gradient of the curve by the gradient of a chord R1R1
[2 marks]
Examiners report
A random variable has a probability distribution given in the following table.
Determine the value of .
Find the value of .
Markscheme
* This question is from an exam for a previous syllabus, and may contain minor differences in marking or structure.
(M1)A1
[2 marks]
METHOD 1
(M1)(A1)
A1
METHOD 2
(A1)
use of (M1)
Note: Award (M1) only if is used correctly.
A1
Note: Accept 2.11.
METHOD 3
(A1)
use of (M1)
A1
[3 marks]
Examiners report
When carpet is manufactured, small faults occur at random. The number of faults in Premium carpets can be modelled by a Poisson distribution with mean 0.5 faults per 20m2. Mr Jones chooses Premium carpets to replace the carpets in his office building. The office building has 10 rooms, each with the area of 80m2.
Find the probability that the carpet laid in the first room has fewer than three faults.
Find the probability that exactly seven rooms will have fewer than three faults in the carpet.
Markscheme
* This question is from an exam for a previous syllabus, and may contain minor differences in marking or structure.
(M1)
(A1)
A1
[3 marks]
(M1)(A1)
A1
Note: Award M1 for clear recognition of binomial distribution.
[3 marks]
Examiners report
The hens on a farm lay either white or brown eggs. The eggs are put into boxes of six. The farmer claims that the number of brown eggs in a box can be modelled by the binomial distribution, B(6, ). By inspecting the contents of 150 boxes of eggs she obtains the following data.
Show that this data leads to an estimated value of .
Stating null and alternative hypotheses, carry out an appropriate test at the 5 % level to decide whether the farmer’s claim can be justified.
Markscheme
from the sample, the probability of a brown egg is
A1
AG
[1 mark]
if the data can be modelled by a binomial distribution with , the expected frequencies of boxes are given in the table
A3
Notes: Deduct one mark for each error or omission.
Accept any rounding to at least one decimal place.
null hypothesis: the distribution is binomial A1
alternative hypothesis: the distribution is not binomial A1
for a chi-squared test the last two columns should be combined R1
(Accept 6.06) (M1)A1
degrees of freedom = 4 A1
critical value = 9.488 A1
Or use of -value
we conclude that the farmer’s claim can be justified R1
[11 marks]
Examiners report
The number of taxis arriving at Cardiff Central railway station can be modelled by a Poisson distribution. During busy periods of the day, taxis arrive at a mean rate of 5.3 taxis every 10 minutes. Let T represent a random 10 minute busy period.
Find the probability that exactly 4 taxis arrive during T.
Find the most likely number of taxis that would arrive during T.
Given that more than 5 taxis arrive during T, find the probability that exactly 7 taxis arrive during T.
During quiet periods of the day, taxis arrive at a mean rate of 1.3 taxis every 10 minutes.
Find the probability that during a period of 15 minutes, of which the first 10 minutes is busy and the next 5 minutes is quiet, that exactly 2 taxis arrive.
Markscheme
(M1)
= 0.164 A1
[2 marks]
METHOD 1
listing probabilities (table or graph) M1
mode X = 5 (with probability 0.174) A1
Note: Award M0A0 for 5 (taxis) or mode = 5 with no justification.
METHOD 2
mode is the integer part of mean R1
E(X) = 5.3 ⇒ mode = 5 A1
Note: Do not allow R0A1.
[2 marks]
attempt at conditional probability (M1)
or equivalent A1
= 0.267 A1
[3 marks]
METHOD 1
the possible arrivals are (2,0), (1,1), (0,2) (A1)
A1
attempt to compute, using sum and product rule, (M1)
0.070106… × 0.52204… + 0.026455… × 0.33932… + 0.0049916… × 0.11028… (A1)(A1)
Note: Award A1 for one correct product and A1 for two other correct products.
= 0.0461 A1
[6 marks]
METHOD 2
recognising a sum of 2 independent Poisson variables eg Z = X + Y R1
P(Z = 2) = 0.0461 (M1)A3
[6 marks]
Examiners report
The age, L, in years, of a wolf can be modelled by the normal distribution L ~ N(8, 5).
Find the probability that a wolf selected at random is at least 5 years old.
Eight wolves are independently selected at random and their ages recorded.
Find the probability that more than six of these wolves are at least 5 years old.
Markscheme
* This question is from an exam for a previous syllabus, and may contain minor differences in marking or structure.
P(L ≥ 5) = 0.910 (M1)A1
[2 marks]
X is the number of wolves found to be at least 5 years old recognising binomial distribution M1
X ~ B(8, 0.910…)
P(X > 6) = 1 − P(X ≤ 6) (M1)
= 0.843 A1
Note: Award M1A0 for finding P(X ≥ 6).
[3 marks]
Examiners report
Consider two events and such that and .
Calculate ;
Find .
Markscheme
* This question is from an exam for a previous syllabus, and may contain minor differences in marking or structure.
use of M1
A1
A1
Note: Do not award the final A1 if two solutions are given.
[3 marks]
use of or alternative (M1)
(A1)
A1
[3 marks]
Examiners report
The mean number of squirrels in a certain area is known to be 3.2 squirrels per hectare of woodland. Within this area, there is a 56 hectare woodland nature reserve. It is known that there are currently at least 168 squirrels in this reserve.
Assuming the population of squirrels follow a Poisson distribution, calculate the probability that there are more than 190 squirrels in the reserve.
Markscheme
* This question is from an exam for a previous syllabus, and may contain minor differences in marking or structure.
X is number of squirrels in reserve
X ∼ Po(179.2) A1
Note: Award A1 if 179.2 or 56 × 3.2 seen or implicit in future calculations.
recognising conditional probability M1
P(X > 190 | X ≥ 168)
(A1)(A1)
= 0.245 A1
[5 marks]