User interface language: English | Español

HL Paper 1

The function  p ( x ) is defined by p(x)=x33x2+8x24 where x R .

Find the remainder when p ( x ) is divided by  ( x 2 ) .

[2]
a.i.

Find the remainder when p ( x ) is divided by  ( x 3 ) .

[1]
a.ii.

Prove that  p ( x ) has only one real zero.

[4]
b.

Write down the transformation that will transform the graph of  y = p ( x ) onto the graph of y=8x312x2+16x24.

[2]
c.

The random variable X follows a Poisson distribution with a mean of λ and  6 P ( X = 3 ) = 3 P ( X = 2 ) 2 P ( X = 1 ) + 3 P ( X = 0 ) .

Find the value of  λ .

[6]
d.



The following table shows the time, in days, from December 1st and the percentage of Christmas trees in stock at a shop on the beginning of that day.

The following table shows the natural logarithm of both d and x on these days to 2 decimal places.

Use the data in the second table to find the value of m and the value of b for the regression line, lnx=m(lnd)+b.

 

[2]
a.

Assuming that the model found in part (a) remains valid, estimate the percentage of trees in stock when d=25.

[3]
b.



The weights of apples from Tony’s farm follow a normal distribution with mean 158 g and standard deviation 13 g. The apples are sold in bags that contain six apples.

Find the mean weight of a bag of apples.

[2]
a.

Find the standard deviation of the weights of these bags of apples.

[2]
b.

Find the probability that a bag selected at random weighs more than 1kg.

[2]
c.



A manufacturer of chocolates produces them in individual packets, claiming to have an average of 85 chocolates per packet.

Talha bought 30 of these packets in order to check the manufacturer’s claim.

Given that the number of individual chocolates is x, Talha found that, from his 30 packets, Σx=2506 and Σx2=209738.

Find an unbiased estimate for the mean number (μ) of chocolates per packet.

[1]
a.

Use the formula sn-12=Σx2-Σx2nn-1 to determine an unbiased estimate for the variance of the number of chocolates per packet.

[2]
b.

Find a 95% confidence interval for μ. You may assume that all conditions for a confidence interval have been met.

[2]
c.

Suggest, with justification, a valid conclusion that Talha could make.

[1]
d.



A factory, producing plastic gifts for a fast food restaurant’s Jolly meals, claims that just 1% of the toys produced are faulty.

A restaurant manager wants to test this claim. A box of 200 toys is delivered to the restaurant. The manager checks all the toys in this box and four toys are found to be faulty.

The restaurant manager performs a one-tailed hypothesis test, at the 10% significance level, to determine whether the factory’s claim is reasonable. It is known that faults in the toys occur independently.

Identify the type of sampling used by the restaurant manager.

[1]
a.

Write down the null and alternative hypotheses.

[2]
b.

Find the p-value for the test.

[2]
c.

State the conclusion of the test. Give a reason for your answer.

[2]
d.



Saloni wants to find a model for the temperature of a bottle of water after she removes it from the fridge. She uses a temperature probe to record the temperature of the water, every 5 minutes.

After graphing the data, Saloni believes a suitable model will be

T = 28 a b t , where  a , b R + .

Explain why 28 T can be modeled by an exponential function.

[1]
a.

Find the equation of the least squares exponential regression curve for  28 T .

[3]
b.

Write down the coefficient of determination,  R 2 .

[1]
c.i.

Interpret what the value of R 2  tells you about the model.

[1]
c.ii.

Hence predict the temperature of the water after 3 minutes.

[2]
d.



A newspaper vendor in Singapore is trying to predict how many copies of The Straits Times they will sell. The vendor forms a model to predict the number of copies sold each weekday. According to this model, they expect the same number of copies will be sold each day.

To test the model, they record the number of copies sold each weekday during a particular week. This data is shown in the table.

A goodness of fit test at the 5% significance level is used on this data to determine whether the vendor’s model is suitable. The critical value for the test is 9.49.

Find an estimate for how many copies the vendor expects to sell each day.

[1]
a.

State the null and alternative hypotheses for this test.

[2]
b.i.

Write down the degrees of freedom for this test.

[1]
b.ii.

Write down the conclusion to the test. Give a reason for your answer.

[4]
b.iii.



Two unbiased tetrahedral (four-sided) dice with faces labelled 1, 2, 3, 4 are thrown and the scores recorded. Let the random variable T be the maximum of these two scores.

The probability distribution of T is given in the following table.

Find the value of a and the value of b.

[3]
a.

Find the expected value of T.

[2]
b.



The cars for a fairground ride hold four people. They arrive at the platform for loading and unloading every 30 seconds.

During the hour from 9 am the arrival of people at the ride in any interval of t minutes can be modelled by a Poisson distribution with a mean of 9t0<t<60.

When the 9 am car leaves there is no one in the queue to get on the ride.

Shunsuke arrives at 9.01 am.

Find the probability that more than 7 people arrive at the ride before Shunsuke.

[2]
a.

Find the probability there will be space for him on the 9.01 car.

[6]
b.



Chloe and Selena play a game where each have four cards showing capital letters A, B, C and D.
Chloe lays her cards face up on the table in order A, B, C, D as shown in the following diagram.

N17/5/MATHL/HP1/ENG/TZ0/10

Selena shuffles her cards and lays them face down on the table. She then turns them over one by one to see if her card matches with Chloe’s card directly above.
Chloe wins if no matches occur; otherwise Selena wins.

Chloe and Selena repeat their game so that they play a total of 50 times.
Suppose the discrete random variable X represents the number of times Chloe wins.

Show that the probability that Chloe wins the game is 3 8 .

[6]
a.

Determine the mean of X.

[3]
b.i.

Determine the variance of X.

[2]
b.ii.



A company produces bags of sugar with a labelled weight of 1kg. The weights of the bags are normally distributed with a mean of 1kg and a standard deviation of 100g. In an inspection, if the weight of a randomly chosen bag is less than 950g then the company fails the inspection.

Find the probability that the company fails the inspection.

[2]
a.

A statistician in the company suggests it would be fairer if the company passes the inspection when the mean weight of five randomly chosen bags is greater than 950g.

Find the probability of passing the inspection if the statistician’s suggestion is followed.

[4]
b.



The discrete random variable X has the following probability distribution, where p is a constant.

Find the value of p.

[2]
a.

Find μ, the expected value of X.

[2]
b.i.

Find P(X > μ).

[2]
b.ii.



The sex of cuttlefish is difficult to determine visually, so it is often found by weighing the cuttlefish.

The weights of adult male cuttlefish are known to be normally distributed with mean 10kg and standard deviation 0.5kg.

The weights of adult female cuttlefish are known to be normally distributed with mean 12kg and standard deviation 1kg.

A zoologist uses the null hypothesis that in the absence of information a cuttlefish is male.

If the weight is found to be above 11.5kg the cuttlefish is classified as female.

90% of adult cuttlefish are male.

Find the probability of making a Type I error when weighing a male cuttlefish.

[2]
a.

Find the probability of making a Type II error when weighing a female cuttlefish.

[2]
b.

Find the probability of making an error using the zoologist’s method.

[2]
c.



Consider two events, A and B , such that P ( A ) = P ( A B ) = 0.4  and  P ( A B ) = 0.1 .

By drawing a Venn diagram, or otherwise, find P ( A B ) .

[3]
a.

Show that the events A and B are not independent.

[3]
b.



It is believed that the power P of a signal at a point d km from an antenna is inversely proportional to dn where n+.

The value of P is recorded at distances of 1m to 5m and the values of log10d and log10P are plotted on the graph below.

The values of log10d and log10P are shown in the table below.

Explain why this graph indicates that P is inversely proportional to dn.

 

[2]
a.

Find the equation of the least squares regression line of log10P against log10d.

 

[2]
b.

Use your answer to part (b) to write down the value of n to the nearest integer.

[1]
c.i.

Find an expression for P in terms of d.

[2]
c.ii.



The number of fish that can be caught in one hour from a particular lake can be modelled by a Poisson distribution.

The owner of the lake, Emily, states in her advertising that the average number of fish caught in an hour is three.

Tom, a keen fisherman, is not convinced and thinks it is less than three. He decides to set up the following test. Tom will fish for one hour and if he catches fewer than two fish he will reject Emily’s claim.

State a suitable null and alternative hypotheses for Tom’s test.

[1]
a.

Find the probability of a Type I error.

[2]
b.

The average number of fish caught in an hour is actually 2.5.

Find the probability of a Type II error.

[3]
c.



The number of cars passing a certain point in a road was recorded during 80 equal time intervals and summarized in the table below.

Carry out a χ 2  goodness of fit test at the 5% significance level to decide if the above data can be modelled by a Poisson distribution.




A zoologist believes that the number of eggs laid in the Spring by female birds of a certain breed follows a Poisson law. She observes 100 birds during this period and she produces the following table.

The zoologist wishes to determine whether or not a Poisson law provides a suitable model.

Calculate the mean number of eggs laid by these birds.

[2]
a.

Write down appropriate hypotheses.

[2]
b.i.

Carry out a test at the 1% significance level, and state your conclusion.

[14]
b.ii.



Product research leads a company to believe that the revenue ( R ) made by selling its goods at a price ( p ) can be modelled by the equation.

R ( p ) = c p e d p c d R

There are two competing models, A and B with different values for the parameters c and d

Model A has c = 3,  d = −0.5 and model B has c = 2.5, d = −0.6.

The company experiments by selling the goods at three different prices in three similar areas and the results are shown in the following table.

The company will choose the model with the smallest value for the sum of square residuals.

Determine which model the company chose.




Mr Burke teaches a mathematics class with 15 students. In this class there are 6 female students and 9 male students.

Each day Mr Burke randomly chooses one student to answer a homework question.

In the first month, Mr Burke will teach his class 20 times.

Find the probability he will choose a female student 8 times.

[2]
a.

The Head of Year, Mrs Smith, decides to select a student at random from the year group to read the notices in assembly. There are 80 students in total in the year group. Mrs Smith calculates the probability of picking a male student 8 times in the first 20 assemblies is 0.153357 correct to 6 decimal places.

Find the number of male students in the year group.

[4]
b.



A company sends a group of employees on a training course. Afterwards, they survey these employees to gather data on the effectiveness of the training. In order to test the reliability of the survey, they design two sets of similar questions, which are given to the employees one week apart.

The questions in the survey were grouped in different sections. The mean scores of the employees on the first section of each survey are given in the table.

State the name of this test for reliability.

[1]
a.

State a possible disadvantage of using this test for reliability.

[1]
b.

Calculate Pearson’s product moment correlation coefficient for this data.

[2]
c.

Hence determine, with a reason, if the survey is reliable.

[2]
d.



On Paul’s farm, potatoes are packed in sacks labelled 50kg. The weights of the sacks of potatoes can be modelled by a normal distribution with mean weight 49.8kg and standard deviation 0.9kg.

Find the probability that a sack is under its labelled weight.

[2]
a.

Find the lower quartile of the weights of the sacks of potatoes.

[2]
b.

The sacks of potatoes are transported in crates. There are 10 sacks in each crate and the weights of the sacks of potatoes are independent of each other.

Find the probability that the total weight of the sacks of potatoes in a crate exceeds 500kg.

[3]
c.



Observations on 12 pairs of values of the random variables X, Y yielded the following results.

x=76.3, x2=563.7, y=72.2, y2=460.1, xy=495.4

Calculate the value of r, the product moment correlation coefficient of the sample.

[3]
a.i.

Assuming that the distribution of X, Y is bivariate normal with product moment correlation coefficient ρ, calculate the p-value of your result when testing the hypotheses H0: ρ=0 ; H1: ρ>0.

[3]
a.ii.

State whether your p-value suggests that X and Y are independent.

[1]
a.iii.

Given a further value x=5.2 from the distribution of X, Y, predict the corresponding value of y. Give your answer to one decimal place.

[3]
b.



The continuous random variable X has a probability density function given by

f ( x ) = { k sin ( π x 6 ) , 0 x 6 0 , otherwise .

Find the value of k .

[4]
a.

By considering the graph of f write down the mean of X ;

[1]
b.i.

By considering the graph of f write down the median of X ;

[1]
b.ii.

By considering the graph of f write down the mode of X .

[1]
b.iii.

Show that P ( 0 X 2 ) = 1 4 .

[4]
c.i.

Hence state the interquartile range of X .

[2]
c.ii.

Calculate P ( X 4 | X 3 ) .

[2]
d.



The number of coffees sold per hour at an independent coffee shop is modelled by a Poisson distribution with a mean of 22 coffees per hour.

Sheila, the shop’s owner wants to increase the number of coffees sold in the shop. She decides to offer a discount to customers who buy more than one coffee.

To test how successful this strategy is, Sheila records the number of coffees sold over a single 5-hour period. Sheila decides to use a 5% level of significance in her test.

State the null and alternative hypotheses for the test.

[1]
a.

Find the probability that Sheila will make a type I error in her test conclusion.

[4]
b.

Sheila finds 126 coffees were sold during the 5-hour period.

State Sheila’s conclusion to the test. Justify your answer.

[2]
c.



The heights, x metres, of the 241 new entrants to a men’s college were measured and the following statistics calculated.

x = 412.11 , x 2 = 705.5721

The Head of Mathematics decided to use a χ 2  test to determine whether or not these heights could be modelled by a normal distribution. He therefore divided the data into classes as follows.

Calculate unbiased estimates of the population mean and the population variance.

[3]
a.

State suitable hypotheses.

[1]
b.i.

Calculate the value of the χ 2  statistic and state your conclusion using a 10% level of significance.

[11]
b.ii.



The graph below shows a small maze, in the form of a network of directed routes. The vertices A to F show junctions in the maze and the edges show the possible paths available from one vertex to another.

A mouse is placed at vertex A and left to wander the maze freely. The routes shown by dashed lines indicate paths sprinkled with sugar.

When the mouse reaches any junction, she rests for a constant time before continuing.

At any junction, it may also be assumed that

Determine the transition matrix for this graph.

[3]
a.

If the mouse was left to wander indefinitely, use your graphic display calculator to estimate the percentage of time that the mouse would spend at point F.

[3]
b.

Comment on your answer to part (b), referring to at least one limitation of the model.

[2]
c.



Eggs at a farm are sold in boxes of six. Each egg is either brown or white. The owner believes that the number of brown eggs in a box can be modelled by a binomial distribution. He examines 100 boxes and obtains the following data.

Calculate the mean number of brown eggs in a box.

[1]
a.i.

Hence estimate p , the probability that a randomly chosen egg is brown.

[1]
a.ii.

By calculating an appropriate χ 2 statistic, test, at the 5% significance level, whether or not the binomial distribution gives a good fit to these data.

[8]
b.



Adesh wants to model the cooling of a metal rod. He heats the rod and records its temperature as it cools.

He believes the temperature can be modeled by  T ( t ) = a e b t + 25 , where a , b R .

Hence

Show that  ln ( T 25 ) = b t + ln a .

[2]
a.

Find the equation of the regression line of  ln ( T 25 ) on  t .

[3]
b.

find the value of a  and of b .

[3]
c.i.

predict the temperature of the metal rod after 3 minutes.

[2]
c.ii.



Katie likes to cycle to work as much as possible. If Katie cycles to work one day then she has a probability of 0.2 of not cycling to work on the next work day. If she does not cycle to work one day then she has a probability of 0.4 of not cycling to work on the next work day.

Complete the following transition diagram to represent this information.

[2]
a.

Katie works for 180 days in a year.

Find the probability that Katie cycles to work on her final working day of the year.

[3]
b.



A psychologist records the number of digits (d) of π that a sample of IB Mathematics higher level candidates could recall.

The psychologist has read that in the general population people can remember an average of 4.4 digits of π. The psychologist wants to perform a statistical test to see if IB Mathematics higher level candidates can remember more digits than the general population.

H0: μ=4.4 is the null hypothesis for this test.

Find an unbiased estimate of the population mean of d.

[1]
a.

Find an unbiased estimate of the population variance of d.

[2]
b.

State the alternative hypothesis.

[1]
c.i.

Given that all assumptions for this test are satisfied, carry out an appropriate hypothesis test. State and justify your conclusion. Use a 5% significance level.

[4]
c.ii.



A manager wishes to check the mean mass of flour put into bags in his factory. He randomly samples 10 bags and finds the mean mass is 1.478 kg and the standard deviation of the sample is 0.0196 kg.

Find s n 1 for this sample.

[2]
a.

Find a 95 % confidence interval for the population mean, giving your answer to 4 significant figures.

[2]
b.

The bags are labelled as being 1.5 kg mass. Comment on this claim with reference to your answer in part (b).

[1]
c.



As part of the selection process for an engineering course at a particular university, applicants are given an exam in mathematics. This year the university has produced a new exam and they want to test if it is a valid indicator of future performance, before giving it to applicants. They randomly select 8 students in their first year of the engineering course and give them the exam. They compare the exam scores with their results in the engineering course.

The results of the 8 students are shown in the table.

State the name of this test for validity.

[1]
a.

Calculate Pearson’s product moment correlation coefficient for this data.

[2]
b.

Hence determine, with a reason, if the new exam is a valid indicator of future performance.

[2]
c.



In a coffee shop, the time it takes to serve a customer can be modelled by a normal distribution with a mean of 1.5 minutes and a standard deviation of 0.4 minutes.

Two customers enter the shop together. They are served one at a time.

Find the probability that the total time taken to serve both customers will be less than 4 minutes.

Clearly state any assumptions you have made.




2 × 2  transition matrix for a Markov chain will have the form M = ( a 1 b 1 a b ) , 0 < a < 1 , 0 < b < 1 .

Show that  λ = 1  is always an eigenvalue for M and find the other eigenvalue in terms of a and b .

[4]
a.

Find the steady state probability vector for M in terms of a and b .

[5]
b.



The random variable X has the Poisson distribution Po ( m ) . Given that P ( X > 0 ) = 3 4 , find the value of m in the form ln a where a is an integer.

[3]
a.

The random variable Y has the Poisson distribution Po ( 2 m ) . Find P ( Y > 1 ) in the form b ln c c where b and c are integers.

[4]
b.



A robot moves around the maze shown below.

Whenever it leaves a room it is equally likely to take any of the exits.

The time interval between the robot entering and leaving a room is the same for all transitions.

Find the transition matrix for the maze.

[3]
a.

A scientist sets up the robot and then leaves it moving around the maze for a long period of time.

Find the probability that the robot is in room B when the scientist returns.

[2]
b.



The number of cars arriving at a junction in a particular town in any given minute between 9:00 am and 10:00 am is historically known to follow a Poisson distribution with a mean of 5.4 cars per minute.

A new road is built near the town. It is claimed that the new road has decreased the number of cars arriving at the junction.

To test the claim, the number of cars, X, arriving at the junction between 9:00 am and 10:00 am on a particular day will be recorded. The test will have the following hypotheses:

H0: the mean number of cars arriving at the junction has not changed,
H1: the mean number of cars arriving at the junction has decreased.

The alternative hypothesis will be accepted if X300.

Assuming the null hypothesis to be true, state the distribution of X.

[1]
a.

Find the probability of a Type I error.

[2]
b.

Find the probability of a Type II error, if the number of cars now follows a Poisson distribution with a mean of 4.5 cars per minute.

[4]
c.



Consider two events A and A defined in the same sample space.

Given that P ( A B ) = 4 9 ,  P ( B | A ) = 1 3  and P ( B | A ) = 1 6 ,

Show that P ( A B ) = P ( A ) + P ( A B ) .

[3]
a.

(i)     show that P ( A ) = 1 3 ;

(ii)     hence find P ( B ) .

[6]
b.



Sue sometimes goes out for lunch. If she goes out for lunch on a particular day then the probability that she will go out for lunch on the following day is 0.4. If she does not go out for lunch on a particular day then the probability she will go out for lunch on the following day is 0.3.

Write down the transition matrix for this Markov chain.

[2]
a.

We know that she went out for lunch on a particular Sunday, find the probability that she went out for lunch on the following Tuesday.

[2]
b.

Find the steady state probability vector for this Markov chain.

[3]
c.



The number of telephone calls received by a helpline over 80 one-minute periods are summarized in the table below.

Find the exact value of the mean of this distribution.

[2]
a.

Test, at the 5% level of significance, whether or not the data can be modelled by a Poisson distribution.

[12]
b.



The diagram below shows part of the screen from a weather forecasting website showing the data for town A. The percentages on the bottom row represent the likelihood of some rain during the hour leading up to the time given. For example there is a 69% chance (a probability of 0.69) of rain falling on any point in town A between 0900 and 1000.

Paula works at a building site in the area covered by this page of the website from 0900 to 1700. She has lunch from 1300 to 1400.

In the following parts you may assume all probabilities are independent.

Paula needs to work outside between 1000 and 1300 and will also spend her lunchtime outside.

Write down the probability it rains during Paula’s lunch break.

[1]
a.

Find the probability it will not rain while Paula is outside.

[2]
b.

Find the probability it will rain at least once while Paula is outside.

[2]
c.

Given it rains at least once while Paula is outside find the probability that it rains during her lunch hour.

[3]
d.



Let X be a random variable which follows a normal distribution with mean μ . Given that  P ( X < μ 5 ) = 0.2  , find

P ( X > μ + 5 ) .

[2]
a.

P ( X < μ + 5 | X > μ 5 ) .

[5]
b.



George goes fishing. From experience he knows that the mean number of fish he catches per hour is 1.1. It is assumed that the number of fish he catches can be modelled by a Poisson distribution.

On a day in which George spends 8 hours fishing, find the probability that he will catch more than 9 fish.




The matrix M=0.2  0.70.8  0.3 has eigenvalues -5 and 1.

A switch has two states, A and B. Each second it either remains in the same state or moves according to the following rule: If it is in state A it will move to state B with a probability of 0.8 and if it is in state B it will move to state A with a probability of 0.7.

Find an eigenvector corresponding to the eigenvalue of 1. Give your answer in the form ab, where a, b.

[3]
a.

Using your answer to (a), or otherwise, find the long-term probability of the switch being in state A. Give your answer in the form cd, where c, d+.

[2]
b.



The faces of a fair six-sided die are numbered 1, 2, 2, 4, 4, 6. Let X be the discrete random variable that models the score obtained when this die is rolled.

Complete the probability distribution table for X .

N16/5/MATHL/HP1/ENG/TZ0/02.a

[2]
a.

Find the expected value of X .

[2]
b.



The strength of earthquakes is measured on the Richter magnitude scale, with values typically between 0 and 8 where 8 is the most severe.

The Gutenberg–Richter equation gives the average number of earthquakes per year, N, which have a magnitude of at least M. For a particular region the equation is

log10N=a-M, for some a.

This region has an average of 100 earthquakes per year with a magnitude of at least 3.

The equation for this region can also be written as N=b10M.

Within this region the most severe earthquake recorded had a magnitude of 7.2.

The number of earthquakes in a given year with a magnitude of at least 7.2 can be modelled by a Poisson distribution, with mean N. The number of earthquakes in one year is independent of the number of earthquakes in any other year.

Let Y be the number of years between the earthquake of magnitude 7.2 and the next earthquake of at least this magnitude.

Find the value of a.

[2]
a.

Find the value of b.

[2]
b.

Find the average number of earthquakes in a year with a magnitude of at least 7.2.

[1]
c.

Find P(Y>100).

[3]
d.



Find the coordinates of the point of intersection of the planes defined by the equations x + y + z = 3 ,   x y + z = 5 and x + y + 2 z = 6 .