Poisson Distribution (Business)

The Poisson Distribution

Let $X$ denote the number of events (“successes”) that occur within a specified time interval. When the statements are true:

  • There is no natural upper limit to the number of trials.
  • Events occur independently, at an average rate $\lambda$ ($\lambda$ is a Greek letter and is pronounced 'lamb-da') within a specified time interval

we say that $X$ follows a Poisson Distribution and write and we write $X\sim\mathrm{Po}(\lambda)$.

The Poisson distribution is a useful general model for count data. For example:

  • The number of people entering a supermarket per hour.
  • The number of Polish cavalry officers kicked by their horses each year.
  • The number of cars passing a point on a road every minute.

The probability of $r$ events in a specified interval can be calculated using the formula:

\begin{equation} \mathrm{P}(X=r) = \dfrac{\lambda^r \times e^{-\lambda}}{r!}, r = 0,1,... \end{equation}

Where $ e$ is the exponential function and $r!$ is $r$ factorial.

Expectation and Variance

If $X\sim\mathrm{Po}(\lambda)$ then the expectation and variance of $X$ are given by:

\begin{equation} \begin{split} \mathrm{E}[X]&&=\lambda\\ \mathrm{Var(X)}&&=\lambda\\ \end{split} \end{equation}

Worked Example 1

Worked Example

A company selling personalised T-shirts owns a website where customers can purchase T-shirts online at a price of £$15$ per T-shirt and a delivery charge of £$2.50$ per T-shirt to anywhere within the UK (the company does not currently ship its T-shirts outside of the UK). It is believed that on average $2$ orders arrive per minute. If the number of orders received in a minute exceeds $5$, then the website crashes, meaning the website becomes inaccessible to the public for ten minutes.

(A) What is the value of $\lambda$?

(B) Calculate the probability that the website crashes at between $10:30$am and $10:31$am tomorrow morning given that the website is accessible to the public at $10:30$am.

(C) Calculate the probability that the number of orders in any one-minute interval is the expected number of orders.

(D) What is the variance of the number of orders per minute?

Solution

The number of orders arriving at the website per minute is modelled by a $\mathrm{Poisson}(2)$ distribution (since we have an average of $2$ “successes” within a specified time limit.)

(A) Recall that for a Poisson distribution $\lambda$ is the expectation or average rate of successes (arrivals of orders). In this example we have been told that the average number of orders per minute is 2. We therefore have $\lambda = 2$.

(B) The probability that the website crashes between $10:30$am and $10:31$am morning is the same as the probability of the website crashing in any one minute interval.

Let $X$ denote the number of orders in a one-minute interval. We know that the website crashes if the number of orders exceeds 5 in one minute. We thus have:

$\mathrm{P}$(website crashes between $10:30$am and $10:31$am) $=\mathrm{P}(X \gt 5)$, where:

\begin{align} \mathrm{P}(X \gt 5) &=1-\mathrm{P}(X \leq 5)\\ &=1-(\mathrm{P}(X =0) + \mathrm{P}(X =1) + \mathrm{P}(X =2) + \mathrm{P}(X =3) + \mathrm{P}(X =4))\\ &=1-\left(\dfrac{~2^0 \times e^{-2}~}{0!} + \dfrac{~2^1 \times e^{-2}~}{1!} + \dfrac{~2^2 \times e^{-2}~}{2!} + \dfrac{~2^3 \times e^{-2}~}{3!} + \dfrac{~2^4 \times e^{-2}~}{4!}\right)\\ &=0.095 \text{ (to 3 d.p.)}.\\ \end{align}

Note: in this example we were able to sum the probabilities to obtain $\mathrm{P}(X \leq 5)$: i.e.

\[\mathrm{P}(X \leq 5)=\mathrm{P}(X =0) + \mathrm{P}(X =1) + \mathrm{P}(X =2) + \mathrm{P}(X =3) + \mathrm{P}(X =4)\]

because the Poisson is a discrete probability distribution.

(C) We have:

\begin{align} \mathrm{P}(X= \lambda)&=\mathrm{P}(X =2)\\ &=\dfrac{~2^2 \times e^{-2}~}{2!}\\ &=0.27 \text{ (to 2 d.p.)}. \end{align}

(D) Using the formula for the variance of a Poisson distribution we have:

\begin{align} \mathrm{Var(X)}&=\lambda\\ &=2. \end{align}

What does a Poisson distribution look like?

Below are bar plots of a $\mathrm{Poisson}(2)$ distribution and a $\mathrm{Poisson}(5)$ distribution.

|center

|center

Worked Example 2

Worked Example

At a call centre the average number of calls received per minute is $7$.

Calculate the probability that, in the next minute, there are:

(A) four calls.

(B) eight calls.

(C) zero calls.

(D) at most three calls.

(E) What is the expected number of calls in a minute?

Solution

(A) Let $X$ denote the number of calls received by the call centre in a minute.

The number of calls received by the call centre per minute follows an average rate of $7$. Therefore $X\sim\mathrm{Po}(7)$ and so:

\begin{align} \mathrm{P}(X=4) &= \dfrac{~7^4 \times e^{-7}~}{4!}\\ &=0.091\text{(3 d.p.)}\\ \end{align}

(B) Now we need to calculate $\mathrm{P}(X=8)$.

\begin{align} \mathrm{P}(X=8) &= \dfrac{~7^8 \times e^{-7}~}{8!}\\ &=0.130\text{ (3 d.p.)}\\ \end{align}

(C) Now we need to calculate $\mathrm{P}(X=0)$. \begin{align} \mathrm{P}(X=0) &= \dfrac{~7^0\times e^{-7}~}{0!}\\ &=e^{-7}\\ &=0.001\text{ (3 d.p.)}\\ \end{align}

(D)

\begin{align} \mathrm{P}(X\leq3) &= \mathrm{P}(Y=0) + \mathrm{P}(Y=1) + \mathrm{P}(Y=2) + \mathrm{P}(Y=3)\\ &= \dfrac{~7^0 \times e^{-7}~}{0!} + \dfrac{~7^1 \times e^{-7}~}{1!} + \dfrac{~7^2 \times e^{-7}~}{2!} + \dfrac{~7^3 \times e^{-7}~}{3!}\\ &= 0.001+0.006+0.022+0.052\\ &= 0.082 \text{ (3 d.p.)}\\ \end{align}

(E) The expected number of calls per minute is the same as the rate for the Poisson Distribution, $7$.

Note: There are also Poisson tables available which can tell us the cumulative probabilities, which is useful for when $r$ is relatively large, for example, $P(X \leq 12)$.

Using the Poisson Distribution as an Approximation to the Binomial Distribution

When the value of $n$ is very large it can be very tedious and time-consuming to calculate probabilities using the Binomial distribution. However, for very large $n$ and very small $p$ the Poisson distribution can be used as an approximation to the Binomial distribution, where larger values of $n$ and smaller values of $p$ result in a better approximation.

The Poisson approximation to the Binomial distribution has mean $\lambda=np$ (where $n$ is the number of trials and $p$ is the probability of success for a Binomial distribution) and the same variance $\lambda=np$. Thus we have:

\begin{align} \mathrm{P}(X = r)& \cong \dfrac{(np)^r \times e^{-np}}{r!}, r = 0,1,...\\ \end{align}

Worked Example 3

Worked Example

Suppose in a small town there are $5000$ business and a fire occurs in any given business with probability $0.001$ over a decade. What's the probability that there is at most one fire in the next decade?

Solution

Let $X$ denote the number of fires in the city in the next decade. We wish to find $\mathrm{P}(X\leq1)$.

First we shall calculate this using the binomial distribution.

\begin{align} \mathrm{P}(X\leq1) &= \mathrm{P}(X = 0) + \mathrm{P}(X = 1)\\ &= {}^{5000}C_0 \times 0.001^{0} \times 0.999^{5000} + {}^{5000}C_1 \times 0.001^{1} \times 0.999^{4999}\\ &= 0.00672+0.03364\\ &= 0.040\text{ (3 d.p.)}\\ \end{align}

Now we calculate using the Poisson approximation to this Binomial distribution. Here, $\lambda = \mathrm{E}[X] = n \times p = 5000 \times 0.001 = 5$ so $X\sim\mathrm{Po}(5)$.

\begin{align} \mathrm{P}(X\leq1) &= \mathrm{P}(X=0) + \mathrm{P}(X=1)\\ &= \dfrac{~5^0 e ^{-5}~}{0!} + \dfrac{~5^1 e ^{-5}~}{1!}\\ &= 0.00674 + 0.03369\\ &= 0.040\text{ (3 d.p.)}. \end{align}

As we would expect, the two methods give the same result (to 3 d.p), which shows that in this instance the Poisson distribution is a good approximation to the Binomial distribution .

An insurance company could use this approximation to calculate how likely it is a business will need to claim for fire damage or the expected number of claims likely to be made.

Worked Example 4

Worked Example

At a chocolate factory in Slough with 120 production workers, there is a $10$% chance that a worker will be absent on any given day. The probability that one worker is assumed not to affect the probability that another is absent. The factory is able to operate on any given day as long as there are no more than 50 workers absent on that day. What is the probability that any $2$ out of $9$ randomly chosen workers will be absent next Monday?

Solution

Now we have seen how to calculate probabilities with the Binomial distribution, we can go back to solving the Business example from earlier.

The manager needs to know how likely it is that the performance rates will drop to a poor standard. He needs to know the probability that there will be five or more absences caused by the Norovirus.

Let $X$ be the number of people who catch the Norovirus. We know from above that there are $10$ people in the office who could catch the Norovirus with probability, $p = 0.6$. So we know that $X$ ~ $\mathrm{Bin}(10,0.6).$

\begin{align} \mathrm{P}(X \geq 5) &= 1 - P(X \leq 4)\\ &= 1 - \big( \mathrm{P}(X = 0) + \mathrm{P}(X = 1) + \mathrm{P}(X = 2) + \mathrm{P}(X = 3) + \mathrm{P}(X = 4) \big)\\ &= 1 - \big( ({}^{10}C_0 \times 0.6^0 \times 0.4^{(10-0)}) + ({}^{10}C_1 \times 0.6^1 \times 0.4^{(10-1)}) + ({}^{10}C_2 \times 0.6^2 \times 0.4^{(10-2)})\\ &\quad + \; ({}^{10}C_3 \times 0.6^3 \times 0.4^{(10-3)}) + ({}^{10}C_4 \times 0.6^4 \times 0.4^{(10-4)}) \big)\\ &= 1 - \big( (0.4^10) + (10 \times 0.6 \times 0.4^9) + (45 \times 0.6^2 \times 0.4^8) + (120 \times 0.6^3 \times 0.4^7)\\ &\quad + (210 \times 0.6^4 \times 0.4^6)\big)\\ &= 1 - \big(0.0001 + 0.0016 + 0.0106 + 0.0425 + 0.1115)\big)\text{ (4 d.p.)}\\ &= 1 - 0.1662\text{ (4 d.p.)}\\ &= 0.8338 \text{ (4 d.p.)}\\ \end{align}

There is a high probability that more than $5$ people will be absent due to the illness. Therefore, it would be wise for the manager to request for help from other branches. Here, the expected number of people who catch the virus is $n \times p$, which is $10 \times 0.6 = 6$. This also suggests that the manager should request help for his department.

Test Yourself

To practice questions on the Binomial distribution and Poisson distribution click the following link:

Numbas test.

See Also

For information on continuous probability distributions see the other pages on continuous probability distributions.