Skip to main content

Statistical Distributions (Extended)

Statistical Distributions (Extended Treatment)

This document provides rigorous coverage of the binomial, normal, and Poisson distributions, their approximations, and hypothesis testing applications.

info

Always state the distribution you are using in full, including the parameter values, before calculating probabilities. For example: "XB(20,0.3)X \sim B(20, 0.3)".


1. The Binomial Distribution

1.1 Definition

A random variable XX has a binomial distribution with parameters nn and pp (written XB(n,p)X \sim B(n, p)) if:

P(X=r)=(nr)pr(1p)nr,r=0,1,2,,nP(X = r) = \binom{n}{r}p^r(1-p)^{n-r}, \quad r = 0, 1, 2, \ldots, n

Conditions for a binomial distribution:

  1. A fixed number nn of trials.
  2. Each trial has exactly two outcomes (success/failure).
  3. The probability of success pp is constant for each trial.
  4. Trials are independent.

1.2 Mean and variance

E(X)=np,Var(X)=np(1p)E(X) = np, \qquad \mathrm{Var}(X) = np(1-p)

Proof of E(X)=npE(X) = np.

E(X)=r=0nr(nr)pr(1p)nr=r=1nr(nr)pr(1p)nrE(X) = \sum_{r=0}^{n} r\binom{n}{r}p^r(1-p)^{n-r} = \sum_{r=1}^{n} r\binom{n}{r}p^r(1-p)^{n-r}

Using r(nr)=n(n1r1)r\binom{n}{r} = n\binom{n-1}{r-1}:

=npr=1n(n1r1)pr1(1p)nr=npk=0n1(n1k)pk(1p)n1k=np1=np= np\sum_{r=1}^{n}\binom{n-1}{r-1}p^{r-1}(1-p)^{n-r} = np\sum_{k=0}^{n-1}\binom{n-1}{k}p^k(1-p)^{n-1-k} = np \cdot 1 = np \quad \blacksquare

1.3 Cumulative probabilities

P(Xr)=k=0r(nk)pk(1p)nkP(X \leq r) = \sum_{k=0}^{r}\binom{n}{k}p^k(1-p)^{n-k}

P(Xr)=1P(Xr1)P(X \geq r) = 1 - P(X \leq r-1)

1.4 Worked example

Problem. A fair coin is tossed 12 times. Find the probability of getting: (a) exactly 7 heads; (b) at most 4 heads; (c) between 5 and 9 heads inclusive.

XB(12,0.5)X \sim B(12, 0.5).

(a) P(X=7)=(127)(0.5)12=7924096=995120.1934P(X = 7) = \dbinom{12}{7}(0.5)^{12} = \dfrac{792}{4096} = \dfrac{99}{512} \approx 0.1934

(b) P(X4)=k=04(12k)(0.5)12=1+12+66+220+4954096=79440960.1938P(X \leq 4) = \displaystyle\sum_{k=0}^{4}\dbinom{12}{k}(0.5)^{12} = \dfrac{1 + 12 + 66 + 220 + 495}{4096} = \dfrac{794}{4096} \approx 0.1938

(c) P(5X9)=P(X9)P(X4)=1P(X4)P(X10)P(5 \leq X \leq 9) = P(X \leq 9) - P(X \leq 4) = 1 - P(X \leq 4) - P(X \geq 10)

P(X10)=P(X2)P(X \geq 10) = P(X \leq 2) (by symmetry of p=0.5p = 0.5) =1+12+664096=794096= \dfrac{1 + 12 + 66}{4096} = \dfrac{79}{4096}

P(5X9)=17944096794096=322340960.7869P(5 \leq X \leq 9) = 1 - \dfrac{794}{4096} - \dfrac{79}{4096} = \dfrac{3223}{4096} \approx 0.7869


2. The Normal Distribution

2.1 Definition

A random variable XX has a normal distribution with parameters μ\mu and σ2\sigma^2 (written XN(μ,σ2)X \sim N(\mu, \sigma^2)) if its probability density function is:

f(x)=LB1RB◆◆LBσLB2πRB◆◆RBexp ⁣(LB(xμ)2RB◆◆LB2σ2RB),xRf(x) = \frac◆LB◆1◆RB◆◆LB◆\sigma\sqrt◆LB◆2\pi◆RB◆◆RB◆\exp\!\left(-\frac◆LB◆(x - \mu)^2◆RB◆◆LB◆2\sigma^2◆RB◆\right), \quad x \in \mathbb{R}

2.2 Properties

  • The distribution is symmetric about x=μx = \mu.
  • The mean, median, and mode are all equal to μ\mu.
  • E(X)=μE(X) = \mu, Var(X)=σ2\mathrm{Var}(X) = \sigma^2.
  • Approximately 68% of data lies within μ±σ\mu \pm \sigma.
  • Approximately 95% of data lies within μ±2σ\mu \pm 2\sigma.
  • Approximately 99.7% of data lies within μ±3σ\mu \pm 3\sigma.

2.3 Standardisation

To find probabilities, we standardise to the standard normal ZN(0,1)Z \sim N(0, 1):

Z=LBXμRB◆◆LBσRBZ = \frac◆LB◆X - \mu◆RB◆◆LB◆\sigma◆RB◆

P(Xx)=P ⁣(ZLBxμRB◆◆LBσRB)=Φ ⁣(LBxμRB◆◆LBσRB)P(X \leq x) = P\!\left(Z \leq \frac◆LB◆x - \mu◆RB◆◆LB◆\sigma◆RB◆\right) = \Phi\!\left(\frac◆LB◆x - \mu◆RB◆◆LB◆\sigma◆RB◆\right)

where Φ(z)\Phi(z) denotes the cumulative distribution function of the standard normal.

2.4 Worked example

Problem. The masses of bags of sugar are normally distributed with mean 1.02  kg1.02\;\mathrm{kg} and standard deviation 0.03  kg0.03\;\mathrm{kg}. Find: (a) the probability a randomly selected bag has mass less than 1.00  kg1.00\;\mathrm{kg}; (b) the probability the mass is between 0.980.98 and 1.05  kg1.05\;\mathrm{kg}; (c) the value mm such that 90% of bags have mass less than mm.

XN(1.02,0.032)X \sim N(1.02, 0.03^2).

(a) P(X<1.00)=P ⁣(Z<1.001.020.03)=P(Z<0.667)=1Φ(0.667)10.7476=0.2524P(X \lt 1.00) = P\!\left(Z \lt \dfrac{1.00 - 1.02}{0.03}\right) = P(Z \lt -0.667) = 1 - \Phi(0.667) \approx 1 - 0.7476 = 0.2524

(b) P(0.98<X<1.05)=P ⁣(0.981.020.03<Z<1.051.020.03)=P(1.333<Z<1.000)P(0.98 \lt X \lt 1.05) = P\!\left(\dfrac{0.98 - 1.02}{0.03} \lt Z \lt \dfrac{1.05 - 1.02}{0.03}\right) = P(-1.333 \lt Z \lt 1.000)

=Φ(1.000)Φ(1.333)=0.8413(10.9088)=0.84130.0912=0.7501= \Phi(1.000) - \Phi(-1.333) = 0.8413 - (1 - 0.9088) = 0.8413 - 0.0912 = 0.7501

(c) We need Φ ⁣(m1.020.03)=0.90\Phi\!\left(\dfrac{m - 1.02}{0.03}\right) = 0.90, so m1.020.03=1.282\dfrac{m - 1.02}{0.03} = 1.282.

m=1.02+0.03×1.282=1.058  kgm = 1.02 + 0.03 \times 1.282 = 1.058\;\mathrm{kg}

2.5 The normal approximation to the binomial

If XB(n,p)X \sim B(n, p) and nn is large, then XX is approximately normal with:

XN(np,np(1p))X \approx N(np, np(1-p))

Continuity correction. Since the binomial is discrete and the normal is continuous, apply a continuity correction:

  • P(Xk)P(Y<k+0.5)P(X \leq k) \approx P(Y \lt k + 0.5)
  • P(Xk)P(Y>k0.5)P(X \geq k) \approx P(Y \gt k - 0.5)
  • P(X=k)P(k0.5<Y<k+0.5)P(X = k) \approx P(k - 0.5 \lt Y \lt k + 0.5)

The approximation is reasonable when np>5np \gt 5 and n(1p)>5n(1-p) \gt 5.

2.6 Worked example: normal approximation

Problem. XB(80,0.45)X \sim B(80, 0.45). Use a normal approximation to find P(X>35)P(X \gt 35).

μ=80×0.45=36\mu = 80 \times 0.45 = 36, σ2=80×0.45×0.55=19.8\sigma^2 = 80 \times 0.45 \times 0.55 = 19.8, σ=4.45\sigma = 4.45.

XN(36,19.8)X \approx N(36, 19.8).

P(X>35)P(Y>34.5)=P ⁣(Z>34.5364.45)=P(Z>0.337)P(X \gt 35) \approx P(Y \gt 34.5) = P\!\left(Z \gt \dfrac{34.5 - 36}{4.45}\right) = P(Z \gt -0.337)

=1Φ(0.337)=Φ(0.337)0.632= 1 - \Phi(-0.337) = \Phi(0.337) \approx 0.632


3. The Poisson Distribution

3.1 Definition

A random variable XX has a Poisson distribution with parameter λ\lambda (written XPo(λ)X \sim \mathrm{Po}(\lambda)) if:

P(X=r)=LBeλλrRB◆◆LBr!RB,r=0,1,2,P(X = r) = \frac◆LB◆e^{-\lambda}\lambda^r◆RB◆◆LB◆r!◆RB◆, \quad r = 0, 1, 2, \ldots

Conditions:

  1. Events occur independently at a constant average rate.
  2. The probability of more than one event in a sufficiently small interval is negligible.
  3. Events occur singly in continuous time or space.

3.2 Mean and variance

E(X)=λ,Var(X)=λE(X) = \lambda, \qquad \mathrm{Var}(X) = \lambda

The equality of mean and variance is a distinguishing feature of the Poisson distribution.

3.3 Worked example

Problem. A call centre receives an average of 4.5 calls per minute. Assuming a Poisson model, find: (a) the probability of exactly 6 calls in a minute; (b) the probability of at most 2 calls in a minute; (c) the probability of more than 8 calls in a two-minute period.

XPo(4.5)X \sim \mathrm{Po}(4.5).

(a) P(X=6)=e4.5(4.5)66!=LBe4.5×8303.8RB◆◆LB720RB0.1271P(X = 6) = \dfrac{e^{-4.5}(4.5)^6}{6!} = \dfrac◆LB◆e^{-4.5} \times 8303.8◆RB◆◆LB◆720◆RB◆ \approx 0.1271

(b) P(X2)=e4.5 ⁣(1+4.5+4.522)=e4.5(1+4.5+10.125)=15.625e4.50.1736P(X \leq 2) = e^{-4.5}\!\left(1 + 4.5 + \dfrac{4.5^2}{2}\right) = e^{-4.5}(1 + 4.5 + 10.125) = 15.625\,e^{-4.5} \approx 0.1736

(c) For two minutes, YPo(9)Y \sim \mathrm{Po}(9).

P(Y>8)=1P(Y8)=1e9r=089rr!10.4557=0.5443P(Y \gt 8) = 1 - P(Y \leq 8) = 1 - e^{-9}\displaystyle\sum_{r=0}^{8}\dfrac{9^r}{r!} \approx 1 - 0.4557 = 0.5443

3.4 Poisson approximation to the binomial

If XB(n,p)X \sim B(n, p) where nn is large and pp is small (so that npnp is moderate), then:

XPo(np)X \approx \mathrm{Po}(np)

This is valid when n50n \geq 50 and p0.1p \leq 0.1 (and np10np \leq 10 as a rough guideline).

3.5 Worked example: Poisson approximation

Problem. A machine produces items with a defect rate of 0.02. In a batch of 200 items, find the probability that exactly 3 are defective.

XB(200,0.02)X \sim B(200, 0.02). Since n=200n = 200 is large and p=0.02p = 0.02 is small, XPo(4)X \approx \mathrm{Po}(4).

P(X=3)=LBe443RB◆◆LB3!RB=64e46=323e40.1954P(X = 3) = \frac◆LB◆e^{-4} \cdot 4^3◆RB◆◆LB◆3!◆RB◆ = \frac{64e^{-4}}{6} = \frac{32}{3}e^{-4} \approx 0.1954


4. Choosing the Correct Distribution

4.1 Decision framework

SituationDistribution
Fixed trials, two outcomes, const ppBinomial B(n,p)B(n, p)
Rare events, constant ratePoisson Po(λ)\mathrm{Po}(\lambda)
Continuous, symmetric, bell-shapedNormal N(μ,σ2)N(\mu, \sigma^2)

4.2 Sums of independent Poisson variables

Theorem. If XPo(λ1)X \sim \mathrm{Po}(\lambda_1) and YPo(λ2)Y \sim \mathrm{Po}(\lambda_2) are independent, then X+YPo(λ1+λ2)X + Y \sim \mathrm{Po}(\lambda_1 + \lambda_2).

Proof sketch. Using MGFs or direct convolution:

P(X+Y=r)=k=0rP(X=k)P(Y=rk)=k=0rLBeλ1λ1kRB◆◆LBk!RBLBeλ2λ2rkRB◆◆LB(rk)!RBP(X + Y = r) = \sum_{k=0}^{r}P(X = k)P(Y = r-k) = \sum_{k=0}^{r}\frac◆LB◆e^{-\lambda_1}\lambda_1^k◆RB◆◆LB◆k!◆RB◆ \cdot \frac◆LB◆e^{-\lambda_2}\lambda_2^{r-k}◆RB◆◆LB◆(r-k)!◆RB◆

=LBe(λ1+λ2)RB◆◆LBr!RBk=0r(rk)λ1kλ2rk=LBe(λ1+λ2)(λ1+λ2)rRB◆◆LBr!RB= \frac◆LB◆e^{-(\lambda_1+\lambda_2)}◆RB◆◆LB◆r!◆RB◆\sum_{k=0}^{r}\binom{r}{k}\lambda_1^k\lambda_2^{r-k} = \frac◆LB◆e^{-(\lambda_1+\lambda_2)}(\lambda_1+\lambda_2)^r◆RB◆◆LB◆r!◆RB◆ \quad \blacksquare

4.3 Worked example

Problem. A shop receives orders at an average rate of 3 per hour from online and 2 per hour from walk-in customers. Find the probability of receiving more than 7 orders in a two-hour period.

Total rate per hour =3+2=5= 3 + 2 = 5. For two hours, XPo(10)X \sim \mathrm{Po}(10).

P(X>7)=1P(X7)=1e10r=0710rr!10.2202=0.7798P(X \gt 7) = 1 - P(X \leq 7) = 1 - e^{-10}\displaystyle\sum_{r=0}^{7}\dfrac{10^r}{r!} \approx 1 - 0.2202 = 0.7798

warning

Common Pitfall When using the Poisson approximation to the binomial, always check that the conditions are met (large nn, small pp). If pp is close to 0.5, the normal approximation is more appropriate.


5. Practice Problems

Problem 1

XB(15,0.35)X \sim B(15, 0.35). Find: (a) P(X=5)P(X = 5); (b) P(3X7)P(3 \leq X \leq 7); (c) the most likely value of XX.

Solution

(a) P(X=5)=(155)(0.35)5(0.65)100.2123P(X = 5) = \dbinom{15}{5}(0.35)^5(0.65)^{10} \approx 0.2123.

(b) P(3X7)=P(X7)P(X2)0.95060.0355=0.9151P(3 \leq X \leq 7) = P(X \leq 7) - P(X \leq 2) \approx 0.9506 - 0.0355 = 0.9151.

(c) Mode (n+1)p=16×0.35=5.6\approx (n+1)p = 16 \times 0.35 = 5.6, so check r=5r = 5 and r=6r = 6.

P(X=5)0.2123P(X = 5) \approx 0.2123, P(X=6)0.2186P(X = 6) \approx 0.2186. The mode is X=6X = 6.

Problem 2

The heights of men are normally distributed with mean 175  cm175\;\mathrm{cm} and standard deviation 8  cm8\;\mathrm{cm}. Find the probability that a randomly selected man is: (a) taller than 190  cm190\;\mathrm{cm}; (b) between 168  cm168\;\mathrm{cm} and 182  cm182\;\mathrm{cm}; (c) what height is exceeded by only 5% of men?

Solution

(a) P(X>190)=P ⁣(Z>158)=P(Z>1.875)=10.9696=0.0304P(X \gt 190) = P\!\left(Z \gt \dfrac{15}{8}\right) = P(Z \gt 1.875) = 1 - 0.9696 = 0.0304.

(b) P(168<X<182)=P(0.875<Z<0.875)=2Φ(0.875)1=2(0.8092)1=0.6184P(168 \lt X \lt 182) = P(-0.875 \lt Z \lt 0.875) = 2\Phi(0.875) - 1 = 2(0.8092) - 1 = 0.6184.

(c) P(Z>z)=0.05    z=1.645P(Z \gt z) = 0.05 \implies z = 1.645. Height =175+1.645×8=188.2  cm= 175 + 1.645 \times 8 = 188.2\;\mathrm{cm}.

Problem 3

The number of emails received per hour follows a Poisson distribution with mean 6. Find the probability that: (a) exactly 4 emails are received in an hour; (b) more than 10 emails in two hours.

Solution

(a) P(X=4)=LBe664RB◆◆LB4!RB=129624e6=54e60.1335P(X = 4) = \dfrac◆LB◆e^{-6} \cdot 6^4◆RB◆◆LB◆4!◆RB◆ = \dfrac{1296}{24}e^{-6} = 54e^{-6} \approx 0.1335.

(b) For two hours, YPo(12)Y \sim \mathrm{Po}(12).

P(Y>10)=1P(Y10)10.6528=0.3472P(Y \gt 10) = 1 - P(Y \leq 10) \approx 1 - 0.6528 = 0.3472.

Problem 4

A die is rolled 60 times. Use a suitable approximation to find the probability that the number of sixes is between 8 and 14 inclusive.

Solution

XB(60,1/6)X \sim B(60, 1/6). μ=10\mu = 10, σ2=60×16×56=2538.333\sigma^2 = 60 \times \dfrac{1}{6} \times \dfrac{5}{6} = \dfrac{25}{3} \approx 8.333.

σ2.887\sigma \approx 2.887.

P(8X14)P(7.5<Y<14.5)P(8 \leq X \leq 14) \approx P(7.5 \lt Y \lt 14.5) where YN(10,25/3)Y \sim N(10, 25/3).

=P ⁣(7.5102.887<Z<14.5102.887)=P(0.866<Z<1.558)= P\!\left(\dfrac{7.5 - 10}{2.887} \lt Z \lt \dfrac{14.5 - 10}{2.887}\right) = P(-0.866 \lt Z \lt 1.558)

=Φ(1.558)Φ(0.866)=0.94040.1931=0.7473= \Phi(1.558) - \Phi(-0.866) = 0.9404 - 0.1931 = 0.7473.