Tutorial 3: Worked Examples


Introduction

This is the third tutorial in a series on common problems in Statistics, alongside suggested solutions. This particular tutorial focuses on problems (deemed either basic or medium) at the undergraduate level (or first-year master's level). Your suggestions are appreciated, and highly welcomed.

Problem 1 (Basic)

1.(a)

Suppose the side length of a square is a random variable uniformly distributed on $[0, 1]$. If $X$ is the side length of the square, calculate the expected area of the square.

Sol.

Given $X \sim U(0, 1)$. Define $g(X)$ as the area of the square, so that \begin{equation*} \begin{split} \mbox{Area} & = g(X) = X^2\\
E\big(g(X)\big) &= \int_0^1 g(x)f_X(x)dx\\
E(X^2) & = \int_0^1 x^2 dx = \frac13x^3\bigg\lvert_0^1\\
& = \frac 13 \mbox{ sq. units} \end{split} \end{equation*}
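As a quick sanity check, here is a minimal Monte Carlo sketch in Python (the seed and sample size are arbitrary choices of mine):

```python
# Monte Carlo check of E(X^2) = 1/3 for X ~ U(0, 1).
import numpy as np

rng = np.random.default_rng(seed=42)
x = rng.uniform(0.0, 1.0, size=1_000_000)  # side lengths X ~ U(0, 1)
area = x**2                                # g(X) = X^2, the area of the square

print(area.mean())                         # ~0.3333, matching E(X^2) = 1/3
```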

1.(b)

Let $X_1, X_2, \ldots, X_n$ be independent random variables, where the distribution of each $X_i$ is Poisson with parameter $\lambda_i$, $i = 1, 2, \ldots, n$. Using moment generating functions, show that $Y = \sum_{i = 1}^n X_i$ has a Poisson distribution with parameter $\sum_{i=1}^n \lambda_i$.

Sol.

If $X_i \sim \mbox{Poi}(\lambda_i)$, then its moment generating function (m.g.f.) is $M_{X_i}(t) = e^{\lambda_i(e^t - 1)}$.

Since the variables are independent, the m.g.f. of $Y = \sum_{i = 1}^n X_i$ is the product of the individual m.g.f.s: \begin{equation*} \begin{split} M_Y(t) & = M_{X_1}(t) \cdot M_{X_2}(t) \cdots M_{X_n}(t)\\[2mm] & = e^{\lambda_1 (e^t - 1)} \cdot e^{\lambda_2 (e^t - 1)} \cdots e^{\lambda_n (e^t - 1)}\\[2mm] & = e^{(\lambda_1 + \lambda_2 + \ldots + \lambda_n)(e^t - 1)}\\[2mm] M_{\sum X_i}(t) & = e^{\left(\sum_{i=1}^n\lambda_i\right)(e^t - 1)} \end{split} \end{equation*} This is the m.g.f. of a Poisson random variable with parameter $\sum_{i=1}^n \lambda_i$, and since the m.g.f. uniquely determines the distribution, $\therefore \sum_{i=1}^n X_i \sim \mbox{Poi} \left(\sum_{i=1}^n\lambda_i\right)$
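A small simulation sketch supports this (the rates $0.5, 1.2, 2.3$ below are arbitrary illustrative values): the simulated sum should have mean and variance both close to $\sum_i \lambda_i$, as a Poisson variable must.

```python
# Simulate sums of independent Poissons and check Poisson(sum of rates) behaviour.
import numpy as np

rng = np.random.default_rng(1)
lams = np.array([0.5, 1.2, 2.3])                     # arbitrary example rates
samples = rng.poisson(lam=lams, size=(100_000, 3)).sum(axis=1)

print(samples.mean(), samples.var())                 # both ~4.0 = sum(lams),
                                                     # as expected for Poisson(4.0)
```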

1.(c)

Let $X$ be a random variable which follows a Gamma distribution with density \begin{equation*} f(x; n, \beta) = \frac{1}{(n-1)!\beta^n} x^{n-1}e^{-\frac{x}{\beta}}, \quad 0 < x < \infty,\ \beta >0,\ n = 1, 2, \ldots \end{equation*}

Find the probability density function of $Y = g(X) = \frac 1X$.

Sol.

Given a density function, one way to solve this is to use the method of transformation.

Let $y = g(x) = \frac1x$. The density of $Y$ can be computed as $$f_Y(y) = \left \lvert \frac{dx}{dy} \right \rvert \cdot f_X\big(W(y)\big)$$ where $W(y)$ is the inverse function of $g(x)$.

\begin{equation*} \begin{split} y & = \frac1x \implies x = \frac1y\\
\left \lvert \frac{dx}{dy} \right \rvert & = \lvert -\frac{1}{y^2}\rvert = \frac{1}{y^2} \end{split} \end{equation*}

\begin{equation*} \begin{split} f_Y(y) &= \frac{1}{y^2} \times \frac{1}{(n-1)!\beta^n} \left(\frac{1}{y}\right)^{n-1}e^{-\frac{1}{\beta y}}\\[3mm] & = \frac{1}{(n-1)!\beta^n} \left(\frac{1}{y}\right)^{n+1}e^{-\frac{1}{\beta y}}, \quad y > 0 \end{split} \end{equation*} Note this is not a Gamma density in $y$; it is the density of an inverse-Gamma distribution: $\therefore Y = \frac1X \sim \mbox{Inv-Gamma}\left(n, \frac1\beta\right)$ (shape $n$, scale $\frac1\beta$).
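As a sanity check of the derived density, the sketch below simulates $X \sim \mbox{Gamma}(n, \beta)$ with illustrative values $n = 3$, $\beta = 2$ (my choices), transforms to $Y = 1/X$, and compares an empirical probability with the corresponding integral of $f_Y$:

```python
# Simulate X ~ Gamma(n, beta), transform Y = 1/X, and compare P(Y <= 1)
# estimated from the sample with the integral of the derived density f_Y.
import numpy as np
from math import factorial

rng = np.random.default_rng(7)
n, beta = 3, 2.0
y = 1.0 / rng.gamma(shape=n, scale=beta, size=500_000)

def f_Y(t):
    # derived density: (1/y)^(n+1) * exp(-1/(beta*y)) / ((n-1)! * beta^n)
    return t**(-(n + 1)) * np.exp(-1.0 / (beta * t)) / (factorial(n - 1) * beta**n)

grid = np.linspace(1e-6, 1.0, 200_000)
riemann = np.sum(f_Y(grid)) * (grid[1] - grid[0])   # ~ P(Y <= 1) from f_Y
print((y <= 1.0).mean(), riemann)                   # both ~0.9856
```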

Problem 2 (Medium)

2.(a)

In a certain population, it is believed that $1.5\%$ of the population has disease X. Suppose health providers in that community decide to embark on a free screening exercise for the said disease. For a person who has the disease, the test returns a positive result with probability $97\%$; for a person who does not have the disease, the test returns a negative result with probability $95\%$.

i.) What is the probability that a test result returns positive?

ii.) Assuming you presented yourself for testing and your test result came out positive for the disease, what is the probability that you actually have the disease?

Sol.

This is clearly a Bayes' theorem problem. Define $D$ as the event that a subject has the disease, $T^+$ as the event that a subject's test result returns positive, and $T^-$ as the event that a subject's test result returns negative. We proceed with the following pieces of information:

\begin{equation*} \begin{split} P(D) &= 0.015 \quad \mbox{(prevalence)}\\
P(T^+\vert D) &= 0.97 \quad \mbox{(sensitivity)}\\
P(T^-\vert D^\prime) &= 0.95 \quad \mbox{(specificity)} \end{split} \end{equation*}

i.)

We partition the sample space into $D$ and $D^\prime$ and apply the law of total probability. Thus,

\begin{equation*} \begin{split} P(T^+) & = P(D)P(T^+\vert D) + P(D^\prime)P(T^+\vert D^\prime)\\
& = 0.015 \times 0.97 + (1-0.015) \times (1-0.95)\\
& = 0.0638 \end{split} \end{equation*}

ii.)

We’re interested in $P(D\vert T^+)$, the probability you have the disease given you tested positive to the disease. Using the Bayesian formulation, we proceed as follows:

\begin{equation*} \begin{split} P(D\vert T^+) &= \frac{P(D)P(T^+\vert D)}{P(D)P(T^+\vert D) + P(D^\prime)P(T^+\vert D^\prime)}\\[2mm] &= \frac{0.015 \times 0.97}{0.0638}\\[2mm] &\approx 0.2281\ (\mathbf{22.81\%}) \end{split} \end{equation*} Despite the accurate test, the low prevalence keeps this posterior probability modest: most positive results come from the large disease-free group.
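The whole calculation fits in a few lines of Python; a sketch:

```python
# Bayes computation for parts i.) and ii.).
p_d = 0.015            # prevalence P(D)
sens = 0.97            # P(T+ | D)
spec = 0.95            # P(T- | D')

p_pos = p_d * sens + (1 - p_d) * (1 - spec)   # law of total probability
p_d_given_pos = p_d * sens / p_pos            # Bayes' theorem

print(p_pos)           # 0.0638
print(p_d_given_pos)   # ~0.2281, i.e. about 22.8%
```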

2.(b)

$40\%$ of the products produced in a certain factory are produced by machine A. Machine B produces $10\%$ of the products while machine C produces $50\%$ of the products. The proportions of defective products from these $3$ machines are $2\%, 3\%$ and $4\%$ respectively. One of the products in the factory is selected at random.

i.) Find the probability that this product is defective.

ii.) If the product is defective, find the probability that it came from:

$\alpha$. Machine A

$\beta$. Machine B

$\gamma$. Machine C

Sol.

$P(A) = 0.40, P(B) = 0.10, P(C) = 0.50$

Let $D$ be the event that a defective product is produced. Then

$P(D|A) = 0.02$
$P(D|B) = 0.03$
$P(D|C) = 0.04$

i.)

\begin{equation*} \begin{split} P(D) &= P(A)P(D|A) + P(B)P(D|B) + P(C)P(D|C)\\
& = 0.40(0.02) + 0.10(0.03) + 0.50(0.04)\\
& = 0.031 \end{split} \end{equation*}

ii.)

$\alpha.$ $P(A|D) = \frac{P(A)P(D|A)}{P(D)} = \frac{0.40\times 0.02}{0.031} = \frac{8}{31}$

$\beta.$ $P(B|D) = \frac{P(B)P(D|B)}{P(D)} = \frac{0.10\times 0.03}{0.031} = \frac{3}{31}$

$\gamma.$ $P(C|D) = \frac{P(C)P(D|C)}{P(D)} = \frac{0.50\times 0.04}{0.031} = \frac{20}{31}$
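A short sketch verifying parts i.) and ii.) (note the three posteriors sum to $1$):

```python
# Total probability of a defect, then the posterior for each machine.
priors = {"A": 0.40, "B": 0.10, "C": 0.50}
defect_rates = {"A": 0.02, "B": 0.03, "C": 0.04}

p_defect = sum(priors[m] * defect_rates[m] for m in priors)
posteriors = {m: priors[m] * defect_rates[m] / p_defect for m in priors}

print(p_defect)       # 0.031
print(posteriors)     # A: 8/31 ~ 0.258, B: 3/31 ~ 0.097, C: 20/31 ~ 0.645
```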

2.(c)

$X_1, X_2$ are independent variables and $Y$ is the dependent variable. A sample of $10$ units is drawn as shown in the table below.

| $X_1$ | $X_2$ | $y$ |
|:-----:|:-----:|:---:|
| 5 | 1 | 3 |
| 5 | 1 | 4 |
| 6 | 3 | 5 |
| 6 | 4 | 5 |
| 7 | 5 | 5 |
| 7 | 6 | 6 |
| 7 | 6 | 7 |
| 8 | 5 | 7 |
| 9 | 3 | 8 |
| 10 | 6 | 10 |

Find the fitted regression equation, $\hat{y} = \hat\beta_0 + \hat\beta_1X_1 + \hat\beta_2X_2$, for the model $y = \beta_0 + \beta_1X_1 + \beta_2 X_2 + \epsilon$.

Sol.

Using the matrix approach, write the model as $y = X\beta + \epsilon$, so that

$$\hat\beta = (X^\prime X)^{-1}X^\prime y$$

\begin{equation*} \begin{split} X^\prime X & = \begin{pmatrix} 1 & 1 & \ldots & 1\\
5 & 6 & \ldots & 10\\
1 & 1 & \ldots & 6 \end{pmatrix} \begin{pmatrix} 1 & 5 & 1\\
1 & 6 & 1\\
\vdots & \vdots & \vdots\\
1 & 10 & 6 \end{pmatrix}\\
& = \begin{pmatrix} 10 & 70 & 40\\
70 & 514 & 298\\
40 & 298 & 194 \end{pmatrix} \end{split} \end{equation*}

\begin{equation*} \begin{split} (X^\prime X)^{-1} = \begin{pmatrix} 2.2179 & -0.3374 & 0.0610\\
-0.3374 & 0.0691 & -0.0366\\
0.0610 & -0.0366 & 0.0488 \end{pmatrix} \end{split} \end{equation*}

\begin{equation*} \begin{split} X^\prime y = \begin{pmatrix} 60\\
449\\
264 \end{pmatrix} \end{split} \end{equation*}

\begin{equation*} \begin{split} \hat{\beta} & = \begin{pmatrix} 2.2179 & -0.3374 & 0.0610\\
-0.3374 & 0.0691 & -0.0366\\
0.0610 & -0.0366 & 0.0488 \end{pmatrix} \begin{pmatrix} 60\\
449\\
264 \end{pmatrix}\\[2mm] & = \begin{pmatrix} -2.3211\\
1.1260\\
0.1098 \end{pmatrix}\\[3mm] \therefore \hat{y} & = -2.3211 + 1.1260X_1 + 0.1098X_2 \end{split} \end{equation*}
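A numpy cross-check of the hand computation (a sketch; `np.linalg.solve` is used instead of forming the inverse explicitly):

```python
# Least-squares coefficients from the normal equations (X'X) b = X'y.
import numpy as np

X1 = np.array([5, 5, 6, 6, 7, 7, 7, 8, 9, 10], dtype=float)
X2 = np.array([1, 1, 3, 4, 5, 6, 6, 5, 3, 6], dtype=float)
y = np.array([3, 4, 5, 5, 5, 6, 7, 7, 8, 10], dtype=float)

X = np.column_stack([np.ones_like(X1), X1, X2])   # design matrix with intercept
beta_hat = np.linalg.solve(X.T @ X, X.T @ y)

print(beta_hat)   # ~[-2.3211, 1.1260, 0.1098]
```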

Go a little further and test $H_0: \beta_1 = 0 \quad \mbox{vs}\quad H_1: \beta_1 \neq 0$ at $\alpha = 0.05$.

Hint: Use $t = \frac{\hat{\beta}_1}{\mathrm{SE}(\hat{\beta}_1)}$, where $\mathrm{SE}(\hat\beta_1) = \sqrt{\hat\sigma^2 \left[(X^\prime X)^{-1}\right]_{22}}$, and reject $H_0$ if $|t| > t_{\alpha/2,\, n-p}$
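If you want to check your work afterwards, here is one sketch of the test (scipy supplies the critical value; the variable names are mine):

```python
# t-test of H0: beta_1 = 0 at alpha = 0.05 for the regression above.
import numpy as np
from scipy import stats

X1 = np.array([5, 5, 6, 6, 7, 7, 7, 8, 9, 10], dtype=float)
X2 = np.array([1, 1, 3, 4, 5, 6, 6, 5, 3, 6], dtype=float)
y = np.array([3, 4, 5, 5, 5, 6, 7, 7, 8, 10], dtype=float)
X = np.column_stack([np.ones_like(X1), X1, X2])      # design matrix

n, p = X.shape                                       # 10 observations, 3 parameters
XtX_inv = np.linalg.inv(X.T @ X)
beta_hat = XtX_inv @ X.T @ y
resid = y - X @ beta_hat
sigma2_hat = resid @ resid / (n - p)                 # estimated error variance
se_beta1 = np.sqrt(sigma2_hat * XtX_inv[1, 1])       # SE of the X1 coefficient

t_stat = beta_hat[1] / se_beta1                      # ~6.9
t_crit = stats.t.ppf(1 - 0.05 / 2, df=n - p)         # ~2.36
print(t_stat, t_crit, abs(t_stat) > t_crit)          # H0 rejected here
```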

Problem 3 (Basic-medium)

3.(a)

Suppose $X$ is a random variable with density defined as

\begin{equation*} f(x) = \begin{cases} \frac{2}{5}|x|, & -1 < x < 2\\
\\
0, & \mbox{otherwise} \end{cases} \end{equation*}

i. Evaluate $\int_{-\infty}^\infty xf(x)dx$

ii. Find the variance of $X$, $\mbox{Var}(X)$.

Sol.

i.

\begin{equation*} \begin{split} \int_{-\infty}^\infty xf(x)dx & = \int_{-1}^0 \frac{2x|x|}{5}dx + \int_{0}^2 \frac{2x|x|}{5}dx\\[2mm] E(X) & = -\frac25 \int_{-1}^0 x^2dx + \frac25 \int_{0}^2 x^2dx\\[2mm] & = -\frac{2}{15}x^3\bigg\lvert_{-1}^0 + \frac{2}{15}x^3\bigg\lvert_0^2\\[2mm] & = -\frac{2}{15} + \frac{16}{15}\\
& = \frac{14}{15}. \end{split} \end{equation*}

ii.

\begin{equation*} \begin{split} \mbox{Var}(X) & = E(X^2) - (E(X))^2 \\[2mm] E(X^2) & = \int_{-1}^0 \frac{2x^2|x|}{5}dx + \int_{0}^2 \frac{2x^2|x|}{5}dx = -\frac25\int_{-1}^0 x^3dx + \frac25\int_0^2 x^3dx\\[2mm] & = -\frac{1}{10}x^4\bigg\lvert_{-1}^0 + \frac{1}{10}x^4\bigg\lvert_0^2 = \frac{1}{10} + \frac{16}{10} = \frac{17}{10}\\[2mm] \mbox{Var}(X) & = \frac{17}{10} - \left(\frac{14}{15}\right)^2 = \frac{17}{10}-\frac{196}{225}\\
& = \frac{373}{450} \approx 0.829. \end{split} \end{equation*}
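A quick numeric check of both answers, approximating the integrals on a fine grid (the grid size is an arbitrary choice):

```python
# Riemann-sum check of E(X) = 14/15 and Var(X) = 373/450 for f(x) = (2/5)|x|.
import numpy as np

x = np.linspace(-1, 2, 300_001)
f = 0.4 * np.abs(x)                    # f(x) = (2/5)|x| on (-1, 2)
dx = x[1] - x[0]

ex = np.sum(x * f) * dx                # ~0.9333 = 14/15
ex2 = np.sum(x**2 * f) * dx            # ~1.7    = 17/10
print(ex, ex2 - ex**2)                 # variance ~0.8289 = 373/450
```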

3.(b)

A random variable, $Y$, is uniformly distributed over the interval (-1, 8), i.e. $Y \sim U(-1, 8)$.

Find the probability that the equation $2x^2 + 4Yx + 3Y+2 = 0$ has real roots.

Sol.

Since $Y\sim U(-1, 8)$, its density is \begin{equation*} f_Y(y) = \begin{cases} \frac19, & -1 < y < 8\\
\\
0, & \mbox{otherwise} \end{cases} \end{equation*}

For the equation $2x^2 + 4Yx + 3Y+2 = 0$ to have real roots, the discriminant must be non-negative, thus:

\begin{equation*} \begin{split} b^2 - 4ac \ge 0 & \implies (4Y)^2 - 4(2)(3Y+2) \ge 0\\
& \implies 16Y^2-24Y - 16 \ge 0\\
& \implies 2Y^2 -3Y - 2 \ge 0 \implies (2Y+1)(Y-2) \ge 0\\
&\implies Y \ge 2\quad \mbox{or}\quad Y \le -\frac12 \end{split} \end{equation*}

Using the probability density function above, and the idea of mutually exclusive events, we have;

\begin{equation*} \begin{split} P\left(Y \le -\frac12 \cup Y \ge 2\right) & = P\left(Y \le -\frac12\right) + P(Y \ge 2)\\[2mm] & = \int_{-1}^{-\frac12}f_Y(y)dy + \int_2^8 f_Y(y)dy\\[2mm] & = \frac19\left(\frac12 + 6\right)\\[2mm] & = \frac{13}{18}. \end{split} \end{equation*}
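A Monte Carlo sketch confirming the answer (seed and sample size arbitrary):

```python
# Estimate P(real roots) for 2x^2 + 4Yx + 3Y + 2 = 0 with Y ~ U(-1, 8).
import numpy as np

rng = np.random.default_rng(3)
Y = rng.uniform(-1.0, 8.0, size=1_000_000)
disc = (4 * Y)**2 - 4 * 2 * (3 * Y + 2)      # discriminant b^2 - 4ac

print((disc >= 0).mean())                    # ~0.7222 = 13/18
```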

3.(c)

A random variable $X$ has moment generating function $$M_X(t) = \frac{0.5}{0.5 - t}, \quad t < \frac12$$ Using this piece of information, find:

i. $P(X > \ln 4)$
ii. $E(X)$
iii. $\mbox{Var}(X)$.

Sol.

Note that the moment generating function for the Exponential distribution with rate $\lambda$ is $M_X(t) = \frac{\lambda}{\lambda - t}, \quad t < \lambda$. Comparing this with the above function, we may infer that $X \sim \mbox{Exp}\left(\lambda = \frac12\right)$, i.e.

\begin{equation*} f(x) = \begin{cases} \frac12 e^{-\frac{x}{2}}, & x > 0 \\
\\
0, & \mbox{otherwise} \end{cases} \end{equation*}

i.

For the Exponential distribution, the survival function is $$P(X > x) = 1 - F(x) = e^{-\lambda x},\quad x > 0$$ $\therefore P(X > \ln 4) = e^{-\frac12\ln 4} = 4^{-\frac12} = \frac12.$

ii.

\begin{equation*} \begin{split} E(X) & = \frac{d}{dt}M_X(t)\bigg\lvert_{t = 0}\\
& = \frac{0.5}{(0.5 - t)^2}\bigg\lvert_{t=0}\\
& = 2\\[4mm] E(X^2) & = \frac{d^2}{dt^2}M_X(t)\bigg\lvert_{t = 0}\\
& = \frac{2(0.5)}{(0.5 - t)^3}\bigg\lvert_{t= 0}\\
& = 8\\[4mm] \mbox{Var}(X) & = E(X^2) - [E(X)]^2\\
& = 8 - 2^2 = 4\\
\therefore \mbox{Var}(X) & = 4. \end{split} \end{equation*}
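Symbolic differentiation of the m.g.f. (a sketch using sympy) confirms parts ii. and iii.:

```python
# Differentiate M(t) = 0.5/(0.5 - t) at t = 0 to recover the moments.
import sympy as sp

t = sp.symbols("t")
M = sp.Rational(1, 2) / (sp.Rational(1, 2) - t)

EX = sp.diff(M, t).subs(t, 0)         # first moment:  2
EX2 = sp.diff(M, t, 2).subs(t, 0)     # second moment: 8
print(EX, EX2 - EX**2)                # 2 4
```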

Did you find this post helpful? Any suggestions? Consider sharing it😊😊😊

Abubakari Sumaila Salpawuni
PhD candidate (Statistics)

My research interests include the applications of survival analysis in Medicine, sequential decision processes, dynamics of visualizations in R and Python.
