Chapter 1 Statistical Distributions
Recap: CM 1110-Probability
Axioms of probability

- Probability of an event quantifies the uncertainty, randomness, or the possibility of occurrence of the event.
- The probability of event E is usually denoted by \(P(E)\).
- Mathematically, \(P(.)\) is a set function defined on the events (subsets) of the sample space \(\Omega\), taking values in the interval \([0, 1]\) and satisfying the following properties.
These are called the ‘axioms of probability’.
- Axiom 1: For any event \(A\), \(P(A) \geq 0\)
- Axiom 2: \(P(\Omega) = 1\)
- Axiom 3:
- If \(A_1, A_2, \dots, A_k\) is a finite collection of mutually exclusive events, then \[P(A_1\cup A_2\cup \dots \cup A_k)= \sum_{i=1}^kP(A_i)\]
- If \(A_1, A_2, \dots\) is an infinite collection of mutually exclusive events, then \[P(A_1\cup A_2\cup \dots)= \sum_{i=1}^\infty P(A_i)\]
NOTE
- Axioms 1 and 2 imply that for any event \(E\), \(0 \leq P (E) \leq 1\).
- \(P (E) = 1 \iff\) the event E is certain to occur.
- \(P (E) = 0 \iff\) the event E cannot occur.
Methods for determining Probability
- There are several ways for determining the probability of events.
- Usually we use the following methods to obtain the probability of events.
- Classical method
- Relative frequency method (Empirical approach)
- Subjective method
- Using probability models
1.1 Random Variable
- Some sample spaces contain quantitative (numerical) outcomes, others contain qualitative outcomes.
- Often it is convenient to work with sample spaces containing numerical outcomes.
- A function that maps the original sample space into the real numbers is called a ‘random variable’.
- This is more useful when the original sample space contains qualitative outcomes.
Definition 1: Random Variable
Let \(\Omega\) be a sample space. Let \(X\) be a function from \(\Omega\) to \(\Re\) (i.e. \(X:\Omega \rightarrow \Re\)). Then \(X\) is called a random variable.

- A random variable assigns a real number to each outcome of a sample space.
- In other words, to each outcome of an experiment, or sample point \(\omega_i\) of the sample space, there corresponds a unique real number \(x_i\), known as the value of the random variable \(X\).
- The range of the random variable is called the induced sample space.
- A note on notation: Random variables will always be denoted by uppercase letters, and the realized values of the random variable (or its range) will be denoted by the corresponding lowercase letters. Thus, the random variable \(X\) can take the value \(x\).
- Each outcome of a sample space occurs with a certain probability. Therefore, each possible value of a random variable is associated with a probability.
- Any event of a sample space can be written in terms of a suitably defined random variable.
1.1.1 Types of Random Variables
- A random variable is one of two types:
- Discrete Random Variable
- Continuous Random Variable
1.1.1.1 Discrete Random Variable
- If the induced sample space is discrete, then the random variable is called a discrete random variable.
Example 01 Consider the experiment of tossing a coin. Express the following events using a suitably defined random variable
\(H=\) The event of getting a head
\(T=\) The event of getting a tail

Example 02
Consider the experiment of rolling of a die. Express the following events using a suitably defined random variable
\(A=\) The event that the number faced up is less than 5
\(B=\) The event that the number faced up is even
\(C=\) The event that the number faced up is 2 or 5

Example 03
Consider the experiment of tossing a coin 10 times. Then the sample space \(\Omega\) contains \(2^{10} = 1024\) outcomes. Each outcome is a sequence of 10 H’s and T’s.
Express the following events in terms of a suitably defined random variable.
\(D=\) The event that the number of heads is 5
\(E=\) The event that the number of tails is less than 4

1.1.1.2 Continuous Random Variable
- If the induced sample space is continuous, then the random variable is called a continuous random variable.
Example 04
Consider the experiment of measuring the lifetime (in hours) of a randomly selected bulb. Express the following events in terms of a suitably defined random variable.
\(F=\) The event that the lifetime is less than 300 hours
\(G=\) The event that the lifetime is 1000 hours

1.2 Probability Mass Function
Definition 2: Discrete density function of a discrete random variable
If \(X\) is a discrete random variable with distinct values \(x_1, x_2, \dots, x_n, \dots,\) then the function, denoted by \(f_X(.)\) and defined by
\[\begin{equation} f_X(x) = \begin{cases} P(X=x) & \text{if } x=x_j, j=1,2,\dots,n,\dots\\ 0 & \text{if } x \neq x_j \end{cases} \end{equation}\]
is defined to be the discrete density function of \(X\).
- The values of a discrete random variable are often called mass points.
- \(f_X(x_j)\) denotes the mass associated with the mass point \(x_j\).
- Probability mass function, discrete frequency function, and probability function are other terms used in place of discrete density function.
- The probability function gives the probability for the different values of \(X\).
1.2.1 Properties of a Probability Mass Function
- Let \(X\) be a discrete random variable with probability mass function \(f_X(x)\). Then,
- For any \(x\in \Re\), \(0\leq f_X(x) \leq 1.\)
- Let \(E\) be an event and \(I= \{X(\omega):\omega \in E\}.\) Then \(P(E) = P(X\in I) = \sum_{x \in I}f_X(x).\)
- Let \(R = \{X(\omega):\omega \in \Omega\}.\) Then \(\sum_{x\in R} f_X(x) = 1.\)
1.2.2 Representations of Probability Mass Functions
Example 05
Consider the experiment of tossing a fair coin. Let
\[\begin{equation} X = \begin{cases} 0 & \text{if the outcome is a Tail }\\ 1 & \text{if the outcome is a Head} \end{cases} \end{equation}\]
Find the probability mass function of \(X\). Is \(X\) discrete or continuous?
1.2.2.1 Using a table

1.2.2.2 Using a function

1.2.2.3 Using a graph
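All three representations can also be produced programmatically. Below is a minimal sketch in Python for the fair-coin pmf of Example 05 (matplotlib is assumed to be available; the names `pmf_table` and `f_X` are illustrative, not from the notes).

```python
# Three representations of the pmf in Example 05: table, function and graph.
import matplotlib.pyplot as plt

# Table form: each mass point with its probability
pmf_table = {0: 0.5, 1: 0.5}   # X = 0 (Tail), X = 1 (Head)

# Function form: f_X(x) = 0.5 for x in {0, 1}, 0 otherwise
def f_X(x):
    return 0.5 if x in (0, 1) else 0.0

# Graph form: vertical bars at the mass points
plt.stem(list(pmf_table.keys()), list(pmf_table.values()))
plt.xlabel("x"); plt.ylabel("f_X(x)"); plt.title("pmf of X (fair coin)")
plt.show()
```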

1.3 Probability Density Function
Let \(X\) be a continuous random variable.
Then, it is not possible to define a pmf \(f_X\) with the properties mentioned in Section 1.2.1. Why?
Instead, we can find a function \(f_X\) with some different properties.
The probability density function (pdf) of a continuous random variable is a function that describes the relative likelihood of the random variable taking a value near a given point.
1.3.1 Properties of a Probability Density Function
Let \(X\) be a continuous random variable with probability density function \(f_X\). Then,
- For any \(x\in \Re\), \(f_X(x) \geq0\).
- Let \(E\) be an event and \(I= \{X(\omega):\omega \in E\}.\) Then \(P(E) = P(X\in I) = \int_If_X(x)dx.\)
- Let \(R = \{X(\omega):\omega \in \Omega\}.\) Then \(\int_\Re f_X(x)dx= 1.\)
1.3.2 Existence of pdf
- To see the existence of such a function, consider a continuous random variable \(X\),
- Suppose that we have a very large number of observations, \(N\), of \(X\), measured to high accuracy (large number of decimal places).
- Consider the following grouped frequency table and the histogram constructed from those data.
- The height of the bar on a class interval of this histogram is equal to the relative frequency per unit in that class interval.
| Interval | Class boundaries | Class frequency | Height of the bar | Area of the bar |
|---|---|---|---|---|
| \(I_1\) | \(x_1 -\delta x/2, x_1+\delta x/2\) | \(n_1\) | \(\frac{n_1}{\delta x\, N}\) | \(\frac{n_1}{N}\) |
| \(I_2\) | \(x_2 -\delta x/2, x_2+\delta x/2\) | \(n_2\) | \(\frac{n_2}{\delta x\, N}\) | \(\frac{n_2}{N}\) |
| \(\vdots\) | \(\vdots\) | \(\vdots\) | \(\vdots\) | \(\vdots\) |
| \(I_k\) | \(x_k -\delta x/2, x_k+\delta x/2\) | \(n_k\) | \(\frac{n_k}{\delta x\, N}\) | \(\frac{n_k}{N}\) |
| Total | | \(N\) | | 1 |
Figure 1.1: Histograms with different class intervals and a possible model for the pdf
- Then, for the \(i^{th}\) interval,
\[P(x_i - \frac{\delta x}{2} \leq X \leq x_i + \frac{\delta x}{2}) \approx \text{Area of the bar} \] and therefore
\[\text{Height of the bar} \approx \frac{\text{Area of the bar}}{\delta x} \approx \frac{ P(x_i - \frac{\delta x}{2} \leq X \leq x_i + \frac{\delta x}{2})}{\delta x} \]
Therefore, the height of a bar represents the probability density in that class interval.
When \(\delta x \rightarrow0\), it will allow us to approximate the histogram by a smooth curve as in Figure (d).
As the area under each histogram is 1, the area under the curve is also 1
For any point \(x\),
\[\text{The height of the curve} \approx \lim \limits_{\delta x \to 0} \frac{P(x-\frac{\delta x}{2} \leq X \leq x+\frac{\delta x}{2})}{\delta x} \]
will represent the probability density at the point \(x\).
- Let the above smooth curve be denoted by \(f_X\).
- Then, \(f_X\) has the properties mentioned in Section 1.3.1.
This function is called the probability density function of \(X\).
NOTE: Here \(f_X(x)\) represents the probability density at the point \(x\), not the probability at the point \(x\).
1.3.3 Calculation of Probability using pdf
- Let \(c,d \in \Re\) such that \(c\leq d\). Then,
\[P(c\leq X \leq d) = \int_c^d f_X(x)dx\]
Figure 1.2: \(P(c\leq X \leq d) = \int_c^d f_X(x)dx\)
- NOTE: if \(X\) is a continuous random variable with the p.d.f \(f_X\), then for any \(k \in \Re\),
\[P(X=k) = P(k\leq X\leq k)=\int_k^kf_X(x)dx = 0\]
- Therefore, for a continuous random variable \(X\), \[P(c<X<d) = P(c\leq X<d) = P(c < X\leq d)= P(c \leq X\leq d)= \int_c^df_X(x)dx\]
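As a quick numerical illustration of computing \(P(c \leq X \leq d)\) by integrating a pdf, here is a short Python sketch. SciPy is assumed to be installed, and the density \(f_X(x) = 2x\) on \([0,1]\) is an illustrative choice, not one from the notes.

```python
# P(c <= X <= d) by numerical integration of an illustrative pdf f_X(x) = 2x on [0, 1].
from scipy.integrate import quad

def f_X(x):
    return 2 * x if 0 <= x <= 1 else 0.0

c, d = 0.2, 0.5
prob, _ = quad(f_X, c, d)      # integral of f_X from c to d
print(prob)                    # ~0.21, i.e. 0.5^2 - 0.2^2
```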
1.4 Cumulative Distribution Function
- There are many problems in which it is of interest to know the probability that the value of a random variable is less than or equal to some real number \(x\).
Definition 3: Cumulative distribution function
The cumulative distribution function or \(cdf\) of a random variable \(X\), denoted by \(F_X(x)\), is defined by
\[F_X(x) = P(X\leq x), \text{ for all } x \]
- Therefore, if \(X\) is a discrete random variable, the cdf is given by,
\[F_X(x)=\sum_{t\leq x}f_X(t), \text{ } -\infty < x<\infty\]
where \(f_X(t)\) is the value of the pmf of \(X\) at \(t\).
1.4.1 Relationship between cdf and pdf
- If \(X\) is a continuous random variable, the cdf is given by,
\[F_X(x) = \int_{-\infty}^{x}f_X(t)dt \text{ } -\infty <x<\infty\] where \(f_X(t)\) is the value of the pdf of \(X\) at \(t\). (Here \(t\) is a dummy integration variable).
- Conversely,
\[f_X(x)= \frac{dF_X(x)}{dx}\]
Example 06
An owner of a software engineering company is interested in knowing how many years his employees stay with his company. Let \(X\) be the number of years an employee will stay with the company. Over the years, he has established the following probability distribution:
| \(x\) | 1 | 2 | 3 | 4 | 5 | 6 | 7 |
|---|---|---|---|---|---|---|---|
| \(f_X(x) = P(X=x)\) | 0.1 | 0.05 | 0.1 | ? | 0.3 | 0.2 | 0.1 |
- Find \(f_X(4)\)

- Find \(P(X<4)\)

- Find \(P(X\leq4)\)

- Draw the probability mass function of \(X\)

- Draw the cumulative distribution function of \(X\)
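One way to check the answers to Example 06 is the short plain-Python sketch below: the missing mass \(f_X(4)\) follows from the requirement that the pmf sums to 1, and the cdf is the running total of the pmf.

```python
# Example 06: fill in the missing mass, then compute probabilities and the cdf.
pmf = {1: 0.1, 2: 0.05, 3: 0.1, 5: 0.3, 6: 0.2, 7: 0.1}
pmf[4] = 1 - sum(pmf.values())          # f_X(4) = 0.15

p_less_4 = sum(p for x, p in pmf.items() if x < 4)    # P(X < 4)
p_leq_4  = sum(p for x, p in pmf.items() if x <= 4)   # P(X <= 4)

cdf, total = {}, 0.0
for x in sorted(pmf):                   # F_X(x) = sum_{t <= x} f_X(t)
    total += pmf[x]
    cdf[x] = round(total, 2)
print(pmf[4], p_less_4, p_leq_4, cdf)
```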


1.4.2 Properties of a cumulative distribution function of a Discrete random variable
Example 07
\[\begin{equation} f_X(x) = \begin{cases} \frac{1}{25}x & 0\leq x < 5\\ \frac{2}{5}- \frac{1}{25}x & 5\leq x \leq 10\\ 0 & \text{otherwise} \end{cases} \end{equation}\]
- Find the CDF of \(X\)
- Find \(P(X\leq 8)\)
- Find \(P(3\leq X\leq 8)\)
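A numerical check of Example 07 can be done by direct integration. The sketch below assumes SciPy is installed; it verifies that the density integrates to 1 and evaluates the two requested probabilities.

```python
# Example 07: integrate the piecewise (triangular) pdf numerically.
from scipy.integrate import quad

def f_X(x):
    if 0 <= x < 5:
        return x / 25
    if 5 <= x <= 10:
        return 2 / 5 - x / 25
    return 0.0

total, _ = quad(f_X, 0, 10)             # should be 1
p_leq_8, _ = quad(f_X, 0, 8)            # P(X <= 8)       ~ 0.92
p_3_8, _ = quad(f_X, 3, 8)              # P(3 <= X <= 8)  ~ 0.74
print(total, p_leq_8, p_3_8)
```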
1.5 Expectations and Moments
1.5.1 Expectation
The expected value, or expectation of a random variable is merely its average value.
By weighting the values of the random variable according to the probability distribution, we can obtain a number that summarizes a typical or expected value of an observation of the random variable.
Definition 4: Expected value
Let \(X\) be a random variable. The expected value or mean of a random variable \(g(X)\), denoted by \(E[g(X)],\) is
\[\begin{equation} E[g(X)] = \begin{cases} \sum_{x}g(x) f_X(x) & \text{if } X \text{ is a discrete random variable with pmf } f_X(x) \\ \int_x g(x) f_X(x)dx & \text{if } X \text{ is a continuous random variable with pdf } f_X(x) \end{cases} \end{equation}\]
The mean of a random variable gives a measure of central location of the density of \(X\).
The process of taking expectations is a linear operation.
For any constants \(a\) and \(b\), \[E(aX+b) = aE(X)+b\]
1.5.1.1 Properties of expected value
Theorem
- \(E(c) = c\) for a constant \(c\)
- \(E[cg(X)] = cE[g(X)]\) for a constant \(c\)
- \(E[c_1g_1(X)+c_2g_2(X)]= c_1E[g_1(X)]+c_2E[g_2(X)]\)
- If \(g_1(x) \geq 0\) for all \(x\), then \(E[g_1(X)] \geq 0\)
- If \(g_1(x) \geq g_2(x)\) for all \(x\), then \(E[g_1(X)] \geq E[g_2(X)]\)
- If \(a\leq g_1(x) \leq b\) for all \(x\), then \(a\leq E[g_1(X)] \leq b\)
- If \(X\) and \(Y\) are two independent random variables, then \(E(X \times Y) = E(X)\times E(Y)\)
Example 08
Random variable \(X\) has the following pmf
\[\begin{equation} f_X(x) = \begin{cases} 0.2 & x =2\\ 0.3 & x =4\\ 0.4 & x =5\\ 0.1 & x =7 \end{cases} \end{equation}\]
- Find \(E(X)\)
- Find \(E(X^2)\)
- Find \(E\left( \frac{1}{X}\right)\)
- Find \(E(2X+3X^2-5)\)
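For a discrete random variable, each of these expectations is just a weighted sum over the pmf, as the short plain-Python sketch below for Example 08 illustrates.

```python
# Example 08: expectations of functions of a discrete random variable.
pmf = {2: 0.2, 4: 0.3, 5: 0.4, 7: 0.1}

def expect(g):
    """E[g(X)] = sum_x g(x) f_X(x) for a discrete X."""
    return sum(g(x) * p for x, p in pmf.items())

print(expect(lambda x: x))                    # E(X)          = 4.3
print(expect(lambda x: x**2))                 # E(X^2)        = 20.5
print(expect(lambda x: 1 / x))                # E(1/X)        ~ 0.269
print(expect(lambda x: 2*x + 3*x**2 - 5))     # E(2X+3X^2-5)  = 65.1
```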

1.5.2 Moments
- The various moments of a distribution are an important class of expectations.
Definition 5: Moments
If \(X\) is a random variable, the \(r\)th moment of \(X\), usually denoted by \(\mu_r^\prime,\) is defined as
\[\mu_r^\prime = E(X^r).\]
if the expectation exists.
- Note that \(\mu_1^\prime = E(X)= \mu,\) the mean of \(X\).
Definition 6: Central moments
If \(X\) is a random variable, the \(r\)th central moment of \(X\) about \(a\) is defined as \(E[(X-a)^r]\).
If \(a= E(X) = \mu,\) we have the \(r\)th central moment of \(X\), about \(E(X)\), denoted by \(\mu_r,\) which is \[\mu_r=E[(X-E(X))^r] = E[(X-\mu)^r] .\]
- Find \(\mu_1\)
Definition 7: Variance
If \(X\) is a random variable, \(Var(X)= E[(X-E(X))^2]= E(X^2) - [E(X)]^2\) provided \(E(X^2)\) exists.

The variance of a random variable \(X\) is its second central moment, \(Var(X) = E[(X-E(X))^2]= E[(X-\mu)^2]\)
The positive square root of \(Var(X)\) is the standard deviation of \(X\)
The variance of a random variable gives a measure of the degree of spread of a distribution around its mean.
Let \(X\) be a random variable, and let \(\mu\) be \(E(X).\) The variance of \(X\), denoted by \(\sigma^2\) or \(Var(X),\) is defined by
\[\begin{equation} Var(X) = \begin{cases} \sum_{x}(x-\mu)^2f_X(x) & \text{if } X \text{ is discrete with mass points } x_1, x_2, \dots, x_j,\dots \\ \int_x (x-\mu)^2 f_X(x)dx & \text{if } X \text{ is continuous with probability density function } f_X(x) \end{cases} \end{equation}\]
1.5.2.1 Properties of variance of a random variable
Theorem
- If \(c\) is a constant, then \(V(cX) = c^2V(X)\)
- \(V(c) = 0\), Variance of a constant is zero.
- If \(X\) is a random variable and \(c\) is a constant, then \(V(c+X) = V(X)\)
- If \(a\) and \(b\) are constants, then \(V(aX+b)= a^2V(X)\)
- If \(X\) and \(Y\) are two independent random variables, then
- \(V(X+Y) = V(X) + V(Y)\)
- \(V(X-Y) = V(X) + V(Y)\)

1.6 Models for Discrete Distributions
1.6.1 Discrete Uniform Distribution
A random variable \(X\) has a discrete uniform \((1, N)\) distribution if \[f_X(x)= P(X=x)=\frac{1}{N}, \;\; x=1,2,\dots, N\] where \(N\) is a specified integer.
The distribution puts equal mass on each of the outcomes \(1,2,\dots, N.\)
If \(X\) has a discrete uniform distribution, then \(E(X) = (N+1)/2\) and \(Var(X) = (N^2-1)/12.\)
1.6.2 Bernoulli Distribution
Bernoulli Trial
A random experiment whose outcome can be classified into one of two categories is called a Bernoulli trial.
In general, the results of a Bernoulli Trial are called ‘success’ and ‘failure’. We denote these results by \(S\) and \(F\), respectively.
Consider a Bernoulli trial. Let
\[\begin{equation} X= \begin{cases} 0 & \text{if the Bernoulli trial results in a failure } \\ 1 & \text{if the Bernoulli trial results in a success} \end{cases} \end{equation}\]
Suppose that the probability of a ‘success’ in any Bernoulli trial is \(\theta.\)
Then \(X\) is said to have a Bernoulli distribution with probability mass function
\(f_X(x)= P(X=x)=\theta^x(1-\theta)^{1-x}, \;\;x=0,1\)
This is denoted as \(X \sim Bernoulli(\theta).\)
If \(X\) has a Bernoulli distribution, then \(E(X) = \theta\) and \(Var(X) = \theta(1-\theta)\)
1.6.3 Binomial Distribution
- A random experiment with the following properties is called a ‘Binomial experiment’
The random experiment consists of a sequence of \(n\) trials, where \(n\) is fixed in advance of the random experiment.
Each trial can result in one of the same two possible outcomes: “success” \((S)\) or “failure” \((F)\)
The trials are independent. Therefore the outcome of any particular trial does not influence the outcome of any other trial.
The probability of “success” is the same for each trial. Let this probability be \(\theta.\)
Binomial distribution
- Consider a binomial experiment with \(n\) trials and probability \(\theta\) of a success.
A random variable \(X\) is defined to have a binomial distribution if the discrete density function of \(X\) is given by \[f_X(x)= P(X=x)= {n\choose x}\theta ^x(1-\theta)^{n-x} , \;\; x=0,1,2,\dots, n\]
This is denoted as \(X \sim Bin(n,\theta).\)
If \(X\) has a binomial distribution, then \(E(X) = n\theta\) and \(Var(X) = n\theta(1-\theta)\)
The binomial distribution reduces to the Bernoulli distribution when \(n=1.\)
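Binomial probabilities, means and variances can be evaluated directly in Python. The sketch below assumes SciPy is installed and uses illustrative values \(n = 10\), \(\theta = 0.3\) that are not taken from the notes.

```python
# Binomial model: pmf, cdf, mean and variance for illustrative n and theta.
from scipy.stats import binom

n, theta = 10, 0.3
X = binom(n, theta)
print(X.pmf(4))            # P(X = 4)
print(X.cdf(4))            # P(X <= 4)
print(X.mean(), X.var())   # n*theta = 3.0,  n*theta*(1-theta) = 2.1
```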
1.6.4 Geometric Distribution
Consider a sequence of independent Bernoulli trials whose probability of “success” for each trial is \(\theta\).
Let \(X= \text{Number of failures before the first success}\)
Then, \(X\) is said to have a Geometric distribution with parameter \(\theta.\)
The probability mass function is given by \[f_X(x)= P(X=x)= \theta (1-\theta)^x , \;\; x=0,1,2,\dots\]
This is denoted as \(X \sim Geometric(\theta).\)
If \(X\) has a geometric distribution, then \(E(X) = (1-\theta)/\theta\) and \(Var(X) = (1-\theta)/\theta^2\)
A random variable \(X\) that has a geometric distribution is often referred to as a discrete waiting-time random variable. It represents how long (in terms of number of failures) one has to wait for a “success.”
1.6.5 Negative Binomial Distribution
Consider a sequence of independent Bernoulli trials whose probability of “success” for each trial is \(\theta\).
Let \(X= \text{Number of failures before the rth success}\)
Then, \(X\) is said to have a Negative Binomial distribution with parameter \(\theta.\)
The probability mass function is given by \[f_X(x)= P(X=x)= {x+r-1\choose r-1} \theta^r (1-\theta)^x , \;\; x=0,1,2,\dots\]
This is denoted as \(X \sim negbin(r,\theta).\)
If \(X\) has a Negative Binomial distribution, then \(E(X) = r(1-\theta)/\theta\) and \(Var(X) = r(1-\theta)/\theta^2\)
If in the negative binomial distribution \(r=1\), then the negative binomial density specializes to the geometric density.
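The following hedged sketch (SciPy assumed) uses the same "failures before the \(r\)th success" convention as these notes: `scipy.stats.nbinom` counts failures, so \(r = 1\) reproduces the geometric pmf above. The value \(\theta = 0.25\) is illustrative.

```python
# Geometric and negative binomial in the failures-before-success convention.
from scipy.stats import nbinom

theta = 0.25
geom_like = nbinom(1, theta)     # Geometric(theta): failures before 1st success
negbin = nbinom(3, theta)        # r = 3 successes

print(geom_like.pmf(2))          # theta*(1-theta)^2 = 0.25*0.75^2
print(negbin.pmf(4))             # C(6,2)*theta^3*(1-theta)^4
print(negbin.mean(), negbin.var())   # r(1-theta)/theta, r(1-theta)/theta^2
```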
1.6.6 Hypergeometric Distribution
Suppose a population of size \(N\) has \(M\) individuals of a certain kind (“success”).
A sample of \(n\) items is taken from this population without replacement.
Let \(X\) be the number of successes in the sample.
Then, \(X\) is said to have a hypergeometric distribution.
The probability mass function is given by \[f_X(x)= P(X=x)= \frac{{M\choose x}{N-M\choose n-x}}{{N \choose n}} , \;\; x=0,1,2,\dots, n\]
Hypergeometric distribution can be used as a model for the number of “successes” in a sample of size \(n\) if the sampling is done without replacement from a relatively small population.
If \(X\) has a Hypergeometric distribution, then \(E(X) = n\frac{M}{N}\) and \(Var(X) = n\frac{M}{N}\left(1-\frac{M}{N}\right)\left(\frac{N-n}{N-1}\right)\)
1.6.7 Poisson Distribution
The Poisson distribution provides a realistic probability model for the number of events in a given period of time, space, region or length.
- Example:
- The number of fatal traffic accidents per week in a given city
- The number of emails per hour received by a large business
- The number of defects per unit of some material
- Poisson distribution is suitable if the following conditions hold.
- The numbers of events within non-overlapping time intervals are independent.
- Let \(t\) be a fixed time point. For a small time interval \(\delta t\), the probability of exactly one event happening in the interval \([t,t+\delta t]\) is approximately proportional to the length \(\delta t\) of the interval. i.e.,
\[\frac{P(\text{exactly one event in }[t,t+\delta t])}{\delta t} \to \text{a positive constant}\] as \(\delta t \to 0.\)
- Let \(t\) be a fixed time point. For a small time interval \(\delta t\), the probability of more than one event happening in the interval \([t,t+\delta t]\) is negligible. i.e.,
\[\frac{P(\text{more than one event in }[t,t+\delta t])}{\delta t} \to 0\] as \(\delta t \to 0.\)
- Let \(X\) be the number of events during a time interval.
- Suppose that the average number of events during the time interval of interest is \(\lambda (>0).\)
- Then, the distribution of \(X\) can be modeled by a Poisson Distribution with the probability mass function,
\[f_X(x)= P(X=x)= \frac{e^{-\lambda} \lambda^x}{x!} , \;\; x=0,1,2,\dots\]
- This is denoted as \(X \sim Poisson(\lambda).\)
If \(X\) has a Poisson distribution, then \(E(X) = \lambda\) and \(Var(X) = \lambda\)
The Poisson distribution can be used for counts of some sort of a given area, space, region, volume or length as well.
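Poisson probabilities are easy to evaluate numerically. The sketch below assumes SciPy is installed and uses an illustrative rate \(\lambda = 3\) events per interval, not a value from the notes.

```python
# Poisson model: pmf, tail probability, mean and variance.
from scipy.stats import poisson

lam = 3
X = poisson(lam)
print(X.pmf(0))                 # P(X = 0) = exp(-3)
print(X.pmf(2))                 # P(X = 2) = exp(-3)*3^2/2!
print(1 - X.cdf(4))             # P(X > 4)
print(X.mean(), X.var())        # both equal lambda
```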
Example 09
Phone calls arrive at a switchboard at an average rate of 2.0 calls per minute.
If the number of calls in any time interval follows the Poisson distribution, then
\(X =\) number of phone calls in a given minute. \(X \sim Poisson(\; )\)
\(Y =\) number of phone calls in a given hour. \(Y \sim Poisson ( \;)\)
\(W =\) number of phone calls in a 15-second interval. \(W \sim Poisson (\;)\)
1.7 Models for Continuous Distributions
1.7.1 Uniform Distribution
The continuous uniform distribution is defined by spreading mass uniformly over an interval \([a,b]\).
A random variable \(X\) is said to have a uniform distribution in \((a,b)\) if its probability density function is given by \[f_X(x) = \frac{1}{b-a};\; a\leq x\leq b\]
This is denoted as \(X\sim U(a,b)\) or \(X \sim Unif(a, b)\)
It is easy to check \(\int_a^bf(x) dx =1\).
We also have \[E(X) = \int_a^b\frac{x}{b-a}dx = \frac{a+b}{2}\]
\[Var(X) = \frac{(b-a)^2}{12}\]
Figure 1.3: Uniform probability density
1.7.2 Normal Distribution (Gaussian Distribution)
One commonly used bell shaped curve is called the normal distribution.
Many techniques used in applied statistics are based upon the normal distribution.
The normal distribution has two parameters, usually denoted by \(\mu\) and \(\sigma^2\), which are its mean and variance.
A random variable \(X\) is said to have a normal distribution with location parameter \(\mu\) and scale parameter \(\sigma\), if its probability density function is given by,
\[f_X(x)=f_X(x;\mu, \sigma)= \frac{1}{\sqrt{2\pi}\sigma}e^{-\frac{1}{2}\left(\frac{x-\mu}{\sigma}\right)^2};\;\; -\infty<x<\infty\]
This is denoted by \(X \sim N(\mu, \sigma^2)\).
The normal density function is symmetric around the location parameter \(\mu\).
The dispersion of the distribution depends on the scale parameter \(\sigma\).
If \(X\) is a normal random variable, \(E(X) = \mu\) and \(Var(X) = \sigma^2\)
Figure 1.4: Normal distribution for different \(\mu\) and \(\sigma\)
Figure 1.5: \(P(a< X < b) = \int_a^b f_X(x)dx\)
\[P(a<X<b)=\int_a^b \frac{1}{\sqrt{2\pi}\sigma}e^{-\frac{1}{2}\left(\frac{x-\mu}{\sigma}\right)^2}dx\]
- Evaluating this integral is somewhat tedious.
- When we calculate such normal probabilities manually, it is convenient to use a normal probability table.
1.7.2.1 Standard Normal Distribution
- Normal distribution with \(\mu =0\) and \(\sigma = 1\) is called the standard normal distribution.
- A random variable with standard normal distribution is usually denoted by \(Z\).
- The probability density function of standard normal distribution is denoted by \(\phi\)
- If \(Z\sim N(0,1)\), then
\[\phi_Z(z) = \frac{1}{\sqrt{2 \pi}}e^{-\frac{1}{2}z^2};\;\; -\infty< z< \infty\]
- Probabilities related to \(Z\) can be found by using the standard normal probability table.
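The table look-ups in Example 10 can also be reproduced with the standard normal cdf. The short sketch below assumes SciPy is installed.

```python
# Standard normal probabilities via the cdf (Phi).
from scipy.stats import norm

print(norm.cdf(0))                       # P(Z < 0) = 0.5
print(norm.cdf(2.02))                    # P(Z < 2.02)
print(1 - norm.cdf(0.95))                # P(Z > 0.95)
print(norm.cdf(2.41) - norm.cdf(0.94))   # P(0.94 < Z < 2.41)
```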
Example 10
Let \(Z\) be a standard normal random variable. Calculate probabilities given in table below.
| No. | Calculate this probability | Answer |
|---|---|---|
| 1 | \(P(Z < 0)\) | |
| 2 | \(P(Z < 2.02)\) | |
| 3 | \(P(Z > 0.95)\) | |
| 4 | \(P(Z > -1.48)\) | |
| 5 | \(P(Z < -1.76)\) | |
| 6 | \(P(Z < 1.7)\) | |
| 7 | \(P(Z < -0.33)\) | |
| 8 | \(P(0.94 < Z < 2.41)\) | |
| 9 | \(P(-2.41 < Z < -0.94)\) | |
| 10 | \(P(-2.96 < Z < 1.05)\) |
Example contd…
Cumulative Standard Normal Distribution

\(Z_\alpha\) Notation
- \(Z_\alpha\) denotes the value such that \(P(Z\geq Z_\alpha) = \alpha\)
- Here \(\alpha\) represents a probability.
- Therefore, \(0\leq \alpha \leq 1\)
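Since \(Z_\alpha\) is the upper \(\alpha\) quantile of the standard normal distribution, it can be read off the inverse cdf. A minimal sketch, assuming SciPy is available (the helper `z_alpha` is an illustrative name):

```python
# Z_alpha: the value z with P(Z >= z) = alpha.
from scipy.stats import norm

def z_alpha(alpha):
    """Upper alpha quantile of the standard normal distribution."""
    return norm.ppf(1 - alpha)

print(z_alpha(0.025))   # ~1.96
print(z_alpha(0.05))    # ~1.645
```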
Example 11
Find \(Z_{0.025}\)
Figure 1.6: \(P(a< X < b) = \int_a^b f_X(x)dx\)
Example 12
Find the following values
- \(Z_{0.01}\)
- \(Z_{0.05}\)
- \(Z_{0.9}\)
- \(Z_{0.975}\)
- \(Z_{0.85}\)
Example contd …
1.7.2.2 Calculation of Probabilities of Normal Distribution
- Suppose \(X \sim N(\mu, \sigma^2)\).
- Let \(Z = \frac{X-\mu}{\sigma}\).
- Then, \(Z \sim N(0,1)\)
- This result can be used to find probabilities of any normal distribution.
Example 13
Let \(X \sim N(10,4).\) Calculate \(P(X \geq 15)\)

Example 14
Calculate the probabilities in the table below
| No. | \(\mu\) | \(\sigma\) | Calculate this probability | Answer |
|---|---|---|---|---|
| 1 | 95 | 16 | \(P(104.92 \leq X \leq 115.16)\) | |
| 2 | 65 | 15 | \(P(X \leq 86.66)\) | |
| 3 | 96 | 20 | \(P( X < 86.3)\) | |
| 4 | 93 | 8 | \(P(91.24 \leq X \leq 109.34)\) | |
| 5 | 63 | 9 | \(P(65.55 < X < 76.61)\) | |
| 6 | 102 | 8 | \(P( X > 80.55)\) | |
| 7 | 79 | 18 | \(P(X < 131.15)\) | |
| 8 | 86 | 6 | \(P( X \leq 69.2)\) | |
| 9 | 85 | 2 | \(P(X < 86.46)\) | |
| 10 | 100 | 5 | \(P(X \leq 112.26)\) | |
| 11 | 58 | 10 | \(P(75.19 \leq X \leq 82.1)\) | 0.0348 |
| 12 | 49 | 7 | \(P(X \geq 48.52)\) | 0.5273 |
| 13 | 103 | 17 | \(P(73.97 \leq X \leq 138.28)\) | 0.9371 |
| 14 | 99 | 24 | \(P( X < 82.8)\) | 0.2498 |
| 15 | 52 | 10 | \(P( X \leq 53.58)\) | 0.5628 |
| 16 | 72 | 8 | \(P(70.45 < X \leq 93.5)\) | 0.5732 |
| 17 | 82 | 20 | \(P(48.14 < X < 99.49)\) | 0.7639 |
| 18 | 94 | 15 | \(P(91.93 \leq X \leq 98.55)\) | 0.1741 |
| 19 | 45 | 4 | \(P(42.36 \leq X \leq 50.59)\) | 0.6643 |
| 20 | 73 | 1 | \(P(X \geq 72.38)\) | 0.7324 |
Example 15
Calculate the quantiles \(k\) in the table below
| No. | \(\mu\) | \(\sigma\) | Calculate \(k\) such that | Answer |
|---|---|---|---|---|
| 1 | 85 | 6 | \(P( X <k)= 0.9936\) | |
| 2 | 97 | 23 | \(P( X <k)= 0.0694\) | |
| 3 | 77 | 5 | \(P( X >k)= 0.0002\) | |
| 4 | 93 | 12 | \(P( X >k)= 0.0023\) | |
| 5 | 67 | 3 | \(P( X <k)= 0.0197\) | |
| 6 | 59 | 5 | \(P( X >k)= 0.9756\) | |
| 7 | 94 | 13 | \(P( X >k)= 0.3228\) | |
| 8 | 51 | 4 | \(P( X <k)= 0.1515\) | |
| 9 | 49 | 10 | \(P( X >k)= 0.9693\) | |
| 10 | 61 | 13 | \(P( X <k)= 0.9946\) | |
| 11 | 69 | 14 | \(P( X <k)= 0.9357\) | |
| 12 | 85 | 5 | \(P( X>k)= 0.008\) | |
| 13 | 96 | 16 | \(P( X >k)= 0.0014\) | |
| 14 | 96 | 7 | \(P( X <k)= 0.2578\) | |
| 15 | 45 | 4 | \(P( X <k)= 0.2578\) |
1.7.2.3 Empirical Rule for Normal Distribution
Figure 1.7: Empirical Rule for Normal Distribution
Calculate the following probabilities: 1. \(P(|X-\mu|\leq \sigma)\) 2. \(P(|X-\mu|\leq 2\sigma)\) 3. \(P(|X-\mu|\leq 3\sigma)\)
Empirical Rule for Normal Distribution
Approximately 68% of the values in any normal distribution lie within one standard deviation, approximately 95% lie within two standard deviations and approximately 99.7% lie within three standard deviations from the mean.
The normal distribution is somewhat special as its two parameters \(\mu\) (the mean) and \(\sigma^2\) (the variance), provide us with complete information about the exact shape and location of the distribution.
Straightforward calculus shows that the normal distribution has its maximum at \(x=\mu\) and inflection points (where the curve changes from concave to convex) at \(\mu \pm\sigma.\)
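A quick numerical check of the empirical rule (SciPy assumed): the probabilities of falling within 1, 2 and 3 standard deviations of the mean.

```python
# Empirical rule: P(|X - mu| <= k*sigma) for k = 1, 2, 3.
from scipy.stats import norm

for k in (1, 2, 3):
    p = norm.cdf(k) - norm.cdf(-k)
    print(k, round(p, 4))               # ~0.6827, 0.9545, 0.9973
```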
1.7.3 Gamma Distribution
We come across many practical situations in which the variable of interest has a skewed distribution.
The gamma family of distributions is a flexible family of distributions on \([0,\infty)\) that yields a wide variety of skewed distributions.
A random variable \(X\) is said to have a gamma distribution with shape parameter \(\alpha\) and scale parameter \(\beta\) if its probability density function is given by
\[f_X(x;\alpha, \beta)= \frac{1}{\Gamma(\alpha)\beta ^ {\alpha}}x^{\alpha-1}e^{-x/\beta}, \;\; 0<x<\infty,\;\; \alpha >0, \;\;\beta >0\]
Here \(\Gamma(\alpha)\) is called the gamma function,
\[\Gamma(\alpha)=\int_0^{\infty}x^{\alpha-1}e^{-x}dx\]
- The gamma function satisfies many useful relationships, in particular:
- \(\Gamma(\alpha+1)=\alpha\Gamma(\alpha),\;\; \alpha >0\) (can be verified through integration by parts)
- \(\Gamma(1) =1\)
- For any positive integer \(n(>0)\), \(\Gamma(n)=(n-1)!\)
- \(\Gamma \left (\frac{1}{2} \right) = \sqrt{\pi}\)
When \(X\) has a gamma distribution with shape parameter \(\alpha\) and scale parameter \(\beta\), it is denoted as \(X \sim gamma(\alpha, \beta)\)
The parameter \(\alpha\) is known as the shape parameter, since it most influences the peakedness of the distribution.
The parameter \(\beta\) is called the scale parameter, since most of its influence is on the spread of the distribution.
Figure 1.8: Gamma density functions
If \(X\) has a gamma distribution with shape parameter \(\alpha\) and scale parameter \(\beta\), then
- \(E(X) = \alpha \beta\)
- \(Var(X) = \alpha \beta^2\)
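The following hedged sketch (SciPy assumed) uses the same shape/scale parameterisation as the notes, with \(\alpha\) as the shape and \(\beta\) as the scale; the values \(\alpha = 2\), \(\beta = 3\) are illustrative.

```python
# Gamma model in the shape/scale parameterisation of the notes.
from scipy.stats import gamma

alpha, beta = 2, 3
X = gamma(a=alpha, scale=beta)
print(X.mean(), X.var())                # alpha*beta = 6, alpha*beta^2 = 18
print(X.pdf(4))                         # density at x = 4
print(X.cdf(10))                        # P(X <= 10)
```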
1.7.4 Exponential Distribution
This distribution is often used to model lifetime of various items.
When the number of events in a time interval has a Poisson distribution, the length of time interval between successive events can be modeled by an exponential distribution.
A random variable \(X\) is said to have an exponential distribution with scale parameter \(\beta\), if its probability density function is given by
\[f_X(x; \beta)=\frac{1}{\beta}e^{-x/\beta},\;\; 0<x<\infty\]
- This is denoted by \(X \sim exponential(\beta)\)
Figure 1.9: Exponential density functions
Note that the exponential distribution is a special case of the gamma distribution.
It can be easily shown that \(X\sim exponential(\beta) \iff X\sim gamma(1, \beta)\)
- If \(X\) has an exponential distribution, then
- \(E(X) = \beta\)
- \(Var(X) = \beta^{2}\)
1.7.5 Beta Distribution
The beta family of distributions is a continuous family on \((0,1)\) indexed by two parameters.
The \(beta(\alpha, \beta)\) probability density function is
\[f_X(x; \alpha, \beta)= \frac{1}{B(\alpha, \beta)}x^{\alpha -1}(1-x)^{\beta-1},\;\;0<x<1,\; \alpha >0,\;\; \beta >0,\]
where \(B(\alpha, \beta)\) denotes the beta function, \[B(\alpha, \beta)=\int_0^1x^{\alpha -1}(1-x)^{\beta-1}dx.\]
The beta function is related to the gamma function through the following identity \[B(\alpha, \beta)= \frac{\Gamma(\alpha)\Gamma(\beta)}{\Gamma(\alpha+\beta)}\]
The beta distribution is one of the few common “named” distributions that give probability 1 to a finite interval, here taken to be \((0,1)\).
Therefore, the beta distribution is often used to model proportions, which naturally lie between 0 and 1.
Figure 1.10: Beta density functions
1.8 Approximations
1.8.1 Poisson approximation to Binomial
Suppose \(X\sim Bin(n, \theta)\), where \(n\) is large and \(\theta\) is small. Then \(X\) is approximately \(Poisson(n\theta)\) and
\[f_X(x) \approx \frac{e^{-n\theta} (n\theta)^x}{x!}\]
Example 16
Suppose \(X\) has a binomial distribution with \(n=40\) and \(p=0.005\). Find \(f_X(1)\)
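One way to check Example 16 numerically (SciPy assumed) is to compare the exact binomial value with its Poisson approximation, using \(\lambda = n\theta = 0.2\).

```python
# Example 16: exact binomial pmf versus its Poisson approximation.
from scipy.stats import binom, poisson

n, theta = 40, 0.005
print(binom.pmf(1, n, theta))           # exact f_X(1)          ~ 0.1645
print(poisson.pmf(1, n * theta))        # Poisson approximation ~ 0.1637
```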
1.8.2 Normal approximation to Binomial
Suppose \(X\sim Bin(n, \theta)\), where \(n\theta \geq 5\) and \(n(1-\theta) \geq 5.\) Then \(X\) is approximately \(N(n\theta, n\theta(1-\theta))\) and
\[P(X \leq x)_{Binomial} \approx P(X \leq x + 0.5)_{Normal} = P\left(Z \leq \frac{x+0.5-n\theta}{\sqrt{n\theta(1-\theta)}}\right)\]
Note: Since we are approximating a discrete distribution by a continuous distribution, we should apply a continuity correction
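The continuity correction is easy to see numerically. The hedged sketch below (SciPy assumed) uses illustrative values \(n = 50\), \(\theta = 0.4\), not values from the notes, and compares the exact binomial cdf with the corrected normal approximation.

```python
# Normal approximation to the binomial with the continuity correction.
from math import sqrt
from scipy.stats import binom, norm

n, theta, x = 50, 0.4, 18
mu, sigma = n * theta, sqrt(n * theta * (1 - theta))

exact = binom.cdf(x, n, theta)                 # P(X <= 18), exact
approx = norm.cdf((x + 0.5 - mu) / sigma)      # with continuity correction
print(exact, approx)                           # the two should be close
```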
Example 17
Suppose \(X\) has a binomial distribution with \(n=40\) and \(p=0.6.\) Find
- \(P(X\leq 20)\)
- \(P(X < 25)\)
- \(P(X > 15)\)
- \(P(X \geq 20)\)
- \(P(X = 30)\)
1.8.3 Normal approximation to Poisson
Suppose \(X\sim Poisson(\lambda)\), where \(\lambda > 10.\) Then \(X\) is approximately \(N(\lambda, \lambda)\) and
\[P(X \leq x)_{Poisson} \approx P(X \leq x + 0.5)_{Normal} = P\left(Z \leq \frac{x+0.5-\lambda}{\sqrt{\lambda}}\right)\]
Note: Since we are approximating a discrete distribution by a continuous distribution, we should apply a continuity correction
Example 18
Suppose \(X\) has a Poisson distribution with \(\lambda=25\)
- \(P(X\leq 20)\)
- \(P(X < 25)\)
- \(P(X > 15)\)
- \(P(X \geq 20)\)
- \(P(X = 30)\)
1.9 Distribution of Functions of Random Variables
- Distribution of the linear transformation of a normal random variable
Suppose that \(X \sim N(\mu, \sigma^2).\) Let \(Y= aX+b,\) where \(a\) and \(b\) are constants. Then
\[Y\sim N(a\mu +b, a^2\sigma^2)\]
- Standardization of a normal random variable
Suppose that \(X \sim N(\mu, \sigma^2).\) Let \(Z= \frac{X-\mu}{\sigma}.\) Then
\[Z\sim N(0,1).\]
- Distribution of the square of a standard normal random variable
Suppose that \(Z \sim N(0,1).\) Let \(Y= Z^2.\) Then
\[Y\sim \chi^2_1\]
1.10 Distribution of Sum of Independent Random Variables
- Distribution of sum of i.i.d Bernoulli random variables
Suppose that \(X_1, X_2, \dots, X_n\) are independent, identically distributed (i.i.d) random variables with \(Bernoulli(\theta)\) distribution. Let \(Y= X_1+X_2+\dots +X_n.\) Then
\[Y\sim Bin(n, \theta); \;\;y=0,1,2,\dots, n.\]
- Distribution of sum of i.i.d Poisson random variables
Suppose that \(X_1, X_2, \dots, X_n\) are independent, identically distributed (i.i.d) random variables with \(Poisson(\lambda)\) distribution. Let \(Y= X_1+X_2+\dots +X_n.\) Then
\[Y\sim Poisson(n\lambda); \;\;y=0,1,2,\dots\]
- Distribution of sum of independent Poisson random variables
Suppose that \(X_1, X_2, \dots, X_n\) are independent random variables with \(Poisson(\lambda_i), \; i=1,2,\dots, n.\) Let \(Y= X_1+X_2+\dots +X_n.\) Then
\[Y\sim Poisson \left(\sum_{i=1}^n\lambda_i\right); \;\;y=0,1,2,\dots\]
- Distribution of sum of i.i.d Geometric random variables
Suppose that \(X_1, X_2, \dots, X_n\) are independent, identically distributed (i.i.d) random variables with \(Geometric(\theta)\) distribution. Let \(Y= X_1+X_2+\dots +X_n.\) Then
\[Y\sim Neg.bin(n, \theta); \;\;y=0,1,2,\dots\]
- Distribution of sum of independent Normal random variables
Suppose that \(X_1, X_2, \dots, X_n\) are independent random variables with \(X_i\sim N(\mu_i, \sigma_i^2); \; i=1,2,\dots, n\). Let \(Y= X_1+X_2+\dots +X_n.\) Then
\[Y\sim N\left(\sum_{i=1}^n \mu_i,\sum_{i=1}^n \sigma_i^2 \right).\]
- Distribution of sum of i.i.d exponential random variables
Suppose that \(X_1, X_2, \dots, X_n\) are independent, identically distributed (i.i.d) random variables with \(X_i\sim exp(\lambda);\; i=1,2,\dots, n\). Let \(Y= X_1+X_2+\dots +X_n.\) Then
\[Y \sim gamma(n, \lambda)\]
- Distribution of sum of independent gamma random variables
Suppose that \(X_1, X_2, \dots, X_n\) are independent random variables with \(X_i\sim gamma(\alpha_i, \lambda);\; i=1,2,\dots, n\). Let \(Y= X_1+X_2+\dots +X_n.\) Then
\[Y \sim gamma\left(\sum_{i=1}^n\alpha_i, \lambda\right).\]
1.11 Sampling Distribution
- Distribution of sample mean of a normal distribution
Suppose that \(X_1, X_2, \dots, X_n\) is a random sample from a normal distribution with mean \(\mu\) and variance \(\sigma^2\). Let \(\bar{X} = \frac{X_1+X_2+\dots + X_n}{n}\) be the sample mean. Then,
\[\bar{X} \sim N\left(\mu, \frac{\sigma^2}{n}\right)\]
- Distribution of sample variance of a normal distribution
Suppose that \(X_1, X_2, \dots, X_n\) is a random sample from a normal distribution with mean \(\mu\) and variance \(\sigma^2\). Let \(\bar{X} = \frac{X_1+X_2+\dots + X_n}{n}\) be the sample mean and \(S^2 = \frac{1}{n-1}\sum_{i=1}^n(X_i-\bar{X})^2\) be the sample variance. Then,
\[\frac{(n-1)S^2}{\sigma^2} \sim \chi^2_{n-1}.\]
- Large sample distribution of the sample mean (Central Limit Theorem)
Suppose that \(X_1, X_2, \dots, X_n\) is a random sample from any distribution with mean \(\mu\) and variance \(\sigma^2\). Let \(\bar{X} = \frac{X_1+X_2+\dots + X_n}{n}\) be the sample mean and \(Z_n = \frac{\bar{X}-\mu}{\sigma/\sqrt{n}}.\)
Then, the distribution of \(Z_n\) approaches the standard normal distribution as \(n\) approaches \(\infty.\)
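A small Monte Carlo sketch of the Central Limit Theorem (NumPy assumed; the exponential(1) parent distribution and the sample sizes are illustrative choices): standardized means of a skewed sample behave like a standard normal variable for large \(n\).

```python
# CLT by simulation: standardized means of exponential(1) samples.
import numpy as np

rng = np.random.default_rng(1)
mu, sigma, n, reps = 1.0, 1.0, 100, 10_000     # exponential(1): mean = sd = 1

xbar = rng.exponential(scale=1.0, size=(reps, n)).mean(axis=1)
z = (xbar - mu) / (sigma / np.sqrt(n))         # Z_n = (Xbar - mu)/(sigma/sqrt(n))
print(z.mean(), z.std())                       # ~0 and ~1
print(np.mean(np.abs(z) <= 1.96))              # ~0.95, as for N(0,1)
```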
Tutorial
- Consider the experiment of taking two products randomly from a production line and determining whether each is defective or not. Express the following events using a suitably defined random variable.
\(D_0=\) The event that both products are non-defective
\(D_1=\) The event that one product is defective
\(D_2=\) The event that both products are defective
\(E=\) The event that at least one product is defective
- Let X be a random variable with the following probability distribution
| \(x\) | 1 | 1.5 | 2 | 2.5 | 3 | other |
|---|---|---|---|---|---|---|
| \(f_X(x)\) | \(k\) | \(2k\) | \(4k\) | \(2k\) | \(k\) | 0 |
- Find the value of \(k\)
- Find \(P(X = 2.5)\)
- Calculate \(P (X \geq 1.75)\)
- The sample space of a random experiment is \(\{a, b, c, d, e, f\}\), and each outcome is equally likely. A random variable \(X\) is defined as follows:
| Outcome | \(a\) | \(b\) | \(c\) | \(d\) | \(e\) | \(f\) |
|---|---|---|---|---|---|---|
| \(x\) | 0 | 0 | 1.5 | 1.5 | 2 | 3 |
Determine the probability mass function of \(X\). Use the probability mass function to determine the following probabilities:
- \(P(X = 1.5)\)
- \(P(0.5 < X < 2.7)\)
- \(P(X > 3)\)
- \(P(0 \leq X < 2)\)
- \(P(X = 0 \text{ or } X = 2)\)
- Verify that the following function is a probability mass function, and determine the requested probabilities.
\[f_X(x)=\frac{8}{7} \left(\frac{1}{2} \right)^x, \;\; x= 1,2,3\]
- \(P(X \leq 1)\)
- \(P(X > 1)\)
- \(P (2 < X < 6)\)
- \(P(X \leq 1 \text{ or } X > 1)\)
- A disk drive manufacturer sells storage devices with capacities of one terabyte, 500 gigabytes, and 100 gigabytes with probabilities 0.5, 0.3, and 0.2, respectively. The revenues associated with the sales in that year are estimated to be $50 million, $25 million, and $10 million, respectively. Let \(X\) denote the revenue of storage devices during that year.
- Determine the probability mass function of \(X\).
- Calculate the probability of getting more than $20 million of revenue during that year.
- Determine the cumulative distribution function of \(X\)
- Consider the following cumulative distribution function:
\[\begin{equation} F_X(x) = \begin{cases} 0 & x<-2\\ 0.2 & -2 \leq x< 0\\ 0.7 & 0\leq x<2\\ 1 & 2 \leq x \end{cases} \end{equation}\]
- Draw the plot of \(F_X(x)\)
- Discuss the properties of \(F_X(x)\) (eg: whether it is discrete or continuous, whether it is decreasing or increasing function etc. )
- Determine the probability mass function of \(X\) from the above cumulative distribution function:
- Plot the probability mass function of \(X\)
- The number of e-mail messages received per hour varies from 10 to 16 with the following probabilities
| \(x =\) number of messages | 10 | 11 | 12 | 13 | 14 | 15 | 16 |
|---|---|---|---|---|---|---|---|
| \(P(X=x)\) | 0.08 | 0.15 | 0.1 | 0.2 | 0.1 | 0.07 | 0.3 |
- Let \(X\) be the number of e-mail messages received per hour. Find the probability mass function of X
- Determine the mean and standard deviation of the number of messages received per hour
- According to past data, twenty percent of all telephones of a certain type are submitted for service while under warranty. Of these, \(60\%\) can be repaired whereas the other \(40\%\) must be replaced with new units. If a company purchases ten of these telephones, what is the probability that
- two telephones will be submitted for service under warranty?
- at most 3 telephones will be submitted for service under warranty?
- two telephones will end up being replaced under warranty?
- one telephone will end up being repaired under warranty?
- Each sample of water has a \(10\%\) chance of containing a particular organic pollutant. Assume that the samples are independent with regard to the presence of the pollutant. Find the probability that in the next 18 samples, exactly 2 contain the pollutant
- Define a suitable random variable for the above question
- Find the distribution of that random variable
- Find the probability that in the next 18 samples, exactly 2 contain the pollutant
- The space shuttle flight control system called Primary Avionics Software Set (PASS) uses four independent computers working in parallel. At each critical step, the computers “vote” to determine the appropriate step. The probability that a computer will ask for a roll to the left when a roll to the right is appropriate is 0.0001. Let \(X\) denote the number of computers that vote for a left roll when a right roll is appropriate.
- What is the probability mass function of \(X\)?
- What are the mean and variance of \(X\)?
- A University lecturer never finishes his lecture before the end of the hour and always finishes his lectures within 2 minutes after the hour. Let \(X\) = the time that elapses between the end of the hour and the end of the lecture and suppose the pdf of \(X\) is
\[\begin{equation} f_X(x) = \begin{cases} kx^2 & 0\leq x\leq 2\\ 0 & otherwise \end{cases} \end{equation}\]
- Find the value of \(k\) and draw the density curve.
- What is the probability that the lecture ends within 1 min of the end of the hour?
- What is the probability that the lecture continues beyond the hour for between 60 and 90 sec?
- What is the probability that the lecture continues for at least 90 sec beyond the end of the hour?
- The daily sales of gasoline are uniformly distributed between 2,000 and 5,000 gallons. Find the probability that sales are:
- between 2,500 and 3,000 gallons
- more than 4000 gallons
- exactly 2500 gallons
- Suppose \(X\) has a continuous uniform distribution over the interval \([-1,1]\). Determine the following:
- Mean, variance, and standard deviation of \(X\)
- Value for \(k\) such that \(P(-k < X < k ) = 0.90\)
- Cumulative distribution function
- Suppose that the time it takes a data collection operator to fill out an electronic form for a database is uniformly distributed between 1.5 and 2.2 minutes.
- What are the mean and variance of the time it takes an operator to fill out the form?
- What is the probability that it will take less than two minutes to fill out the form?
- Determine the cumulative distribution function of the time it takes to fill out the form.
- An electronic product contains 40 integrated circuits. The probability that any integrated circuit is defective is 0.01, and the integrated circuits are independent. The product operates only if there are no defective integrated circuits. What is the probability that the product operates?
- A bag of 200 chocolate chips is dumped into a batch of cookie dough. 40 cookies are made from such a batch of dough. What is the probability that a randomly selected cookie has at least 4 chocolate chips?
- For the case of the thin copper wire, suppose that the number of flaws follows a Poisson distribution with a mean of 2.3 flaws per millimeter.
- Determine the probability of exactly two flaws in 1 millimeter of wire.
- Determine the probability of 10 flaws in 5 millimeters of wire.
- Contamination is a problem in the manufacture of magnetic storage disks. Assume that the number of particles of contamination that occur on a disk surface has a Poisson distribution, and the average number of particles per square centimeter of media surface is 0.1. The area of a disk under study is 100 square centimeters.
- Determine the probability that 12 particles occur in the area of a disk under study.
- Determine the probability that zero particles occur in the area of the disk under study.
- Determine the probability that 12 or fewer particles occur in the area of the disk under study.
- The number of surface flaws in plastic panels used in the interior of automobiles has a Poisson distribution with a mean of 0.05 flaw per square foot of plastic panel. Assume that an automobile interior contains 10 square feet of plastic panel.
- What is the probability that there are no surface flaws in an auto’s interior?
- If 10 cars are sold to a rental company, what is the probability that none of the 10 cars has any surface flaws?
- If 10 cars are sold to a rental company, what is the probability that at most 1 car has any surface flaws?
- Cabs pass your workplace according to a Poisson process with a mean of five cabs per hour. Suppose that you exit the workplace at 6:00 p.m. Determine the following:
- Probability that you wait more than 10 minutes for a cab.
- Probability that you wait fewer than 20 minutes for a cab.
- A person must take two buses to go to work. From past experience, he knows that a bus can come at any time within 6 minutes. Also, the probability that a bus comes within any period of the same length is the same. Hence, it is reasonable to assume the following probability density function for the waiting time \(X\) (in minutes) for a bus
\(f_X(x) = \frac{1}{6}, \;\;0<x<6.\)
Then the total waiting time \(T\) at both bus-stops has the following density function
\[\begin{equation} f_T(t) = \begin{cases} \frac{t}{36} & 0\leq t \leq 6\\ \frac{1}{3}- \frac{t}{36} & 6\leq t \leq 12 \end{cases} \end{equation}\]
- Verify that each of the above functions is a proper density function
- What is the probability that the waiting time at the first bus-stop will be less than 2 minutes?
- What is the probability that the waiting time at the first bus-stop will be less than 2 minutes and the waiting time at the second
Summary
Models for Discrete Distributions
| Name | Remarks | Probability mass function | values of \(X\) | Parameter Space | Mean | Variance |
|---|---|---|---|---|---|---|
| Discrete Uniform | Outcomes that are equally likely (finite) | \(f(x) = \frac{1}{N}\) | \(x=1,2,\dots, N\) | \(N=1,2,\dots\) | \(\frac{N+1}{2}\) | \(\frac{N^2-1}{12}\) |
| Bernoulli | Bernoulli trial | \(f(x) = \theta^x (1-\theta)^{1-x}\) | \(x=0,1\) | \(0\leq \theta \leq 1\) | \(\theta\) | \(\theta (1-\theta)\) |
| Binomial | \(X=\) Number of successes in \(n\) fixed trials | \(f_X(x) = {n\choose x}\theta ^x(1-\theta)^{n-x}\) | \(x=0,1,2,\dots, n\) | \(0\leq \theta \leq 1; \;\;\; \newline n =1,2,3,\dots\) | \(n\theta\) | \(n\theta(1-\theta)\) |
| Geometric | \(X=\) Number of failures before the first success | \(f_X(x)= \theta (1-\theta)^x\) | \(x=0,1,2,\dots\) | \(0\leq \theta \leq 1\) | \(\frac{1-\theta}{\theta}\) | \(\frac{1-\theta}{\theta^2}\) |
| Negative Binomial | \(X=\) Number of failures before the \(r\)th success | \(f_X(x)= {x+r-1\choose r-1} \theta^r (1-\theta)^x\) | \(x=0,1,2,\dots\) | \(0\leq \theta \leq 1; \;\; r > 0\) | \(\frac{r(1-\theta)}{\theta}\) | \(\frac{r(1-\theta)}{\theta^2}\) |
| Hypergeometric | \(X=\) Number of successes in the sample taken without replacement | \(f_X(x)= \frac{{M\choose x}{N-M\choose n-x}}{{N \choose n}}\) | \(x=0,1,2,\dots, n\) | \(N = 1,2,\dots;\;\;\;\;\newline M= 0,1,\dots,N; \;\;\;\; \newline n=1,2,\dots,N\) | \(n\frac{M}{N}\) | \(n\frac{M}{N}\left(1-\frac{M}{N} \right)\left(\frac{N-n}{N-1}\right)\) |
| Poisson | \(X=\) number of events in a given period of time, space, region or length | \(f_X(x)= \frac{e^{-\lambda} (\lambda)^x}{x!}\) | \(x=0,1,2,\dots\) | \(\lambda > 0\) | \(\lambda\) | \(\lambda\) |
Models for Continuous Distributions
| Name | Probability density function | Values of \(X\) | Parameter Space | Mean | Variance |
|---|---|---|---|---|---|
| Uniform | \(f_X(x) = \frac{1}{b-a}\) | \(a\leq x\leq b\) | \(-\infty<a<b<\infty\) | \(\frac{a+b}{2}\) | \(\frac{(b-a)^2}{12}\) |
| Normal (Gaussian) | \(f_X(x)=\frac{1}{\sqrt{2\pi}\sigma}e^{-\frac{1}{2}\left(\frac{x-\mu}{\sigma}\right)^2}\) | \(-\infty<x<\infty\) | \(-\infty<\mu<\infty;\;\; \sigma >0\) | \(\mu\) | \(\sigma^2\) |
| Gamma | \(f_X(x)= \frac{1}{\Gamma(\alpha)\beta ^ {\alpha}}x^{\alpha-1}e^{-x/\beta}\) | \(0<x<\infty\) | \(\alpha >0; \;\;\beta >0\) | \(\alpha\beta\) | \(\alpha\beta^2\) |
| Exponential | \(f_X(x)=\frac{1}{\beta}e^{-x/\beta}\) | \(0<x<\infty\) | \(\beta >0\) | \(\beta\) | \(\beta^2\) |
| Beta | \(f_X(x)= \frac{1}{B(\alpha, \beta)}x^{\alpha -1}(1-x)^{\beta-1}\) | \(0<x<1\) | \(\alpha >0;\;\; \beta >0\) | \(\frac{\alpha}{\alpha+\beta}\) | \(\frac{\alpha\beta}{(\alpha+\beta+1)(\alpha+\beta)^2}\) |