4 - Basic Probability Continued Continued

Alex John Quijano

09/24/2021

Previously on Probability…


Last session, we discussed about:

Basic Probability Continued Continued


In this lecture, we will learn about,

Random Variable (R.V.)

A Random Variable (R.V.) is a type of variable where the value is a function that associates a numerical value with a potential outcome.

Tossing a Fair Coin THREE Times

Discrete R.V. vs Continuous R.V.

Discrete R.V. corresponds to
Probability Mass Functions (PMF)

Example:

Continuous R.V. corresponds to
Probability Density Functions (PDF)

Example:

Some Examples of Discrete Random Variables

Examples of Continuous Random Variables

Unemployment Rates (1/7)

Unemployment Rates (2/7)

Unemployment Rates (3/7)

Using the PDF, we can compute probabilities of unemployment rates.

Unemployment Rates (4/7)

Using the PDF, we can compute probabilities of unemployment rates.

The probability of unemployment rate of exacty 5.75 is \[P(X = 5.75) = 0.162\]

Unemployment Rates (5/7)

Using the PDF, we can compute probabilities of unemployment rates.

The probability of unemployment rate for at least 5.75 is \[P(X \ge 5.75) = 0.211\]

Note: We are taking the “area under the curve” here to compute the probability of the union of a disjoint interval.

Unemployment Rates (6/7)

Using the PDF, we can compute probabilities of unemployment rates.

The probability of unemployment rate for at most 5.75 is \[P(X \le 5.75) = 0.789\]

Note: We are taking the “area under the curve” here to compute the probability of the union of a disjoint interval.

Unemployment Rates (7/7)

Using the PDF, we can compute probabilities of unemployment rates.

The probability of unemployment rate between 2.75 and 5.75 is \[P(2.75 \le X \le 5.75) = 0.693\]

Note: We are taking the “area under the curve” here to compute the probability of the union of a disjoint interval.

The Mean of the Unemployment Rates

Resampling (1/2)

Image Source: [Bootstrapping Statistics.](https://towardsdatascience.com/bootstrapping-statistics-what-it-is-and-why-its-used-e2fa29577307){target=_blank}

Image Source: Bootstrapping Statistics.

Resampling (2/2)

Image Source: [Bootstrapping Statistics by Trist'n Joseph.](https://towardsdatascience.com/bootstrapping-statistics-what-it-is-and-why-its-used-e2fa29577307){target=_blank}

Image Source: Bootstrapping Statistics by Trist’n Joseph.

Resampling 1000 times (1/2)

Original Sample

Resamples - Distribution of Means

Resampling 1000 times (2/2)

Resamples - Distribution of Means

The Central Limit Theorem



The Central Limit Theorem says that regardless of the underlying distribution, the sampling distribution of the mean of any independent, random variable will be normal or near normal.

Hypothesis Testing Preview

Suppose that you want to know the difference in mean scores for two exams A and B.

Apply Some Resampling and Randomization Method

Summary

In this lecture, we talked about the following:

In the next lecture, we will talk about,

Today’s Activity


Sorry, no activity today. :-(