Alex John Quijano
10/01/2021
In the previous lecture, we learned about the following:
Examples of the null and alternative hypothesis using difference in proportions.
Basic ideas of the p-value and statistical significance.
In this lecture, we will learn about:
A hypothesis test is a statistical technique used to evaluate competing claims using data.
Null Hypothesis (\(H_0\)): A statement about a population parameter. We test the probability of this statement being true to decide whether to accept or reject. This hypothesis can include the \(=\), \(\ge\), or \(\le\) signs.
Alternative Hypothesis (\(H_A\)): A statement that contradicts the null hypothesis. We determine the probability of this statement being true based on the likelihood of the null hypothesis being true. This hypothesis can include \(\ne\), \(>\), or \(<\) signs.
There are two possible outcomes of the hypothesis test:
Reject \(H_0\): If the p-value is less than the significance level, then we reject the null hypothesis. Then, we have enough evidence to support \(H_A\).
Fail to Reject \(H_0\): If the p-value is greater than or equal to the significance level, then we fail to reject the null hypothesis. This does not mean the the null hypothesis is true.
Making statistical decisions means that you have to deal with uncertainties.
Image Source: Statistical Performance Measures by Neeraj Kumar Vaid
This meme might be over used. If you find some memes similar to this but in “non-pregnancy” context, let me know.
What does this all mean? When the p-value is small, i.e., less than a previously set threshold (\(\alpha\)), we say the results are statistically significant. The value of \(\alpha\) represents how rare an event needs to be in order for the null hypothesis to be rejected. The \(\alpha\) also represents the probability of committing a type I error.
Reality/Decision | Reject \(H_0\) | Fail to reject \(H_0\) |
---|---|---|
\(H_0\) is true | Type I error with probability \(\alpha\) (significance level) |
Correct decision with probability \(1-\alpha\) (confidence level) |
\(H_0\) is false | Correct decision with probability \(1-\beta\) (power of test) |
Type II error with probability \(\beta\) |
Images Source: Type I and Type II errors by Pritha Bhandari
Note: Making a Type I error does not necessarily mean something was wrong with the data or that we made a computational mistake. Sometimes data can point us to the wrong conclusion. Scientific studies are often repeated to check initial findings. This is why reproducibility in Science is important!
The significance value \(\alpha\) is the probability of making a Type I error.
The power of the test \(1-\beta\) is the probability of rejecting the null claim when the alternative claim is true.
Based on the incorrect conclusion that the new drug intervention is effective, over a million patients are prescribed the medication, despite risks of severe side effects and inadequate research on the outcomes. The consequences of this Type I error also mean that other treatment options are rejected in favor of this intervention.
If a Type II error is made, the drug intervention is considered ineffective when it can actually improve symptoms of the disease. This means that a medication with important clinical significance doesn’t reach a large number of patients who could tangibly benefit from it.
Examples Source: Type I and Type II errors by Pritha Bhandari
In this lecture we talked about:
Decision errors: Type I and Type II errors.
The significance level in the context of the probability of making a Type I error.
The power of the test.
In the next lectures, we will talk about:
One-sided vs Two-sided hypothesis test.
An introduction to confidence intervals and Randomization vs Bootstrapping.
Mathematical details of the standard normal distribution. (Please read your textbook Chapter 13) Put on your math masks!
Within your group, discuss the answers for the following problem.
Testing for food safety. A food safety inspector is called upon to investigate a restaurant with a few customer reports of poor sanitation practices. The food safety inspector uses a hypothesis testing framework to evaluate whether regulations are not being met. If he decides the restaurant is in gross violation, its license to serve food will be revoked. OpenIntro: IMS Section 14.6
Write the hypotheses in words.
What is a Type I Error in this context?
What is a Type II Error in this context?
Which error is more problematic for the restaurant owner? Why?
Which error is more problematic for the diners? Why?
As a diner, would you prefer that the food safety inspector requires strong evidence or very strong evidence of health concerns before revoking a restaurant’s license? Explain your reasoning.