Lecture 21
Duke University
STA 101 - Fall 2023
Inference for comparing two population means
Specifically, making decisions via hypothesis tests
And generally, what in the world is a hypothesis test?!
Why is it conducted?
How is it conducted?
What does the p-value mean?
What does it mean to reject or fail to reject a null hypothesis?
What are testing errors?
What is the power of the test?
Combine the data from the two groups and randomly shuffle them into two groups of sizes equal to the original group sample sizes
Calculate the means of each group and record the different
Repeat steps 1 and 2 many times to build the null distribution
Find the p-value as the number of simulations with simulated differences at least as extreme (in the direction of the alternative hypothesis) as the observed difference
Every year, the United States Department of Health and Human Services releases to the public a large dataset containing information on births recorded in the country. This dataset has been of interest to medical researchers who are studying the relation between habits and practices of expectant mothers and the birth of their children. In this case study we work with a random sample of 1,000 cases from the dataset released in 2014. The distributions of birth weights of babies, measured in pounds, by mother’s smoking habit are shown below.
Go to Posit Cloud and continue the project titled ae-16-Birth weights
.