5.3 Chapter 5 Major Takeaways
There are two major ways that randomness typically plays a role in the data we observe:
- If we randomly choose a sample from the population
- If we randomly assign individuals to a treatment group in an experiment
If we do random sampling, then we need to consider how our numeric summaries (estimated slopes, estimated odds ratios) might vary if we had gotten a different sample from the population. This variation gives a sense of how uncertain we should be in making conclusions about the population from our sample. If it varies quite a lot, then we are more uncertain about the population. If it does not vary much, then we can be more certain about the information we have about the population.
- We can get an estimate of this sample variation through bootstrapping from our sample (resampling from our sample).
If we randomly assign individuals to a treatment group, then we need to consider how much the differences in the groups might vary if we had a different random assignment to groups.
- We can assume there is no real difference in the groups and then reshuffle our treatment group labels to get an estimate of this random variation and then compare our observed value to those values generated from assuming no real difference in the groups.