Study Guide for Exam 2
This will be a closed-book exam. You will be allowed to use R on a school computer for computations. This means you will not have access to, nor will you need to use, anything beyond Base R (e.g. mosaic, MUsaic, etc.) for the exam.
To do well on the exam, you should be able to do the following:
Chapter 9
The idea of probability
- Explain why we need to talk about chance when we consider a a sample from a simple random sample sampling design.
- Define probability in terms of a proportion related to long-run frequencies.
Probability rules
- State the range of values that a probability can take.
- Recognize when a number can and cannot be a probability.
- Relate the probability that an event occurs to the probability that the event does not occur.
- Characterize the probability of an event between 0 and 1 in terms of whether the event will:
- always occur
- never occur
- sometimes occur
Random variables
- Explain why some variables are considered “random” in statistics, and explain in what sense a random variable is a “number that could have been otherwise.”
- Give examples of random variables that result from simple random sampling.
- Recognize and use the notation of an upper case letter (like \(X, Y,\) or \(Z\)) for a random variable, and a lower case letter (like \(x, y,\) or \(z\)) for a particular value the random variable could take.
- Given the definition of a random variable, determine whether the random variable is discrete or continuous.
Discrete versus continuous probability models
- State the properties of a probability distribution for a discrete random variable, and identify whether a given table can or cannot be a probability distribution.
- Use a probability distribution to answer probability queries such as \(P(X > a)\), \(P(a < X < b)\), \(P(X \leq a)\), etc., for a discrete random variable \(X\).
- State the properties of a density curve for a continuous random variable.
- Relate the probability that a continuous random variable \(X\) falls in an interval to its probability curve.
- Given a rectangular or triangular density curve for a random variable \(X\), find the probability that \(X\) falls in a given interval.
Chapter 11
Normal distributions
- Sketch the basic shape of the density curve for a Normal random variable.
- Identify the two parameters that specify the center and spread of the density curve for a Normal random variable.
- Recognize the convention in statistics of using Roman letters for sample statistics (\(\bar{x}\), \(s\), etc.) and Greek letters for population parameters (\(\mu\), \(\sigma\), etc.).
- Recognize, explain, and use the notation \(X \sim N(\mu, \sigma)\).
- Sketch a normal density curve for any \(N(\mu, \sigma)\) random variable, including its: mean, and mean \(\pm\) 1, 2, and 3 standard deviations.
The 68–95–99.7 rule
- Use the 68–95–99.7 rule to reason about probabilities in the body of the density curve of a Normal random variable.
The standard Normal distribution
- State the mean and standard deviation of a standard Normal random variable.
- Explain why, for a Normal random variable with mean \(\mu\) and standard deviation \(\sigma\), subtracting its mean and then dividing by its standard deviation transforms the random variable into a standard Normal random variable.
- Compute a \(Z\)-score (equivalently, \(Z\)-statistic or standardized score) given a data value from a \(N(\mu, \sigma)\) random variable.
Finding Normal probabilities
- Convert probability queries of the form
- \(P(X \leq a)\)
- \(P(X \geq a)\)
- \(P(X \leq a \text{ or } X \geq b)\)
- \(P(a \leq X \leq b)\)
into questions about the probability in the left, right, both tails, or body of a Normal density curve, or vice versa.
- Sketch the area corresponding to probabilities in the:
- Left tail
- Right tail
- Both tails
- Body
of a Normal random variable with a given mean and standard deviation.
- Identify which tail
pnorm
returns the probability for, by default.
- Find the corresponding probabilities using
pnorm
in R.
Finding percentiles
- Define the percentile of a distribution.
- Define the quantile of a distribution.
- Convert a question asking for a quantile to a question asking for a percentile or vice versa.
- Compute percentiles or quantiles using
qnorm
.
- Describe what the first letter in the functions
pnorm
and qnorm
are indicating about the function.
- Given a probability \(p\), find the value of \(a\) for the equalities \(p = P(X \leq a)\) and \(p = P(X \geq a)\).
Chapter 13
Parameters and statistics
- Compare and contrast a parameter of a population and a statistic of a sample.
- Recognize the notation of using Greek letters for parameters and Roman letters for statistics.
- Given the description of a statistical problem, identify whether a numerical characteristic is a population parameter or a sample statistic.
Statistical estimation and sampling distributions
- Define sampling distribution, and relate it to prior concepts in the course such as random variables, probability distributions, and density curves.
- Explain why we can model the statistic of a sample as a random variable when the sample was a simple random sample from a population.
The sampling distribution of \(\bar{X}\)
- State the mean \(\mu_{\bar{X}}\) of the sample mean \(\bar{X}\) of a simple random sample.
- State the standard deviation \(\sigma_{\bar{X}}\) of the sample mean \(\bar{X}\) of a simple random sample.
- Explain, using the mean and standard deviation of the sample mean, why averaging values from a simple random sample is a good idea.
- Relate the sampling distribution of the sample mean to the concepts of accuracy and precision.
- Given a population mean and standard deviation, compute the mean and standard deviation of the sample mean from a simple random sample of the population.
The central limit theorem
- State under what conditions the sampling distribution of the sample mean is exactly Normal.
- State under what conditions the sampling distribution of the sample mean is approximately Normal.
- Answer probability queries about the sample mean from a simple random sample given the relevant characteristics of the sample and the population.
Chapter 14
Statistical estimation (and Lecture 11 Lecture Notes)
- Distinguish between a descriptive statistic and an inferential statistic, and state what they characterize about a sample or a population.
- Describe the “black box” model of statistical inference.
- State the three main types of inferential statistics we will discuss in this course.
- Define point estimator.
- Distinguish between a point estimator as a procedure and a point estimate as a number.
- Discuss the main properties of the sample mean of a random sample from a population as a point estimator for the population’s mean.
Margin of error and confidence level (and Lecture 11 Lecture Notes)
- State the standard error for the sample mean of a simple random sample.
- Explain why the standard error of the sample mean is so-named.
- Determine the probability that the sample mean is within a specified error from the population mean.
- Define a probability \(c\) margin of error for a sample mean.
- Compute a probability \(c\) margin of error for a sample mean using a critical value from the standard Normal distribution.
- Relate body probabilities \(c\) to tail probabilities \(\alpha\).
- Relate the critical values \(z_{\alpha}\) of a standard Normal distribution to its quantiles, and compute such a critical value using R.
Chapter 17
The \(t\) distributions (and Lecture 11 Lecture Notes)
- Explain why the standard error of the sample mean is not useful in practice.
- State and compute an estimate of the standard error of the sample mean.
- Compare and contrast \(Z\)-scores and \(T\)-scores.
- Explain what a \(T\)-score indicates about an observed sample mean.
- Compute the \(T\)-score of an observed sample mean given the relevant information about the population and sample.