Inferential Statistics
- State the three main tasks of inferential statistics.
- State the three procedures used to perform the three main tasks of inferential statistics.
- Explain why the outcomes of inferential procedures are themselves random.
- Explain what it means for an inferential procedure to have desirable frequency properties.
Simulation
Big Picture
- Explain the analogy between \(X_{1}, X_{2}, \ldots, X_{n} \stackrel{\text{iid}}{\sim} D\) (in math) and
Xs <- rD(n)
(in R).
Sampling Distributions via Simulation
- Given a population distribution and a sample statistic that can be calculated using R, construct a single realization of the sample statistic in R.
- Use
replicate()
to repeatedly generate a random sample and calculate an associated sample statistic.
- Compare the empirical sampling distribution from simulation to a known sampling distribution from theory.
Fisher’s Confidence Interval for \(\rho\)
- State Fisher’s confidence interval for the population correlation \(\rho\) of a bivariate Gaussian based on his \(Z\)-transformation of the sample correlation \(R\).
- Compute the hyperbolic functions \(\sinh x\), \(\cosh x\), and \(\tanh x\) and their inverses using R.
- Compute Fisher’s confidence interval in R from two vectors
x
and y
both:
- directly from the formulas
- using
cor.test()
Coverage of Confidence Intervals via Simulation
- Given a population distribution and an interval estimator that can be calculated using R, construct a single realization of the interval estimator, i.e. a confidence interval.
- Use
replicate()
to repeatedly generate a random sample and calculate an associated confidence interval.
- Check the coverage of confidence intervals returned by
replicate()
when given the true value of a population parameter.
- Explain why the number of confidence intervals in a simulation that cover the true value follows a binomial distribution.
- Use
prop.test()
to construct an interval estimate for the true coverage of a confidence interval using simulation.