Learning Objectives for Quiz 7

Inferential Statistics

  1. State the three main tasks of inferential statistics.
  2. State the three procedures used to perform the three main tasks of inferential statistics.
  3. Explain why the outcomes of inferential procedures are themselves random.
  4. Explain what it means for an inferential procedure to have desirable frequency properties.

Fisher’s Confidence Interval for \(\rho\)

  1. State Fisher’s confidence interval for the population correlation \(\rho\) of a bivariate Gaussian based on his \(Z\)-transformation of the sample correlation \(R\).
  2. Compute the hyperbolic functions \(\sinh x\), \(\cosh x\), and \(\tanh x\) and their inverses using R.
  3. Compute Fisher’s confidence interval in R from two paired numeric vectors x and y.
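
A minimal sketch of Fisher’s interval, assuming hypothetical simulated vectors x and y; the key steps are the \(Z\)-transformation via atanh(), the approximate standard error \(1/\sqrt{n-3}\), and the back-transformation via tanh().

```r
# Fisher's confidence interval for rho; x and y here are simulated
# bivariate data for illustration only.
set.seed(1)
n <- 50
x <- rnorm(n)
y <- 0.6 * x + rnorm(n, sd = 0.8)

r <- cor(x, y)                        # sample correlation R
z <- atanh(r)                         # Fisher's Z-transformation
se <- 1 / sqrt(n - 3)                 # approximate standard error of Z
alpha <- 0.05
crit <- qnorm(1 - alpha / 2)
ci <- tanh(z + c(-1, 1) * crit * se)  # back-transform to the rho scale
ci
```

As a check, base R’s cor.test(x, y)$conf.int computes the same Fisher interval for the Pearson correlation.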

Hypothesis Tests

  1. You are expected to know all of the intro-stats-level material about hypothesis testing.
  2. Define the size of a hypothesis test.
  3. Explain what it means for a hypothesis test to have (significance) level \(\alpha\).

Nonparametric Hypothesis Tests

  1. Explain what it means for a distribution to be parametric.
  2. Give examples of parametric distributions and state their parameters.
  3. Explain the advantages of a nonparametric hypothesis test compared to a parametric hypothesis test.

The One-sample Sign Test

  1. State the null and alternative hypotheses for the one-sample sign test.
  2. State the test statistic for the one-sample sign test and its sampling distribution under the null hypothesis.
  3. Give the rationale for the test statistic for the one-sample sign test.
  4. Perform a one-sample sign test using SignTest() from the DescTools package.
  5. Explain why the one-sample sign test can produce only a discrete set of \(P\)-values.
  6. Describe the general shape of the sampling distribution of \(P\)-values for a one-sample sign test when the null hypothesis is true.
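
A sketch of the sign test’s mechanics, using base R’s binom.test() so it runs without extra packages; the sample and hypothesized median m0 are hypothetical. Under \(H_0\): median \(= m_0\), the test statistic \(S = \#\{x_i > m_0\}\) follows a Binomial\((n, 1/2)\) distribution, where ties with \(m_0\) are dropped before counting.

```r
# One-sample sign test by hand; data and m0 are assumptions for illustration.
set.seed(2)
x <- rexp(20, rate = 1)    # hypothetical sample
m0 <- 1                    # hypothesized median under H0
s <- sum(x > m0)           # test statistic: observations above m0
n <- sum(x != m0)          # sample size after dropping ties with m0
p <- binom.test(s, n, p = 0.5, alternative = "two.sided")$p.value
p
```

With DescTools installed, SignTest(x, mu = m0) should report the same \(P\)-value, since it performs this same exact binomial test.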

Simulation

Big Picture

  1. Explain the analogy between \(X_{1}, X_{2}, \ldots, X_{n} \stackrel{\text{iid}}{\sim} D\) (in math) and Xs <- rD(n) (in R).
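
A one-line illustration of the analogy, taking \(D\) to be the standard Gaussian as a hypothetical example:

```r
# X1, ..., X5 iid ~ N(0, 1) in math corresponds to one call to rnorm() in R:
Xs <- rnorm(5, mean = 0, sd = 1)  # one realization of the whole iid sample
```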

Sampling Distributions via Simulation

  1. Given a population distribution and a sample statistic that can be calculated using R, construct a single realization of the sample statistic in R.
  2. Use replicate() to repeatedly generate a random sample and calculate an associated sample statistic.
  3. Compare the empirical sampling distribution from simulation to a known sampling distribution from theory.
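
The steps above can be sketched for a hypothetical example: the sample mean of \(n = 25\) iid \(N(0, 1)\) observations, whose theoretical sampling distribution is \(N(0, 1/25)\).

```r
# Empirical sampling distribution of the sample mean via simulation.
set.seed(3)
one_xbar <- mean(rnorm(25))                 # a single realization
xbars <- replicate(10000, mean(rnorm(25)))  # many realizations
c(mean(xbars), sd(xbars))                   # compare to theory: 0 and 1/5
```

A histogram of xbars overlaid with the \(N(0, 1/25)\) density makes the comparison to theory visual.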

Coverage of Confidence Intervals via Simulation

  1. Given a population distribution and an interval estimator that can be calculated using R, construct a single realization of the interval estimator, i.e. a confidence interval.
  2. Use replicate() to repeatedly generate a random sample and calculate an associated confidence interval.
  3. Check the coverage of confidence intervals returned by replicate() when given the true value of a population parameter.
  4. Explain why the number of confidence intervals in a simulation that cover the true value follows a binomial distribution.
  5. Use prop.test() to construct an interval estimate for the true coverage of a confidence interval using simulation.
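
The workflow above can be sketched with a hypothetical example: checking the coverage of the \(t\) confidence interval for the mean of a \(N(0, 1)\) population, where the count of covering intervals is Binomial(nsim, coverage).

```r
# Coverage of the t confidence interval via simulation.
set.seed(4)
nsim <- 1000
mu <- 0                                        # true population mean
cis <- replicate(nsim, t.test(rnorm(25, mean = mu))$conf.int)
covered <- cis[1, ] <= mu & mu <= cis[2, ]     # does each interval cover mu?
sum(covered)                                   # ~ Binomial(nsim, 0.95) count
prop.test(sum(covered), nsim)$conf.int         # interval for true coverage
```

Here replicate() simplifies the results into a 2-by-nsim matrix, so row 1 holds the lower endpoints and row 2 the upper endpoints.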

Statistical Characteristics of Hypothesis Tests

  1. Given a population distribution and its mean, simulate a one-sample \(t\)-test from that population under the null or alternative hypothesis using t.test().
  2. Given a population distribution and its median, simulate a one-sample sign test from that population under the null or alternative hypothesis using SignTest() from the DescTools package.
  3. Use replicate() to simulate many \(P\)-values for a one-sample \(t\)-test or sign test under the null or alternative hypothesis.
  4. Use ecdf() to approximate either the Type I Error Rate or Power of a hypothesis test from the simulated \(P\)-values.
  5. Explain what the sampling distribution of \(P\)-values should look like under a point null hypothesis \(H_{0} : \theta = \theta_{0}\) when the null hypothesis is true.
  6. Explain what the sampling distribution of \(P\)-values should look like under a point null hypothesis \(H_{0} : \theta = \theta_{0}\) when the null hypothesis is false.
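
A sketch of the \(t\)-test half of this workflow, assuming a hypothetical \(N(\mu, 1)\) population with \(H_0 : \mu = 0\); evaluating the ECDF of the simulated \(P\)-values at \(\alpha\) approximates the Type I error rate (under the null) or the power (under the alternative).

```r
# Simulated P-values for the one-sample t-test under H0 and under H1.
set.seed(5)
p_null <- replicate(2000, t.test(rnorm(25, mean = 0))$p.value)    # H0 true
p_alt  <- replicate(2000, t.test(rnorm(25, mean = 0.5))$p.value)  # H0 false
ecdf(p_null)(0.05)  # approximate Type I error rate; should be near 0.05
ecdf(p_alt)(0.05)   # approximate power against mu = 0.5
```

Under the null the \(P\)-values should look approximately Uniform(0, 1); under the alternative their distribution piles up near 0.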

Parametric Distributions

  1. Explain why the plug-in estimator for a property of a parametric distribution is so-named.
  2. Given a parametric distribution, estimates for its parameters, and a function for how the parameters are related to a particular property of the distribution, compute a plug-in estimate for that property.
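
A sketch with a hypothetical example: for \(X \sim \text{Exponential}(\lambda)\), the property \(P(X > 2) = e^{-2\lambda}\) is a known function of the parameter, so the plug-in estimate simply substitutes \(\hat{\lambda}\) into that function.

```r
# Plug-in estimate of P(X > 2) for an Exponential(lambda) population.
set.seed(6)
x <- rexp(100, rate = 0.5)   # hypothetical sample; true lambda = 0.5
lambda_hat <- 1 / mean(x)    # estimate of the rate parameter
exp(-2 * lambda_hat)         # plug lambda_hat into P(X > 2) = exp(-2*lambda)
# True value for comparison: exp(-2 * 0.5) = exp(-1), about 0.368
```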

The Parametric Bootstrap

  1. Explain the two “levels” of the parametric bootstrap in terms of what has actually happened in the real world and what is being simulated in bootstrap world.
  2. State the “recipe” for generating bootstrap estimates from a parametric distribution.
  3. Given a parametric distribution and estimators for its parameters, use the parametric bootstrap to obtain bootstrap estimates of a property of that distribution.
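
The recipe can be sketched for a hypothetical Exponential population, where the property of interest is the median: fit the parametric model to the one real sample, then in bootstrap world simulate new samples from the fitted model and recompute the estimator each time.

```r
# Parametric bootstrap for the median of an Exponential distribution.
set.seed(7)
x <- rexp(50, rate = 2)                # the one real sample
lambda_hat <- 1 / mean(x)              # step 1: estimate the parameter
boot_medians <- replicate(2000, {
  xb <- rexp(50, rate = lambda_hat)    # step 2: simulate from the fitted model
  median(xb)                           # step 3: recompute the statistic
})
sd(boot_medians)                       # e.g. a bootstrap SE for the median
```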

The Nonparametric Bootstrap

  1. Compare and contrast the parametric and nonparametric bootstraps.
  2. Explain the two “levels” of the nonparametric bootstrap in terms of what has actually happened in the real world and what is being simulated in bootstrap world.
  3. State the “recipe” for generating bootstrap estimates using the nonparametric bootstrap.
  4. Explain how to sample from the ECDF of a sample using sample().
  5. Given an estimator for a property of a distribution, use the nonparametric bootstrap to obtain bootstrap estimates of that property of the distribution.
  6. Explain under what scenarios a parametric or nonparametric bootstrap is more appropriate.
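
A sketch of the nonparametric recipe with a hypothetical sample: drawing from the ECDF of the sample is exactly sample(x, replace = TRUE), and each resample yields one bootstrap estimate of the property (here, the median).

```r
# Nonparametric bootstrap for the median; no distributional model is fit.
set.seed(8)
x <- rexp(50, rate = 2)                    # the one real sample
boot_medians <- replicate(2000,
  median(sample(x, size = length(x), replace = TRUE)))  # resample the ECDF
sd(boot_medians)                           # bootstrap SE of the median
```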

The Bootstrap Percentile Confidence Interval

  1. Construct a coverage \(1 - \alpha\) bootstrap percentile confidence interval for a property of a distribution using either parametric or nonparametric bootstrap estimates of that property.
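
A sketch of the percentile interval, using nonparametric bootstrap estimates of a hypothetical sample’s median; the same quantile() step applies unchanged to parametric bootstrap estimates.

```r
# Coverage 1 - alpha bootstrap percentile interval for the median.
set.seed(9)
x <- rexp(50, rate = 2)
boot_medians <- replicate(2000,
  median(sample(x, size = length(x), replace = TRUE)))  # bootstrap estimates
alpha <- 0.05
quantile(boot_medians, c(alpha / 2, 1 - alpha / 2))     # percentile CI
```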