Learning Objectives for Quiz 4

State two natural sources of random numbers.
State what devices are used to “extract” the random numbers from the two sources in the previous learning objective.

State the algorithm for generating pseudorandom numbers using a linear congruential generator (LCG).
Given the modulus, multiplier, and increment for a LCG and a seed, generate variates from the LCG.
Evaluate expressions \(a \operatorname{mod} m\) involving the modulo operation by-hand.
Compute expressions \(a \operatorname{mod} m\) using R.
State the possible values that a LCG with a given modulus could generate.
Given iterates from a LCG with a relatively small period, identify the period of the LCG.

Convert variates uniform on \(\{0, 1, 2, \ldots, m - 1\}\) to variates approximately uniform on \((0, 1)\).
Compute the quantile function of a given distribution using R.
Given the quantile function \(q_{F_{X}}\) of a distribution \(F_{X}\) and variates uniform on \((0, 1)\), generate non-uniform variates following \(F_{X}\).

Define the sample quantile function \(\widehat{q}(p)\) from a data set \(X_{1}, X_{2}, \ldots, X_{n}\).
Use R to compute the quantiles of a data set.
Given the graph of a sample quantile function, identify the first, second, and third quartiles of the data set.

Define the empirical cumulative distribution function \(\widehat{F}(x)\) from a data set \(X_{1}, X_{2}, \ldots, X_{n}\).
Given a (small) data set, evaluate \(\widehat{F}\) at a given argument \(x\) by-hand.
Use R to generate and evaluate the empirical cumulative distribution function from a data set.
Explain how the sample quantile function and the empirical cumulative distribution function are related.

Explain the difference between frequency and density histograms and use hist() to plot both.
Set the number of bins for a histogram generated by hist().
Set the bin selection algorithm used by hist().

State the form of a kernel density estimate from a data set \(X_{1}, X_{2}, \ldots, X_{n}\).
Draw a rough sketch of a kernel density estimate from a data set.
State the properties that the kernel function used in a kernel density estimate must have.
State the expression for the Gaussian kernel function.
Explain how the bandwidth of the kernel function affects the kernel density estimate.
Set the bandwidth used by density() to a specific value.
Set the bandwidth selection method used by density().