Section 7.3: Summary of Tests Concerning Regression Coefficients
- Explain why one might want to test if many regression coefficients at a time are equal to zero in the context of a larger model.
- State the null and alternative hypotheses for a given claim about more than one regression coefficient in the context of a larger model.
- State the test statistic used in a hypothesis test about more than one regression coefficient in the context of a larger model.
- Explain, qualitatively, why an \(F\)-statistic, as used in Multiple Linear Regression, will be large when the null hypothesis is false.
- State the null and alternative hypotheses for an “overall \(F\)-test” for a multiple linear regression.
Performing an \(F\)-test in R (Lecture Notes for Lecture 17)
- Interpret the output of
anova
for an \(F\)-test.
- Perform an \(F\)-test for a given claim about more than one regression coefficient in the context of a larger model using
anova
.
Common Misinterpretations of \(F\)-tests (Lecture Notes for Lecture 17)
- Give examples where non-significant overall \(F\)-tests do not indicate that the predictors are not relevant to predicting the response.
- Give examples where significant overall \(F\)-tests do not indicate that the MLRGN model is correct.
Section 8.3: Qualitative Predictors
- Give examples of predictors that are appropriately modeled as categorical rather than quantitative variables.
- Explain how to introduce a categorical predictor taking two categories into a multiple linear regression using an indicator (“dummy”) variable.
- Interpret the coefficient on an indicator variable for a binary categorical predictor.
- Interpret the intercept in a multiple linear regression with a single binary categorical predictor.
- Describe, geometrically, what the coefficient on an indicator variable does to the multiple linear regression surface.
- Explain how to introduce a categorical predictor taking more than two categories into a multiple linear regression using indicator variables.
- Interpret the coefficients on indicator variables for a >binary categorical predictor.
- Interpret the intercept in a multiple linear regression with a single >binary categorical predictor.
Fitting Regressions with Categorical Variables in R (Lecture Notes for Lecture 18)
- Describe the data type used by R for categorical variables.
- Use
relevel
to set the baseline level of a factor.
- Fit a multiple linear regression with a categorical predictor in R, and interpet the resulting coefficients in the context of the problem.
- Given the design matrix for a multiple linear regression with a categorical predictor and knowledge of which category is treated as the baseline, identify the category of a given unit using the design matrix.
- Explain how analysis of variance (ANOVA) and analysis of covariance (ANCOVA) are related to multiple linear regression.