Week 8 Checkpoint
Critical points and the second-derivative test, the gradient-descent update and learning-rate regimes, MSE, the linear-regression gradient $\frac{\partial L}{\partial w}=\frac{2}{n}X^\top(\hat{y}-y)$, gradient checking, and the closed-form normal equations.
- 1.
For , find the critical points and classify with the second-derivative test.
- 2.
Minimizing by gradient descent from with learning rate , the update gives:
- 3.
During gradient descent you observe the loss oscillating and increasing over iterations. The most likely cause is:
- 4.
For predictions and targets , the mean squared error is:
- 5.
For linear regression with and (with ), the gradient with respect to is:
- 6.
Gradient checking compares an analytic gradient to a central-difference estimate. Which outcome indicates your analytic gradient is correct?
- 7.
The closed-form (normal-equations) solution minimizing the linear-regression MSE is: