Private notes
0/8000

Notes stay private to your browser until account sync is configured.

Part 5
1 min read3 headingsSplit lesson page

Lesson overview | Previous part | Lesson overview

Gradient Descent: Part 12: Conceptual Bridge to References

12. Conceptual Bridge

Gradient Descent sits inside a chain. Earlier sections give the calculus, probability, and linear algebra needed to write the objective and interpret the update. Later sections use this material to reason about noisy gradients, adaptive state, regularization, tuning, schedules, and finally information-theoretic losses.

Backward link: Convex Optimization supplies the immediate prerequisite vocabulary.

Forward link: Second-Order Methods uses this section as a building block.

+------------------------------------------------------------+
| Chapter 8: Optimization                                    |
|    01-Convex-Optimization          Convex Optimization    |
| >> 02-Gradient-Descent             Gradient Descent       |
|    03-Second-Order-Methods         Second-Order Methods   |
|    04-Constrained-Optimization     Constrained Optimization |
|    05-Stochastic-Optimization      Stochastic Optimization |
|    06-Optimization-Landscape       Optimization Landscape |
|    07-Adaptive-Learning-Rate       Adaptive Learning Rate |
|    08-Regularization-Methods       Regularization Methods |
|    09-Hyperparameter-Optimization  Hyperparameter Optimization |
|    10-Learning-Rate-Schedules      Learning Rate Schedules |
+------------------------------------------------------------+

Appendix A. Extended Derivation and Diagnostic Cards

References

  • Nocedal and Wright, Numerical Optimization.
  • Bertsekas, Nonlinear Programming.
  • Polyak, Introduction to Optimization.
  • Nesterov, A Method for Solving the Convex Programming Problem.
  • Goodfellow, Bengio, and Courville, Deep Learning.
  • Bottou, Curtis, and Nocedal, Optimization Methods for Large-Scale Machine Learning.
  • PyTorch optimizer and scheduler documentation.
  • Optax documentation for composable optimizer transformations.

Skill Check

Test this lesson

Answer 4 quick questions to lock in the lesson and feed your adaptive practice queue.

--
Score
0/4
Answered
Not attempted
Status
1

Which module does this lesson belong to?

2

Which section is covered in this lesson content?

3

Which term is most central to this lesson?

4

What is the best way to use this lesson for real learning?

Your answers save locally first, then sync when account storage is available.
Practice queue