Stat 4830 Recap: Optimization in PyTorch - Cheat Sheet

Overall Goal: Consolidate understanding of numerical optimization for ML/Data Science using PyTorch, aiming to become an intelligent consumer of these methods.


1. Introduction


2. Key Concepts & Skills

PyTorch Fundamentals

Problem Formulation

Gradient Descent (GD) vs. Stochastic GD (SGD)

Understanding SGD Behavior (via NQM etc.)

SGD Variants (Insights from NQM L7)

Adaptive Methods & Practical Tools

Practical Tuning & Evaluation

Scaling Large Models (L12)


3. How Did We Measure Up? (Learning Outcomes from L0)


4. What We Didn’t Cover


5. Where to Go From Here


6. Conclusion