Course - Optimisation for Data Science HT25
This course analyses optimisation methods suitable for large-scale data science problems, mainly by deriving rates of convergence under increasingly strong assumptions (smooth, convex, strongly convex) on the objective function.
The course begins with some optimisation terminology and then covers gradient descent and the proximal method, which can be used to apply steepest descent techniques to regularised problems. It then covers acceleration techniques such as the heavy ball method, before moving on to stochastic gradient descent and accelerated techniques in that context. Finally, it covers coordinate descent methods.
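To make the proximal idea concrete, here is a minimal sketch (my own, not from the course materials) of proximal gradient / ISTA applied to $\ell_1$-regularised least squares, $\min _ x \tfrac{1}{2}\lVert Ax-b\rVert^2 + \lambda \lVert x\rVert _ 1$; the problem instance, stepsize choice and variable names are illustrative assumptions.

```python
import numpy as np

def soft_threshold(v, tau):
    """Proximal operator of tau * ||.||_1 (soft thresholding)."""
    return np.sign(v) * np.maximum(np.abs(v) - tau, 0.0)

def proximal_gradient(A, b, lam, n_iters=500):
    """Proximal gradient (ISTA) for 0.5*||Ax - b||^2 + lam*||x||_1.

    Each iteration takes a steepest-descent step on the smooth part,
    then applies the proximal operator of the regulariser.
    """
    L = np.linalg.norm(A, 2) ** 2          # Lipschitz constant of the smooth gradient
    alpha = 1.0 / L                        # fixed stepsize 1/L
    x = np.zeros(A.shape[1])
    for _ in range(n_iters):
        grad = A.T @ (A @ x - b)           # gradient of the smooth term
        x = soft_threshold(x - alpha * grad, alpha * lam)
    return x

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    A = rng.standard_normal((50, 20))
    x_true = np.zeros(20); x_true[:3] = [2.0, -1.0, 0.5]   # sparse ground truth
    b = A @ x_true + 0.01 * rng.standard_normal(50)
    print(proximal_gradient(A, b, lam=0.1).round(2))
```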
- Course Webpage
- Lecture Notes
- From the previous year:
- Lecture Notes
- 1, Scope and examples
- 2, Terminology and prerequisites
- 3, Method of steepest descent
- 4, The proximal method
- 5, Acceleration of gradient methods
- 6, Stochastic gradient descent
- 7, Reducing the noise floor in SGD
- 8, Coordinate descent
- 9, Practical coordinate descent
- 10, Outlook (non-examinable)
- Other courses this term: [[Courses HT25]]U
Notes
- [[Notes - Optimisation for Data Science HT25, Overview of convergence results]]U ⭐️
- [[Notes - Optimisation for Data Science HT25, Motivation and examples]]U
- [[Notes - Optimisation for Data Science HT25, Optimisation terminology]]U
- [[Notes - Optimisation for Data Science HT25, Smoothness and convexity]]U
- [[Notes - Optimisation for Data Science HT25, Subgradients]]U
- [[Notes - Optimisation for Data Science HT25, Steepest descent]]U
- [[Notes - Optimisation for Data Science HT25, Line search methods]]U
- [[Notes - Optimisation for Data Science HT25, Proximal methods]]U
- [[Notes - Optimisation for Data Science HT25, Accelerated methods]]U
- [[Notes - Optimisation for Data Science HT25, Stochastic gradient descent]]U
- [[Notes - Optimisation for Data Science HT25, Variance reduction methods]]U
- [[Notes - Optimisation for Data Science HT25, Coordinate descent]]U
- [[Notes - Optimisation for Data Science HT25, Useful miscellany]]U
- [[Notes - Optimisation for Data Science HT25, Homemade exam questions]]?
Problem Sheets
- From the previous year:
To-Do List
Differences between notes and slides
- 6: Stochastic gradient descent
- More details on taking expectations in the analysis of the algorithm (see the SGD sketch after this list)
- 7: Reducing the noise floor
- The proof for the dynamic stepsize seems easier to follow; it also uses Jensen’s inequality at one point, which the notes don’t
- Proves a stronger result about $\lim$ rather than $\liminf$
- The proof for SVRG seems clearer
- 8: Coordinate descent
- Wolfe conditions
- Convergence of cyclic coordinate descent with inexact line searches
- Interpretation of why convergence fails on Powell’s objective, in terms of the general convergence results
- Convergence of randomised coordinate descent with inexact line searches
- Dimensionality dilemma
- More detail about Lipschitz componentwise derivatives and cleaner proof by factoring out a “foundational lemma”
- Proof in general case with fixed stepsize
- Proof in convex case with fixed stepsize
- Proof in $\gamma$-strongly convex case with fixed stepsize
- 9: Practical coordinate descent
- The results on randomised coordinate descent assume that the step size is fixed at $1/L$; this lecture is about what happens when you instead use $\alpha _ k$ satisfying the Wolfe conditions (still assuming that $f$ is CL-smooth); see the coordinate descent sketch after this list
- Wolfe conditions for RCD with line search
- Proof of the overestimation lemma for coordinate line search (deterministic and in expectation)
- Proof of RCD in the general case
- Proof of RCD in the convex case
- Proof of RCD in the $\gamma$-strongly convex case
- CCD in convex case (no proof)
- CCD in $\gamma$-strongly convex case (no proof)
- Discussion of cycle complexity and iteration complexity
- Block coordinate descent algorithm
- Block Lipschitz smoothness
- Prove the block Lipschitz overestimation property; also check whether it is indeed $L _ \text{bmax}$ rather than $L _ \text{max}$ as the notes say
- Proof of RBCD for CL-smooth, $\gamma$-strongly convex functions with known block structure
- Proof of RBCD for CL-smooth, $\gamma$-strongly convex functions with equal-sized subsets
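The SGD sketch referenced under lecture 6 above: a toy finite-sum least-squares example (my own, not the course’s algorithm or notation) with a numerical check that the stochastic gradient is an unbiased estimate of the full gradient, which is the expectation step the analysis relies on.

```python
import numpy as np

def sgd_least_squares(A, b, alpha=0.01, n_iters=2000, seed=0):
    """SGD on f(x) = (1/n) * sum_i 0.5*(a_i^T x - b_i)^2.

    Each step uses the gradient of a single, uniformly sampled term,
    which is an unbiased estimate of the full gradient.
    """
    rng = np.random.default_rng(seed)
    n, d = A.shape
    x = np.zeros(d)
    for _ in range(n_iters):
        i = rng.integers(n)                        # uniform random index
        x -= alpha * (A[i] @ x - b[i]) * A[i]      # stochastic gradient step
    return x

if __name__ == "__main__":
    rng = np.random.default_rng(1)
    A = rng.standard_normal((100, 5))
    b = A @ rng.standard_normal(5)
    x = rng.standard_normal(5)
    full_grad = A.T @ (A @ x - b) / len(b)         # gradient of the averaged objective
    mean_stoch = np.mean([(A[i] @ x - b[i]) * A[i] for i in range(len(b))], axis=0)
    print(np.allclose(full_grad, mean_stoch))      # True: the estimator is unbiased
    print(np.round(sgd_least_squares(A, b), 3))
```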
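The coordinate descent sketch referenced under lecture 9 above: a minimal, illustrative implementation (my own, not taken from the notes) of randomised coordinate descent on a convex quadratic with a fixed stepsize of $1/L _ i$ per coordinate, where $L _ i$ is the coordinatewise Lipschitz constant; the exact fixed constant in the notes may differ, and a Wolfe-type line search would replace the fixed stepsize inside the loop.

```python
import numpy as np

def randomised_coordinate_descent(Q, c, n_iters=5000, seed=0):
    """RCD on the convex quadratic f(x) = 0.5*x^T Q x - c^T x (Q symmetric PD).

    At each iteration, pick a coordinate i uniformly at random and take a
    gradient step along e_i with fixed stepsize 1/L_i, where L_i = Q[i, i]
    is the Lipschitz constant of the i-th partial derivative.
    """
    rng = np.random.default_rng(seed)
    d = len(c)
    L = np.diag(Q).copy()                  # coordinatewise Lipschitz constants
    x = np.zeros(d)
    for _ in range(n_iters):
        i = rng.integers(d)
        grad_i = Q[i] @ x - c[i]           # i-th partial derivative
        x[i] -= grad_i / L[i]              # fixed stepsize 1/L_i along coordinate i
    return x

if __name__ == "__main__":
    rng = np.random.default_rng(2)
    M = rng.standard_normal((8, 8))
    Q = M @ M.T + np.eye(8)                # symmetric positive definite
    c = rng.standard_normal(8)
    x = randomised_coordinate_descent(Q, c)
    print(np.allclose(x, np.linalg.solve(Q, c), atol=1e-4))   # minimiser solves Qx = c
```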
Relevant reading
- “Optimisation for Data Science” course at ETH Zurich:
- #uni #notes #task #read 2: Theory of Convex Functions ✅ 2025-03-29
- #uni #notes #task #read 3: Gradient Descent
- #uni #notes #task #read 4: Projected Gradient Descent
- #uni #notes #task #read 5: Coordinate Descent
- #uni #notes #task #read 10: Subgradient Methods
- #uni #notes #task #read 11: Mirror Descent, Smoothing, Proximal Algorithms
- #uni #notes #task #read 12: Stochastic Optimisation
- #uni #notes #task #read 13: Finite Sum Optimisation
- “Convex Optimisation” at CMU:
- #uni #notes #task #read 1: Convex Sets
- #uni #notes #task #read 2: Convex Functions, Optimization Basics
- #uni #notes #task #read 3: Gradient Descent
- #uni #notes #task #read 4: More Gradient Descent and Subgradients
- #uni #notes #task #read 5: The Subgradient Method and Oracle Lower Bounds
- #uni #notes #task #read 6: Projected Gradient Descent and the Proximal Method
- #uni #notes #task #read 7: More Proximal Method
- #uni #notes #task #read 8: Stochastic Gradient Descent
- Convex Optimisation, Boyd and Vandenberghe
- #uni #notes #task #read 1: Introduction
- #uni #notes #task #read 2: Convex sets
- #uni #notes #task #read 3: Convex functions
- #uni #notes #task #read 4: Convex optimization problems
- #uni #notes #task #read 9: Unconstrained minimisation
- #uni #notes #task #read 10: Equality constrained minimisation
- Numerical Optimisation, Nocedal and Wright
- #uni #notes #task #read 1: Introduction
- #uni #notes #task #read 2: Fundamentals of Unconstrained Optimisation
- #uni #notes #task #read 3: Line-search methods
- Deep Learning, Goodfellow
- #uni #notes #task #read 8: Optimization for Training Deep Models
- Optimisation for Machine Learning, Sra, Nowozin and Wright
- #uni #notes #task #read 1: Optimization and Machine Learning
- #uni #notes #task #read 2: Convex Optimization with Sparsity-Inducing Norms
- #uni #notes #task #read 4: Incremental Gradient, Subgradient, and Proximal Methods for Convex Optimization: A survey