# Course - Optimisation for Data Science HT25

> Source: https://ollybritton.com/notes/uni/part-b/ht25/optimisation-for-data-science/ · Updated: 2026-01-26 · Tags: uni, course

> This course analyses optimisation methods suitable for large-scale data science problems, mainly by deriving convergence rates under progressively stronger assumptions (smooth, convex, strongly convex) on the objective function.
>
> The course begins with some optimisation terminology, then covers gradient descent and the proximal method, which extends steepest descent techniques to regularised problems. It then covers acceleration techniques such as the heavy ball method, before moving on to stochastic gradient descent and accelerated techniques in that context. Finally, it covers coordinate descent methods.
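
As a rough illustration of how the proximal method applies a gradient step to a regularised problem, here is a minimal sketch of proximal gradient descent on a toy $\ell_1$-regularised quadratic. The separable objective, the names `a`, `b`, `lam`, and the fixed step size $1/L$ are assumptions chosen for illustration and are not taken from the course materials.

```python
def soft_threshold(v, t):
    """Proximal operator of t * |.| applied to a scalar v."""
    if v > t:
        return v - t
    if v < -t:
        return v + t
    return 0.0


def proximal_gradient(a, b, lam, iters=200):
    """Minimise sum_i 0.5 * a[i] * (x[i] - b[i])**2 + lam * sum_i |x[i]|.

    Assumes every a[i] > 0, so the smooth part has an L-Lipschitz
    gradient with L = max(a).
    """
    step = 1.0 / max(a)  # fixed step size 1/L
    x = [0.0] * len(b)
    for _ in range(iters):
        # Gradient of the smooth (quadratic) part only.
        grad = [ai * (xi - bi) for ai, xi, bi in zip(a, x, b)]
        # Gradient step on the smooth part, then prox of the l1 term.
        x = [soft_threshold(xi - step * gi, step * lam)
             for xi, gi in zip(x, grad)]
    return x


# Hypothetical toy data.
a = [1.0, 2.0, 4.0]
b = [3.0, -0.2, 1.0]
x = proximal_gradient(a, b, lam=1.0)
print(x)
```

Because this toy problem is separable, the minimiser has the closed form `soft_threshold(b[i], lam / a[i])` in each coordinate, which gives a simple check on the iterates.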

- [Course Webpage](https://courses.maths.ox.ac.uk/course/view.php?id=5543)
- Lecture Notes
- From the previous year:
	- [Lecture Notes](https://courses.maths.ox.ac.uk/pluginfile.php/105620/mod_folder/content/0/2023Lectures1-16.pdf)
	- [1, Scope and examples](https://courses.maths.ox.ac.uk/pluginfile.php/105621/mod_folder/content/0/b6.2Slides24hauserPart1.pdf?forcedownload=1)
	- [2, Terminology and prerequisites](https://courses.maths.ox.ac.uk/pluginfile.php/105621/mod_folder/content/0/b6.2Slides24hauserPart2.pdf)
	- [3, Method of steepest descent](https://courses.maths.ox.ac.uk/pluginfile.php/105621/mod_folder/content/0/b6.2Slides24hauserPart3.pdf)
	- [4, The proximal method](https://courses.maths.ox.ac.uk/pluginfile.php/105621/mod_folder/content/0/b6.2Slides23hauserPart4.pdf)
	- [5, Acceleration of gradient methods](https://courses.maths.ox.ac.uk/pluginfile.php/105621/mod_folder/content/0/b6.2Slides23hauserPart5.pdf)
	- [6, Stochastic gradient descent](https://courses.maths.ox.ac.uk/pluginfile.php/105621/mod_folder/content/0/b6.2Slides23hauserPart6.pdf)
	- [7, Reducing the noise floor in SGD](https://courses.maths.ox.ac.uk/pluginfile.php/105621/mod_folder/content/0/b6.2Slides23hauserPart7.pdf)
	- [8, Coordinate descent](https://courses.maths.ox.ac.uk/pluginfile.php/105621/mod_folder/content/0/b6.2Slides23hauserPart8.pdf)
	- [9, Practical coordinate descent](https://courses.maths.ox.ac.uk/pluginfile.php/105621/mod_folder/content/0/b6.2Slides23hauserPart9.pdf)
	- [10, Outlook (non-examinable)](https://courses.maths.ox.ac.uk/pluginfile.php/105621/mod_folder/content/0/b6.2Slides23hauserPart10.pdf)
- Other courses this term: [Courses HT25](https://ollybritton.com/notes/uni/part-b/ht25/)

### Notes
- [Notes - Optimisation for Data Science HT25, Overview of results](https://ollybritton.com/notes/uni/part-b/ht25/optimisation-for-data-science/notes/overview-of-results/) ⭐️

- [Notes - Optimisation for Data Science HT25, Motivation and examples](https://ollybritton.com/notes/uni/part-b/ht25/optimisation-for-data-science/notes/motivation-and-examples/)
- [Notes - Optimisation for Data Science HT25, Terminology](https://ollybritton.com/notes/uni/part-b/ht25/optimisation-for-data-science/notes/terminology/)
- [Notes - Optimisation for Data Science HT25, Convexity](https://ollybritton.com/notes/uni/part-b/ht25/optimisation-for-data-science/notes/convexity/)
- [Notes - Optimisation for Data Science HT25, Subgradients](https://ollybritton.com/notes/uni/part-b/ht25/optimisation-for-data-science/notes/subgradients/)
- [Notes - Optimisation for Data Science HT25, Steepest descent](https://ollybritton.com/notes/uni/part-b/ht25/optimisation-for-data-science/notes/steepest-descent/)
- [Notes - Optimisation for Data Science HT25, Steepest descent with inexact line search](https://ollybritton.com/notes/uni/part-b/ht25/optimisation-for-data-science/notes/steepest-descent-with-inexact-line-search/)
- [Notes - Optimisation for Data Science HT25, Proximal methods](https://ollybritton.com/notes/uni/part-b/ht25/optimisation-for-data-science/notes/proximal-methods/)
- [Notes - Optimisation for Data Science HT25, Accelerated methods](https://ollybritton.com/notes/uni/part-b/ht25/optimisation-for-data-science/notes/accelerated-methods/)
- [Notes - Optimisation for Data Science HT25, Nesterov's accelerated gradient method](https://ollybritton.com/notes/uni/part-b/ht25/optimisation-for-data-science/notes/nesterovs-accelerated-gradient-method/)
- [Notes - Optimisation for Data Science HT25, Stochastic gradient descent](https://ollybritton.com/notes/uni/part-b/ht25/optimisation-for-data-science/notes/stochastic-gradient-descent/)
- [Notes - Optimisation for Data Science HT25, Stochastic variance reduction methods](https://ollybritton.com/notes/uni/part-b/ht25/optimisation-for-data-science/notes/stochastic-variance-reduction-methods/)
- [Notes - Optimisation for Data Science HT25, Coordinate descent](https://ollybritton.com/notes/uni/part-b/ht25/optimisation-for-data-science/notes/coordinate-descent/)

- [Notes - Optimisation for Data Science HT25, Misc](https://ollybritton.com/notes/uni/part-b/ht25/optimisation-for-data-science/notes/misc/)

### Problem Sheets
- [Sheet 1](https://courses.maths.ox.ac.uk/pluginfile.php/105617/mod_assign/introattachment/0/ps1Questions_HT24.pdf), [partial answers](https://courses.maths.ox.ac.uk/pluginfile.php/105617/mod_assign/introattachment/0/ps1QA_Students_HT24.pdf)
- [Sheet 2](https://courses.maths.ox.ac.uk/pluginfile.php/105616/mod_assign/introattachment/0/ps2Questions_HT24.pdf), [partial answers](https://courses.maths.ox.ac.uk/pluginfile.php/105616/mod_assign/introattachment/0/ps2QA_Students_HT24.pdf)
- [Sheet 3](https://courses.maths.ox.ac.uk/pluginfile.php/105618/mod_assign/introattachment/0/psheet3_cc_2024.pdf), [partial answers](https://courses.maths.ox.ac.uk/pluginfile.php/105618/mod_assign/introattachment/0/psheet3Solutions_PartsAandC.pdf)
- [Sheet 4](https://courses.maths.ox.ac.uk/pluginfile.php/105619/mod_assign/introattachment/0/Problem_Sheet_4.pdf), [partial answers](https://courses.maths.ox.ac.uk/pluginfile.php/105619/mod_assign/introattachment/0/Problem_Sheet4_PartsA_and_C.pdf)

### To-Do List

### Relevant reading
- ["Optimisation for Data Science"](https://odi.inf.ethz.ch/teaching/ODS24.html) course at ETH Zurich
- ["Convex Optimisation"](https://www.stat.cmu.edu/~siva/teaching/725) at CMU
- [Convex Optimisation, Boyd and Vandenberghe](https://web.stanford.edu/~boyd/cvxbook/bv_cvxbook.pdf)
- [Numerical Optimisation, Nocedal and Wright](https://www.math.uci.edu/~qnie/Publications/NumericalOptimization.pdf)
- [Deep Learning, Goodfellow, Bengio and Courville](https://www.deeplearningbook.org/)
- [Optimisation for Machine Learning, Sra, Nowozin and Wright](https://doc.lagout.org/science/Artificial%20Intelligence/Machine%20learning/Optimization%20for%20Machine%20Learning%20%5BSra%2C%20Nowozin%20%26%20Wright%202011-09-30%5D.pdf)

---
Olly Britton — https://ollybritton.com. Machine-readable index: https://ollybritton.com/llms.txt
