
Reading

Created: November 18, 2025 | Updated: November 18, 2025 | About these notes


  • [[Article - Deep, deep trouble, Elad]]
  • [[Paper - ADADELTA, An Adaptive Learning Rate Method]]
  • [[Paper - Attention Is All You Need (2017)]]
  • [[Paper - Error bounds for approximations with deep ReLU networks, Yarotsky (2016)]]
  • [[Paper - Explaining and harnessing adversarial examples (2015)]]
  • [[Paper - Exponential expressivity in deep neural networks through transient chaos (2016)]]
  • [[Paper - Gradient-based learning applied to document recognition, LeCun]]
  • [[Paper - Optimal nonlinear approximation, DeVore (1989)]]
  • [[Paper - Representation Benefits of Deep Feedforward Networks, Telgarsky (2015)]]
  • [[Paper - Why and when can deep networks avoid the curse of dimensionality, Poggio (2016)]]
© Copyright 2026 Olly Britton. Last updated: May 16, 2026.