Reading
- [[Article - Deep, deep trouble, Elad]]
- [[Paper - ADADELTA, An Adaptive Learning Rate Method]]
- [[Paper - Attention Is All You Need (2017)]]
- [[Paper - Error bounds for approximations with deep ReLU networks, Yarotsky (2016)]]
- [[Paper - Explaining and harnessing adversarial examples (2015)]]
- [[Paper - Exponential expressivity in deep neural networks through transient chaos (2016)]]
- [[Paper - Gradient-based learning applied to document recognition, LeCun]]
- [[Paper - Optimal nonlinear approximation, DeVore (1989)]]
- [[Paper - Representation Benefits of Deep Feedforward Networks, Telgarsky (2015)]]
- [[Paper - When and when can deep networks avoid the curse of dimensionality, Poggio (2016)]]