Deep Learning (DL)
Deep learning (DL), also known as deep structured learning, is part of a broader family of AI/ML methods based on artificial neural networks with representation learning. Learning can be supervised, semi-supervised or unsupervised. DL uses large neural networks with many layers of processing units, taking advantage of advances in computing power and improved training techniques to learn complex patterns in large amounts of data.
Resources
- https://github.com/ChristosChristofidis/awesome-deep-learning
- https://github.com/endymecy/awesome-deeplearning-resources
- https://en.wikipedia.org/wiki/Deep_learning
- Deep Learning Curriculum
- https://jalammar.github.io/visual-interactive-guide-basics-neural-networks/
- A Quick Introduction to Neural Networks
- Deep Neural Nets: 33 years ago and 33 years from now (Andrej Karpathy)
- Deep learning's diminishing returns (Thompson)
- Deep Learning Is Hitting a Wall
- A Brief History of Neural Nets and Deep Learning (2020)
- Time Benchmark of models
- A Recipe for Training Neural Networks
- Computer Scientists Prove Why Bigger Neural Networks Do Better
- No, We Don't Have to Choose Batch Sizes As Powers Of 2
DL news aggregators
Cheatsheets
When to use and not to use deep learning
- When and When Not to Use Deep Learning
- You can probably use deep learning even if your data isn't that big
- When not to use deep learning
- Using ANNs on small data - Deep Learning vs. Xgboost
- The limitations of deep learning
Books
- #BOOK Deep Learning Interviews: Hundreds of fully solved job interview questions from a wide range of key topics in AI (Kashani 2022)
- #BOOK The Principles of DL Theory: An Effective Theory Approach to Understanding Neural Networks (Roberts 2022)
- #BOOK Deep Learning Book (Goodfellow, 2016 MIT)
- The Deep Learning textbook is a resource intended to help students and practitioners enter the field of machine learning in general and deep learning in particular
- #BOOK Dive into Deep Learning (Zhang)
- An interactive deep learning book for students, engineers, and researchers. Uses MXNet/Gluon, PyTorch and TensorFlow
- Jupyter notebooks for each section
Talks
- #TALK The Future of Sparsity in Deep Learning (Trevor Gale, Phd student Stanford, 2021)
- #TALK Deep Learning (Yoshua Bengio, MLSS 2020):
- #TALK Deep Learning Hardware: Past, Present, and Future (Yann LeCun, ISSCC 2019)
- #TALK Deep Learning and the Future of Artificial Intelligence (Yann LeCun, 2018)
- #TALK AI Breakthroughs & Obstacles to Progress, Mathematical and Otherwise (Yann LeCun, 2018)
- #TALK François Chollet at France is AI 2017: Deep Learning: current limits and future perspectives (Chollet 2017)
- #TALK Power & Limits of Deep Learning (Yann Lecun, 2017)
- #TALK The Deep End of Deep Learning (Hugo Larochelle, TEDxBoston 2016)
- #TALK How deep neural networks work (Brandon Rohrer)
- Simple explanations of DL basics and nice graphics
Courses
- #COURSE nn-zero-to-hero (Karpathy)
- #COURSE Introduction to Deep Learning (COMP0090, UCL)
- #COURSE Full Stack Deep Learning
- #COURSE Deep Learning (NYU)
- #COURSE Deep Learning (CS230, Stanford)
- #COURSE Tensorflow for Deep Learning Research (CS20SI, Stanford)
- #COURSE DeepMind x UCL | Deep Learning Lecture Series 2020
- #COURSE Introduction to Deep Learning (6.S191, MIT)
- #COURSE MIT Deep Learning and Artificial Intelligence Lectures
- #COURSE Intro to Neural Networks and Machine Learning (CSC 321, UToronto)
- #COURSE Deep Learning nanodegree (Udacity)
- #COURSE Deep Learning with PyTorch: Zero to GANs (Jovian)
- #COURSE Fast AI - Practical Deep Learning For Coders
- Deep Learning for Coders with fastai and PyTorch: AI Applications Without a PhD - the book and the course
- https://github.com/fastai/fastbook
- #COURSE CS 152: Neural Networks (Harvey Mudd college)
- #COURSE Deep Learning course (U Paris-Saclay)
- #COURSE Introduction to Machine Learning and Neural Networks (Uniandes)
- #COURSE Deep learning specialization (deeplearning.ai, Coursera, Andrew Ng)
- #COURSE Neural Networks (U Sherbrooke)
- #COURSE The Neural Aesthetic (ITP-NYU)
Code
State of ML frameworks:
- TensorFlow, PyTorch, and JAX: Choosing a deep learning framework
- #CODE Ivy - Convert Machine Learning Code Between Frameworks
- #CODE Huggingface - Build, train and deploy state of the art models powered by the reference open source in ML
- #CODE Openvino - open-source toolkit for optimizing and deploying AI inference
- #CODE Triton - language and compiler for writing highly efficient custom Deep-Learning primitives
- https://openai.com/blog/triton/
- https://www.infoq.com/news/2021/08/openAI-triton/
- Triton uses Python as its base. The developer writes code in Python using Triton's libraries, which is then JIT-compiled to run on the GPU. This allows integration with the rest of the Python ecosystem, currently the biggest destination for developing machine-learning solutions (see the minimal kernel sketch at the end of this section)
- #CODE Oneflow - OneFlow is a performance-centered and open-source deep learning framework
- #CODE Paddle (Baidu) - PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice
- #CODE Chainer - Chainer is a Python-based deep learning framework aiming at flexibility
- #CODE PySyft - PySyft is a Python library for secure and private Deep Learning
- PySyft decouples private data from model training, using Federated Learning, Differential Privacy, and Encrypted Computation (like Multi-Party Computation (MPC) and Homomorphic Encryption (HE)) within the main Deep Learning frameworks like PyTorch and TensorFlow (see the illustrative federated-averaging sketch at the end of this section).
- #PAPER A generic framework for privacy preserving deep learning
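To make the Triton description above concrete, here is a minimal vector-add kernel sketch in the style of the official Triton tutorial (the kernel name, block size and grid are illustrative choices, not recommendations):

```python
# Minimal Triton sketch: a vector-add kernel JIT-compiled from Python to the GPU.
import torch
import triton
import triton.language as tl

@triton.jit
def add_kernel(x_ptr, y_ptr, out_ptr, n_elements, BLOCK_SIZE: tl.constexpr):
    pid = tl.program_id(axis=0)                       # which block this program handles
    offsets = pid * BLOCK_SIZE + tl.arange(0, BLOCK_SIZE)
    mask = offsets < n_elements                       # guard against out-of-bounds accesses
    x = tl.load(x_ptr + offsets, mask=mask)
    y = tl.load(y_ptr + offsets, mask=mask)
    tl.store(out_ptr + offsets, x + y, mask=mask)

def add(x: torch.Tensor, y: torch.Tensor) -> torch.Tensor:
    out = torch.empty_like(x)
    n = x.numel()
    grid = (triton.cdiv(n, 1024),)                    # one program instance per 1024-element block
    add_kernel[grid](x, y, out, n, BLOCK_SIZE=1024)   # compiled on first call, then launched on the GPU
    return out
```

PySyft's API changes frequently, so rather than quote it, here is an illustrative federated-averaging loop in plain PyTorch that shows the idea of keeping private data on each client and sharing only model weights (the function names are hypothetical; this is not PySyft's actual interface):

```python
# Illustrative federated averaging (not PySyft's API): clients train locally on
# private data and only the resulting weights are averaged on the server.
import copy
import torch
import torch.nn as nn

def local_update(global_model, loader, epochs=1, lr=0.01):
    model = copy.deepcopy(global_model)
    opt = torch.optim.SGD(model.parameters(), lr=lr)
    loss_fn = nn.CrossEntropyLoss()
    for _ in range(epochs):
        for x, y in loader:                           # private data never leaves the client
            opt.zero_grad()
            loss_fn(model(x), y).backward()
            opt.step()
    return model.state_dict()

def federated_average(global_model, client_loaders):
    states = [local_update(global_model, dl) for dl in client_loaders]
    avg = {k: torch.stack([s[k].float() for s in states]).mean(0) for k in states[0]}
    global_model.load_state_dict(avg)                 # load_state_dict casts back to original dtypes
    return global_model
```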
References
- #PAPER Deep learning in NNs: An overview (Schmidhuber 2015)
- #PAPER Deep learning (LeCun 2015)
- #PAPER Deep Neural Decision Forests (Kontschieder 2016)
- #PAPER On the Origin of Deep Learning (Wang 2017)
- #PAPER Representation Learning on Large and Small Data (Chou 2017)
- #PAPER Deep Learning in Neural Networks: An Overview (Schmidhuber, 2018)
- #PAPER Deep Learning as a Mixed Convex-Combinatorial Optimization Problem (Friesen 2018)
- #PAPER Using Deep Neural Networks for Inverse Problems in Imaging: Beyond Analytical Methods (Lucas, 2018)
- #PAPER Neural Tangent Kernel: Convergence and Generalization in Neural Networks (Jacot 2018)
- #PAPER Neural circuit policies enabling auditable autonomy (Lechner 2020)
- #PAPER Implicitly Defined Layers in Neural Networks (Zhang 2020)
- #PAPER A Mathematical Principle of Deep Learning: Learn the Geodesic Curve in the Wasserstein Space (Gai 2021)
- #PAPER Why is AI hard and Physics simple? (Roberts 2021)
- #PAPER Deep Learning for AI (By Yoshua Bengio, Yann Lecun, Geoffrey Hinton, Turing lecture, 2021)
- #PAPER Self-Tuning for Data-Efficient Deep Learning (Wang 2021)
- #PAPER Controlling Neural Networks with Rule Representations (Seo 2021)
- #PAPER Deep physical neural networks trained with backpropagation (Wright 2022)
- #PAPER Ensemble deep learning: A review (Ganaie 2022)
- #PAPER projUNN: efficient method for training deep networks with unitary matrices (Kiani 2022)
- #PAPER LilNetX: Lightweight Networks with EXtreme Model Compression and Structured Sparsification (Girish 2022)
Generalization
- http://www.inference.vc/everything-that-works-works-because-its-bayesian-2/
- #PAPER Understanding deep learning requires re-thinking generalization (Zhang 2016)
- #PAPER A Closer Look at Memorization in Deep Networks (Arpit 2017)
- #PAPER Deep nets don't learn via memorization (Krueger 2017)
- #PAPER Rethinking generalization requires revisiting old ideas: statistical mechanics approaches and complex learning behavior (Martin 2017)
- #PAPER Ablation Studies in Artificial Neural Networks (Meyes 2019)
- #PAPER Towards Understanding Ensemble, Knowledge Distillation and Self-Distillation in Deep Learning (Allen-Zhu 2020)
- #PAPER The Deep Bootstrap Framework: Good Online Learners are Good Offline Generalizers (Nakkiran 2021)
- #PAPER Predicting trends in the quality of state-of-the-art neural networks without access to training or testing data (Martin 2021)
- #PAPER Stochastic Training is Not Necessary for Generalization (Geiping 2021)
- #PAPER Underspecification Presents Challenges for Credibility in Modern Machine Learning (D'Amour 2021)
- #PAPER Learning in High Dimension Always Amounts to Extrapolation (Balestriero 2021)
- In order for NNs to succeed at solving a task, they have to operate in the "extrapolation" regime! But not all of them generalise as well as others. So this opens up new questions about the relationship between this specific notion of extrapolation and generalisation more generally.
- #PAPER Incorporating Symmetry into Deep Dynamics Models for Improved Generalization (Wang 2021)
- #PAPER Grokking - Generalization beyond overfitting on small algorithmic datasets (Power 2022)
Regularization
- In general, techniques aimed at reducing overfitting and improving generalization
- Overfit and underfit
- Regularization techniques for training deep neural networks
- https://towardsdatascience.com/regularization-in-deep-learning-l1-l2-and-dropout-377e75acc036
- https://machinelearningmastery.com/how-to-reduce-overfitting-in-deep-learning-with-weight-regularization/
- https://medium.com/intelligentmachines/convolutional-neural-network-and-regularization-techniques-with-tensorflow-and-keras-5a09e6e65dc7
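As a concrete illustration of the weight-regularization and dropout links above, a minimal Keras sketch (the layer sizes, dropout rate and the 1e-4 L2 penalty are arbitrary placeholders, not recommendations):

```python
# Minimal Keras sketch: L2 weight penalty plus dropout as regularizers.
import tensorflow as tf
from tensorflow.keras import layers, regularizers

model = tf.keras.Sequential([
    layers.Dense(128, activation="relu",
                 kernel_regularizer=regularizers.l2(1e-4)),   # penalize large weights
    layers.Dropout(0.5),                                      # randomly zero 50% of units (training only)
    layers.Dense(10, activation="softmax"),
])
model.compile(optimizer="adam",
              loss="sparse_categorical_crossentropy",
              metrics=["accuracy"])
```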
Data augmentation
See AI/Supervised Learning/Data augmentation
Dropout
- https://pgaleone.eu/deep-learning/regularization/2017/01/10/anaysis-of-dropout/
- 12 Main Dropout Methods: Mathematical and Visual Explanation for DNNs, CNNs, and RNNs
- #PAPER Dropout: A Simple Way to Prevent Neural Networks from Overfitting (Srivastava 2014)
- #PAPER Efficient Object Localization Using Convolutional Networks (Tompson 2015)
- Proposed spatial dropout
- #PAPER Analysis on the Dropout Effect in Convolutional Neural Networks (Park 2017)
- #PAPER Effective and Efficient Dropout for Deep Convolutional Neural Networks (Cai 2020)
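Standard dropout zeroes individual activations, while the spatial dropout proposed in Tompson 2015 (above) zeroes entire feature maps; a minimal Keras comparison (the tensor shape is illustrative):

```python
# Dropout masks activations elementwise; SpatialDropout2D drops whole feature maps.
import tensorflow as tf
from tensorflow.keras import layers

x = tf.random.normal((4, 32, 32, 16))                         # (batch, height, width, channels)
elementwise = layers.Dropout(0.3)(x, training=True)           # independent mask per activation
per_channel = layers.SpatialDropout2D(0.3)(x, training=True)  # one mask shared across each channel
```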
Stochastic depth
- #PAPER Deep Networks with Stochastic Depth (Huang 2016)
- Stochastic depth is a regularization technique that randomly drops a set of layers during training. During inference, all layers are kept. It is similar to Dropout, but it operates on a block of layers rather than on individual nodes within a layer
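A minimal PyTorch sketch of the idea, wrapping an arbitrary residual block (this is an illustration of the mechanism, not the paper's exact implementation):

```python
# Stochastic depth: during training, the residual branch is skipped with
# probability 1 - survival_prob; at inference it is kept and scaled.
import torch
import torch.nn as nn

class StochasticDepthBlock(nn.Module):
    def __init__(self, block: nn.Module, survival_prob: float = 0.8):
        super().__init__()
        self.block = block
        self.survival_prob = survival_prob

    def forward(self, x):
        if self.training:
            if torch.rand(1).item() < self.survival_prob:
                return x + self.block(x)                      # block survives this pass
            return x                                          # block dropped entirely
        return x + self.survival_prob * self.block(x)         # expected contribution at inference
```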
Normalization
- Normalization techniques also reduce generalization error, providing some regularization
- Normalization Techniques in Deep Neural Networks
- Different Types of Normalization in Tensorflow
- Normalization in Deep Learning
- https://sebastianraschka.com/faq/docs/scale-training-test.html
- Data normalization/standardization can be used as an alternative (before training) to synch batchnorm (multi-gpu training)
- Spectral normalization
- #PAPER #REVIEW Normalization Techniques in Training DNNs: Methodology, Analysis and Application (Huang 2020)
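For orientation, a minimal PyTorch sketch of how the common normalization layers differ in the axes over which statistics are computed (the shapes are illustrative):

```python
# The main normalization layers differ in which axes the mean/variance are taken over.
import torch
import torch.nn as nn

x = torch.randn(8, 32, 28, 28)         # (batch, channels, height, width)
bn = nn.BatchNorm2d(32)(x)             # per-channel stats over (batch, H, W)
ln = nn.LayerNorm([32, 28, 28])(x)     # per-sample stats over (C, H, W)
inorm = nn.InstanceNorm2d(32)(x)       # per-sample, per-channel stats over (H, W)
gn = nn.GroupNorm(8, 32)(x)            # per-sample stats over groups of channels
```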
BatchNorm
- #PAPER Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift (Ioffe 2015)
- #TALK https://www.youtube.com/watch?v=ZOabsYbmBRM&feature=youtu.be
- http://stackoverflow.com/questions/34716454/where-do-i-call-the-batchnormalization-function-in-keras
- Slower convergence w/o BN, BN can be applied on top of standardization
- Synch BatchNorm appears in TF 2.2, for multi-gpu training
- #PAPER Rethinking the Usage of Batch Normalization and Dropout (Chen 2019)
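On the Stack Overflow question above about where to call BatchNormalization in Keras: a common (but not mandatory) convention is to place it between the linear transform and the activation, as in this minimal sketch:

```python
# Common Keras convention: Dense/Conv -> BatchNormalization -> activation.
import tensorflow as tf
from tensorflow.keras import layers

model = tf.keras.Sequential([
    layers.Dense(128, use_bias=False),   # bias is redundant when followed by BatchNorm
    layers.BatchNormalization(),
    layers.Activation("relu"),
    layers.Dense(10, activation="softmax"),
])
```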
Activations
- Fundamentals of Deep Learning - Activation Functions and When to Use Them?
- https://ml-cheatsheet.readthedocs.io/en/latest/activation_functions.html
- What are the advantages of ReLU over sigmoid function in deep neural networks?
- Two additional major benefits of ReLUs are sparsity and a reduced likelihood of vanishing gradient
- ReLU and Softmax Activation Functions
- The softmax function squashes the outputs of each unit to be between 0 and 1, just like a sigmoid function. But it also divides each output such that the total sum of the outputs is equal to 1
- The output of the softmax function is equivalent to a categorical probability distribution, it tells you the probability that any of the classes are true
- Sigmoid and SoftMax Functions in 5 minutes
- Sigmoid is used for binary classification methods where we only have 2 classes, while SoftMax applies to multiclass problems. In fact, the SoftMax function is an extension of the Sigmoid function
- #PAPER ImageNet Classification with Deep Convolutional Neural Networks (Krizhevsky 2012)
- #PAPER Universal activation function for machine learning (Yuen 2021)
- #PAPER #REVIEW Activation Functions in Deep Learning: A Comprehensive Survey and Benchmark (Dubey 2022)
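A small numeric sketch of the sigmoid/softmax points above (the logits are made up):

```python
# Softmax turns a logit vector into a categorical distribution that sums to 1;
# sigmoid maps each logit independently into (0, 1).
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def softmax(z):
    e = np.exp(z - z.max())                         # subtract the max for numerical stability
    return e / e.sum()

logits = np.array([2.0, 1.0, 0.1])
print(sigmoid(logits))                              # elementwise, does not sum to 1
print(softmax(logits), softmax(logits).sum())       # probabilities summing to 1.0
```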
Loss functions
- Cross entropy
- Perceptual loss, image reconstruction
- https://arxiv.org/pdf/1511.06409.pdf (Learning to Generate Images With Perceptual Similarity Metrics)
- #PAPER Loss Functions for Image Restoration with Neural Networks (Zhao 2018)
- https://medium.com/@sanari85/rediscovery-of-ssim-index-in-image-reconstruction-ssim-as-a-loss-function-a1ffef7d2be
- Three different metrics are commonly used to compare methods: DSSIM, MSE, and MAE. Structural dissimilarity (DSSIM) is an image distance metric that corresponds better to human perception than MAE or RMSE. Mean Squared Error (MSE) measures the average of the squared differences between estimated and actual values. Mean Absolute Error (MAE) is the average absolute difference between corresponding pixels. https://arxiv.org/abs/2001.05372 (see the sketch at the end of this list)
- Deep learning image enhancement insights on loss function engineering
- Mean squared logarithmic error
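A minimal TensorFlow sketch of the MSE/MAE/SSIM comparison discussed above (the images are random placeholders; using 1 - SSIM is one common way to turn the similarity index into a loss):

```python
# MSE and MAE are pixelwise errors; SSIM is a perceptual similarity index,
# so 1 - SSIM can serve as a reconstruction loss.
import tensorflow as tf

y_true = tf.random.uniform((4, 64, 64, 1))          # illustrative image batch in [0, 1]
y_pred = tf.clip_by_value(y_true + tf.random.normal(y_true.shape, stddev=0.05), 0.0, 1.0)

mse = tf.reduce_mean(tf.square(y_true - y_pred))
mae = tf.reduce_mean(tf.abs(y_true - y_pred))
ssim_loss = 1.0 - tf.reduce_mean(tf.image.ssim(y_true, y_pred, max_val=1.0))
```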
Optimizers and backpropagation
- How to use Learning Curves to Diagnose Machine Learning Model Performance
- https://www.quora.com/Intuitively-how-does-mini-batch-size-affect-the-performance-of-stochastic-gradient-descent
- Keras optimizers
- Adam
- An overview of gradient descent optimization algorithms (2016)
- https://hackernoon.com/some-state-of-the-art-optimizers-in-neural-networks-a3c2ba5a5643
- https://www.jeremyjordan.me/neural-networks-training/
- http://colah.github.io/posts/2015-08-Backprop/
- Back-propagation - Math Simplified
- https://mattmazur.com/2015/03/17/a-step-by-step-backpropagation-example/
- https://venturebeat.com/2020/12/16/at-neurips-2020-researchers-proposed-faster-more-efficient-alternatives-to-backpropagation/amp/
- #PAPER On Large-Batch Training for Deep Learning: Generalization Gap and Sharp Minima (Keskar 2017)
- #PAPER Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour (Goyal 2018)
- #PAPER Decoupled Weight Decay Regularization (Loshchilov 2018)
- #PAPER Deep Double Descent: Where Bigger Models and More Data Hurt (Nakkiran 2019)
- #PAPER Reconciling modern machine learning practice and the bias-variance trade-off (Belkin 2019)
- #PAPER Descending through a Crowded Valley -- Benchmarking Deep Learning Optimizers (Schmidt 2020)
- #PAPER Early Stopping in Deep Networks: Double Descent and How to Eliminate it (Heckel 2020)
- contrary to model-wise double descent, epoch-wise double descent is not a phenomenon tied to over-parameterization
- both under- and overparameterized models can have epoch-wise double descent
- #CODE https://github.com/MLI-lab/early_stopping_double_descent
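To ground the backpropagation and SGD links above, a minimal numpy sketch of the forward and backward passes for a two-layer network trained with plain gradient descent (all sizes, the data and the learning rate are arbitrary):

```python
# Two-layer network with a manual backward pass and vanilla gradient descent.
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(64, 10))                 # toy inputs
y = rng.normal(size=(64, 1))                  # toy regression targets

W1, b1 = 0.1 * rng.normal(size=(10, 32)), np.zeros(32)
W2, b2 = 0.1 * rng.normal(size=(32, 1)), np.zeros(1)
lr = 0.01

for step in range(100):
    # forward pass
    h = np.maximum(0.0, X @ W1 + b1)          # ReLU hidden layer
    y_hat = h @ W2 + b2
    loss = np.mean((y_hat - y) ** 2)          # MSE loss

    # backward pass (chain rule)
    d_yhat = 2.0 * (y_hat - y) / len(X)
    dW2, db2 = h.T @ d_yhat, d_yhat.sum(axis=0)
    d_h = (d_yhat @ W2.T) * (h > 0)           # ReLU gradient
    dW1, db1 = X.T @ d_h, d_h.sum(axis=0)

    # gradient descent update
    W1 -= lr * dW1; b1 -= lr * db1
    W2 -= lr * dW2; b2 -= lr * db2
```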
Efficiency and performance
- #PAPER Efficient Deep Learning: A Survey on Making Deep Learning Models Smaller, Faster, and Better (Menghani 2021)
Distributed DL
See AI/Data Engineering/Distributed DL
Attention
- See AI/Deep learning/Transformers#For NLP and /AI/Deep learning/CNNs#Visual/Channel attention and Saliency
- #COURSE Attention and Memory in Deep Learning (DeepMind x UCL | Deep Learning Lectures | 8/12)
Explainability methods for Neural Networks
See AI/Deep learning/Explainability methods for NNs
Applications
DL for multi-dimensional data
- See AI/Computer Vision/Video segmentation and prediction, AI/Deep learning/Encoder-decoder networks, AI/Deep learning/Transformers and AI/Generative AI/GenAI
- #PAPER Demystifying Deep Learning in Predictive Spatio-Temporal Analytics: An Information-Theoretic Framework (Tan 2020)
DL for tabular data
- An Introduction to Deep Learning for Tabular Data
- Applying Deep Learning on Tabular Data Using TensorFlow 2.0
- A short chronology of deep learning for tabular data (Sebastian Raschka)
- #CODE Pytorch tabular
- #PAPER Neural Oblivious Decision Ensembles for Deep Learning on Tabular Data (Popov 2019)
- #PAPER TabNet: Attentive Interpretable Tabular Learning (Arik 2020)
- #PAPER Converting tabular data into images for deep learning with convolutional neural networks (Zhu 2021)
- #PAPER Tabular Data: Deep Learning is Not All You Need (Shwartz-Ziv 2021)
- #PAPER XBNet: An Extremely Boosted Neural Network (Sarkar 2021)
- #CODE XBNet - Boosted neural network for tabular data
- https://analyticsindiamag.com/guide-to-xbnet-an-extremely-boosted-neural-network/
- #PAPER Revisiting Deep Learning Models for Tabular Data (Gorishniy 2021)
- #PAPER TABBIE: Pretrained Representations of Tabular Data (Iida 2021)
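A minimal Keras sketch of the pattern described in the links above: embed categorical columns, concatenate them with numeric features and feed an MLP (the inputs, vocabulary size and dimensions are hypothetical):

```python
# Hypothetical tabular setup: one integer-encoded categorical column plus 8 numeric features.
import tensorflow as tf
from tensorflow.keras import layers

cat_in = tf.keras.Input(shape=(1,), dtype="int32", name="category")
num_in = tf.keras.Input(shape=(8,), name="numeric")

emb = layers.Flatten()(layers.Embedding(input_dim=100, output_dim=8)(cat_in))  # learned category embedding
x = layers.Concatenate()([emb, num_in])
x = layers.Dense(64, activation="relu")(x)
out = layers.Dense(1, activation="sigmoid")(x)

model = tf.keras.Model([cat_in, num_in], out)
model.compile(optimizer="adam", loss="binary_crossentropy")
```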
DL for scientific discovery
See AI/AI for scientific discovery
Multimodal learning
See AI/Deep learning/Multimodal learning
DL for NLP, time series and sequence modelling
See AI/Time Series analysis, AI/Forecasting and "Deep learning approaches" in AI/NLP
Architectures and model families
- The neural network zoo
- Deep Learning Tips and Tricks cheatsheet
- A Visual and Interactive Guide to the Basics of NNs
- A Visual And Interactive Look at Basic Neural Network Math
- #CODE Model Zoo
- #CODE Deep Learning Models (Raschka)
Geometric DL
See AI/Deep learning/Geometric deep learning
MLPs
Deep belief network
See AI/Deep learning/Deep belief network
Autoencoders
See AI/Deep learning/Autoencoders
CNNs
RNNs
CapsNets
GANs
Diffusion models
See AI/Deep learning/Diffusion models
GNNs
Residual and dense neural networks
See AI/Deep learning/Residual and dense neural networks
Neural ODEs
See AI/Deep learning/Neural ODEs
Fourier Neural Operators
See AI/Deep learning/Fourier Neural Operators
Transformers
See AI/Deep learning/Transformers
GFlowNets
See AI/Deep learning/GFlowNets
Neural Cellular Automata
See AI/Deep learning/Neural Cellular Automata
Neural processes
See AI/Deep learning/Neural processes
Bayesian/probabilistic DL
See AI/Deep learning/Probabilistic deep learning
Implicit Neural Representations
See AI/Deep learning/Implicit Neural Representations