Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour

June 28, 2020 — Written by Marc Päpper — ⏰ 8 min read

New blog series: Deep Learning Papers visualized This is the first post of a new series I am starting where I explain the content of a paper in a visual picture-based way. To me, this helps tremendously to better grasp the ideas and remember them and I hope this will be the same for many of you as well. Today’s paper: Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour by Goyal et al.

PyTorch multi-GPU training for faster machine learning results

February 29, 2020 — Written by Marc Päpper — ⏰ 5 min read

#python #pytorch #deeplearning #distributed #speed

When you have a big data set and a complicated machine learning problem, chances are that training your model takes a couple of days even on a modern GPU. However, it is well-known that the cycle of having a new idea, implementing it and then verifying it should be as quick as possible. This is to ensure that you can efficiently test out new ideas. If you need to wait for a whole week for your training run, this becomes very inefficient.

Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour

PyTorch multi-GPU training for faster machine learning results

I help you listen through the noise in machine learning: