[D] Training deep learning models on a cluster
I have always either trained models on my own gpu or on Google Colab. However, I need to now train a model on a cluster situated locally in a lab. All I know is that I need to use SSH and a docker container. Can anyone share any resources that will be helpful in getting started? Couldn’t find much for beginners on YouTube or Google.
submitted by /u/ssd123456789
[link] [comments]