Submitted by Infamous_Age_7731 t3_107pcux in deeplearning
BellyDancerUrgot t1_j3pw2m2 wrote
Reply to comment by ivan_kudryavtsev in Cloud VM GPU is much slower than my local GPU by Infamous_Age_7731
Oh I thought maybe he is going for distributed learning since he has access to 2 GPUs. In that case MPI has some overhead simply because it has to replicate, scatter and gather all the gradients per batch every epoch.
ivan_kudryavtsev t1_j3q0c02 wrote
>Oh I thought maybe he is going for distributed learning since he has access to 2 GPUs. In that case MPI has some overhead simply because it has to replicate, scatter and gather all the gradients per batch every epoch.
It looks like no; they speculated about the internal design of A100.
Infamous_Age_7731 OP t1_j3qrkhp wrote
Yes indeed, I am not doing anything in parallel. I use them separately and I wanted to compare their internal design as you said.
Viewing a single comment thread. View all comments