BellyDancerUrgot t1_j3pw2m2 wrote on January 10, 2023 at 6:25 AM

Reply to comment by ivan_kudryavtsev in Cloud VM GPU is much slower than my local GPU by Infamous_Age_7731

Oh I thought maybe he is going for distributed learning since he has access to 2 GPUs. In that case MPI has some overhead simply because it has to replicate, scatter and gather all the gradients per batch every epoch.

ivan_kudryavtsev t1_j3q0c02 wrote on January 10, 2023 at 7:15 AM

>Oh I thought maybe he is going for distributed learning since he has access to 2 GPUs. In that case MPI has some overhead simply because it has to replicate, scatter and gather all the gradients per batch every epoch.

It looks like no; they speculated about the internal design of A100.

Infamous_Age_7731 OP t1_j3qrkhp wrote on January 10, 2023 at 12:56 PM

Yes indeed, I am not doing anything in parallel. I use them separately and I wanted to compare their internal design as you said.