Infamous_Age_7731
Infamous_Age_7731 OP t1_j3xlhxv wrote
Reply to comment by qiltb in Cloud VM GPU is much slower than my local GPU by Infamous_Age_7731
Oh yep, gotcha. They seem fine. The GPU on the cloud VM, for instance, is around 60 °C.
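For anyone wanting to log the temperature while the model trains rather than eyeball it, here is a minimal sketch that polls `nvidia-smi`. It assumes `nvidia-smi` is on the PATH and that the driver supports the `temperature.gpu`, `utilization.gpu` and `clocks.sm` query fields; the helper names `parse_gpu_csv` and `query_gpus` are made up for illustration.

```python
import subprocess

# Fields passed to nvidia-smi's --query-gpu option (assumed to be
# supported by the installed driver).
QUERY_FIELDS = ["temperature.gpu", "utilization.gpu", "clocks.sm"]


def parse_gpu_csv(text):
    """Turn nvidia-smi CSV output (no header, no units) into one dict per GPU."""
    rows = []
    for line in text.strip().splitlines():
        values = [v.strip() for v in line.split(",")]
        rows.append(dict(zip(QUERY_FIELDS, values)))
    return rows


def query_gpus():
    """Run nvidia-smi once and return the parsed readings (hypothetical helper)."""
    cmd = [
        "nvidia-smi",
        "--query-gpu=" + ",".join(QUERY_FIELDS),
        "--format=csv,noheader,nounits",
    ]
    out = subprocess.run(cmd, capture_output=True, text=True, check=True).stdout
    return parse_gpu_csv(out)
```

Calling `query_gpus()` in a loop during training on both machines would show whether the SM clock drops on the VM even though the temperature looks fine.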
Infamous_Age_7731 OP t1_j3rwt7o wrote
Reply to comment by qiltb in Cloud VM GPU is much slower than my local GPU by Infamous_Age_7731
I see, so you reckon this shouldn't be causing the issue.
Infamous_Age_7731 OP t1_j3rwor0 wrote
Reply to comment by qiltb in Cloud VM GPU is much slower than my local GPU by Infamous_Age_7731
Thanks for the input. I just ran `sudo dmesg --follow` and then ran my model, and I don't see any errors. It just reports that it loaded the UVM driver...
Memory usage is reasonable unless, of course, I push it close to the limit (e.g., with the batch size).
And what are the "temps"?
Infamous_Age_7731 OP t1_j3qye3r wrote
Reply to comment by No_Cryptographer9806 in Cloud VM GPU is much slower than my local GPU by Infamous_Age_7731
Thank you for your input, it makes sense. Nonetheless, I have adequate RAM, and I just checked the IO speed (using sysbench): the two are pretty much the same, with the VM's being a bit faster.
Infamous_Age_7731 OP t1_j3qy6qv wrote
Reply to comment by ivan_kudryavtsev in Cloud VM GPU is much slower than my local GPU by Infamous_Age_7731
> multithreaded `sysbench` to compare CPU and RAM
Thanks a lot for your input! I checked the CPU %steal and it looks fine, ranging from 0.0 to 0.1 st. I also don't think it's a shard, since on the NVIDIA card I have the full 80 GB of memory at my disposal (unless they're doing some trickery). I ran a series of `sysbench` tests and found that the VM's CPU is slightly worse for single-thread performance, but what is more astounding is the RAM speed. For 1 or 8 threads, the write is 0.8x slower and the read is 1.5x slower. The RAM speed drop seems to mirror the drop in iterations per second when I train the model. I guess this might be the culprit.
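In case it helps anyone repeat the %steal check over a longer window than a glance at `top`, here is a rough sketch that computes it from two samples of the aggregate `cpu` line in `/proc/stat`. It assumes a Linux kernel that reports the steal column (the eighth value on that line); the function names are made up for illustration.

```python
import time


def parse_cpu_line(stat_text):
    """Return the counters from the aggregate 'cpu' line of /proc/stat."""
    for line in stat_text.splitlines():
        if line.startswith("cpu "):
            return [int(x) for x in line.split()[1:]]
    raise ValueError("no aggregate 'cpu' line found")


def steal_percent(before, after):
    """Percentage of CPU time stolen by the hypervisor between two samples.

    Field order: user nice system idle iowait irq softirq steal ...
    Guest time is already counted inside user/nice, so only the first
    eight fields are summed for the total.
    """
    deltas = [a - b for a, b in zip(after[:8], before[:8])]
    total = sum(deltas)
    if total <= 0:
        return 0.0
    return 100.0 * deltas[7] / total


def sample_steal(interval=1.0):
    """Read /proc/stat twice, `interval` seconds apart (Linux only)."""
    with open("/proc/stat") as f:
        before = parse_cpu_line(f.read())
    time.sleep(interval)
    with open("/proc/stat") as f:
        after = parse_cpu_line(f.read())
    return steal_percent(before, after)
```

Running `sample_steal(60)` while the model trains would average over a full minute, which is harder to misread than a single instantaneous reading.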
Infamous_Age_7731 OP t1_j3qrkhp wrote
Reply to comment by ivan_kudryavtsev in Cloud VM GPU is much slower than my local GPU by Infamous_Age_7731
Yes indeed, I am not doing anything in parallel. I use them separately and I wanted to compare their internal design as you said.
Infamous_Age_7731 OP t1_j3qrctv wrote
Reply to comment by agentfuzzy999 in Cloud VM GPU is much slower than my local GPU by Infamous_Age_7731
Thanks for your advice. FYI, I use the A100 for larger models and/or longer inputs/outputs that don't fit on my 3080.
Infamous_Age_7731 OP t1_j3nr9wg wrote
Reply to comment by susoulup in Cloud VM GPU is much slower than my local GPU by Infamous_Age_7731
I haven't looked into that. I would guess it wouldn't matter in my case, but I might be wrong.
Infamous_Age_7731 OP t1_j3nqwgy wrote
Reply to comment by BellyDancerUrgot in Cloud VM GPU is much slower than my local GPU by Infamous_Age_7731
I see, thanks! In that case, I might ask the vendor more questions.
Submitted by Infamous_Age_7731 t3_107pcux in deeplearning
Infamous_Age_7731 OP t1_j630pnc wrote
Reply to comment by GPUaccelerated in Cloud VM GPU is much slower than my local GPU by Infamous_Age_7731
Oh I see, thanks, that fits my case then!