Infamous_Age_7731 OP t1_j3rwor0 wrote

Thanks for the input. I just ran sudo dmesg --follow and then ran my model, and I don't see any errors. It just reports that it loaded the UVM driver...
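For anyone following along, here is a rough sketch of how that check could be scripted rather than eyeballed. It reads the kernel log once with plain dmesg and keeps only lines that look like trouble. The keyword list is my own assumption, not an exhaustive filter: NVIDIA driver faults usually appear as "NVRM: Xid" lines, and the other patterns are generic guesses. The names find_suspicious and read_kernel_log are made up for this example.

```python
import re
import subprocess

# Assumption: NVIDIA driver faults appear in the kernel log as "NVRM: Xid" lines.
# The remaining keywords are a generic, hypothetical filter, not a complete list.
SUSPICIOUS = re.compile(
    r"NVRM: Xid|out of memory|segfault|hardware error", re.IGNORECASE
)


def find_suspicious(lines):
    """Return the kernel-log lines that match one of the patterns above."""
    return [line for line in lines if SUSPICIOUS.search(line)]


def read_kernel_log():
    """Read the current kernel log once (may need root, as with sudo dmesg)."""
    result = subprocess.run(["dmesg"], capture_output=True, text=True, check=True)
    return result.stdout.splitlines()


if __name__ == "__main__":
    for line in find_suspicious(read_kernel_log()):
        print(line)
```

If this prints nothing after a training run, that matches what I saw by hand: no obvious driver or out-of-memory messages in the kernel log.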

Memory usage is reasonable unless, of course, I push it close to the limit (e.g., by increasing the batch size).

And what are the "temps"?

qiltb t1_j3uaop0 wrote

Sorry, I meant the temperatures of the GPU, CPU, etc.

Infamous_Age_7731 OP t1_j3xlhxv wrote

Oh yep, gotcha. They seem fine. The GPU on the cloud, for instance, is around 60 °C.
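In case it helps someone checking the same thing, a minimal sketch of reading GPU temperature and memory from a script, assuming nvidia-smi is installed and on the PATH. It uses nvidia-smi's CSV query mode. The helper names parse_gpu_stats and query_gpu_stats are hypothetical, and the parser assumes every field is a plain integer, so it would fail on a GPU that reports [N/A].

```python
import subprocess

# Fields requested from nvidia-smi: temperature in C, memory in MiB.
QUERY = "temperature.gpu,memory.used,memory.total"


def parse_gpu_stats(csv_text):
    """Parse 'temp, used, total' rows (one per GPU) into dictionaries."""
    stats = []
    for line in csv_text.strip().splitlines():
        temp, used, total = (int(field.strip()) for field in line.split(","))
        stats.append(
            {"temp_c": temp, "mem_used_mib": used, "mem_total_mib": total}
        )
    return stats


def query_gpu_stats():
    """Run nvidia-smi once and return the parsed stats for every GPU."""
    out = subprocess.run(
        ["nvidia-smi", f"--query-gpu={QUERY}", "--format=csv,noheader,nounits"],
        capture_output=True, text=True, check=True,
    ).stdout
    return parse_gpu_stats(out)


if __name__ == "__main__":
    for index, gpu in enumerate(query_gpu_stats()):
        used_fraction = gpu["mem_used_mib"] / gpu["mem_total_mib"]
        print(f"GPU {index}: {gpu['temp_c']} C, memory {used_fraction:.0%} used")
```

Running this in a loop during training would show both whether the temperature creeps up and how close the batch size pushes memory to the limit.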