Submitted by Rishh3112 t3_120gvgw in deeplearning
j-solorzano t1_jdk2kod wrote
If it works on the CPU but not on the GPU, even though the GPU should have more memory, the only difference I can think of is garbage collection timing. Try calling the garbage collector at the end of every epoch. Also, note that you have a GRU, which retains tensors.
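A rough sketch of what I mean, in case it helps. The thread doesn't say which framework you use, so PyTorch is an assumption here and the import is optional. `free_memory`, `train`, and `train_one_epoch` are made-up names for illustration, not anything from your code.

```python
import gc

try:
    import torch  # optional; PyTorch is an assumption, not stated in the thread
except ImportError:
    torch = None


def free_memory():
    """Run Python's garbage collector; if PyTorch with CUDA is present,
    also release cached GPU blocks that are no longer in use."""
    collected = gc.collect()  # number of unreachable objects found
    if torch is not None and torch.cuda.is_available():
        torch.cuda.empty_cache()
    return collected


def train(num_epochs, train_one_epoch):
    """Call the hypothetical train_one_epoch(epoch) callback, then free
    memory once per epoch."""
    losses = []
    for epoch in range(num_epochs):
        losses.append(train_one_epoch(epoch))
        free_memory()
    return losses
```

For the GRU part: if you carry the hidden state from one batch to the next in PyTorch, detaching it between batches (`hidden = hidden.detach()`) is the usual way to stop it from keeping earlier computation graphs alive. Whether that applies depends on how your loop is written.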
Rishh3112 OP t1_jdl42av wrote
Sure, I will try calling the garbage collector in every epoch. Thanks.