j-solorzano t1_jdk2kod wrote on March 24, 2023 at 11:26 PM

If it works on the CPU but not on the GPU, even though the GPU should have more memory, the only difference I can think of is garbage collection timing. Try calling the garbage collector at the end of every epoch. Also, note that you have a GRU, which retains tensors.
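
A minimal sketch of the "collect every epoch" suggestion, assuming a PyTorch model (the thread does not name the framework). `free_memory`, `train`, and `train_one_epoch` are hypothetical names used only for illustration; the real calls are `gc.collect()` and, when CUDA is present, `torch.cuda.empty_cache()`.

```python
import gc

try:
    import torch  # assumption: the model in question is a PyTorch model
except ImportError:
    torch = None  # the sketch still runs without PyTorch, GPU step is skipped


def free_memory():
    """Run Python's garbage collector, then release cached GPU memory if any."""
    collected = gc.collect()  # number of unreachable objects found
    if torch is not None and torch.cuda.is_available():
        # Returns unused cached blocks to the driver; it does not free
        # tensors that are still referenced somewhere.
        torch.cuda.empty_cache()
    return collected


def train(num_epochs, train_one_epoch):
    """Hypothetical loop: `train_one_epoch` stands in for the real epoch code."""
    losses = []
    for epoch in range(num_epochs):
        losses.append(train_one_epoch(epoch))
        free_memory()  # collect at the end of every epoch
    return losses
```

This only helps if the memory is actually unreachable. If the GRU's hidden state is carried from one batch to the next, it may keep the whole autograd graph alive; in PyTorch, passing `hidden.detach()` into the next step is a common way to avoid that.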

Rishh3112 OP t1_jdl42av wrote on March 25, 2023 at 4:37 AM

Sure, I will try calling the garbage collector in every epoch. Thanks.