Viewing a single comment thread. View all comments

dumpyact t1_iqv19d4 wrote on October 3, 2022 at 8:53 AM

Have you tried using LR scheduler? I was able to reduce loss in similar situations with LR scheduler

Imaginary_Carrot4092 OP t1_iqv9whz wrote on October 3, 2022 at 10:56 AM

Do you mean to reduce the LR as training progresses ? But I already tried playing with the LR. It doesn't seem to change anything. Yes, the losses have a different magnitude but the pattern is the same.