dumpyact t1_iqv19d4 wrote

Have you tried using an LR scheduler? I was able to reduce the loss in similar situations with one.


Imaginary_Carrot4092 OP t1_iqv9whz wrote

Do you mean reducing the LR as training progresses? I already tried playing with the LR, and it doesn't seem to change anything. The losses have a different magnitude, yes, but the pattern is the same.
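
For reference, here is a rough sketch of what "reducing the LR as training progresses" could look like, as opposed to just picking a different fixed LR. It is plain Python with no framework, and the names (`step_decay`, `ReduceOnPlateau`, `drop_factor`, `patience`, and so on) are made up for illustration. They are not any particular library's API and not necessarily what the commenter above used.

```python
def step_decay(initial_lr, epoch, drop_factor=0.5, epochs_per_drop=10):
    """Fixed schedule: multiply the LR by `drop_factor` every `epochs_per_drop` epochs."""
    return initial_lr * (drop_factor ** (epoch // epochs_per_drop))


class ReduceOnPlateau:
    """Loss-driven schedule: shrink the LR when the loss stops improving.

    Hypothetical helper for illustration only; defaults are arbitrary.
    """

    def __init__(self, lr, factor=0.1, patience=3, min_lr=1e-6):
        self.lr = lr
        self.factor = factor        # multiplier applied on each reduction
        self.patience = patience    # non-improving epochs tolerated before reducing
        self.min_lr = min_lr        # lower bound on the LR
        self.best = float("inf")    # best (lowest) loss seen so far
        self.bad_epochs = 0         # consecutive epochs without improvement

    def step(self, loss):
        """Call once per epoch with the monitored loss; returns the LR to use next."""
        if loss < self.best:
            self.best = loss
            self.bad_epochs = 0
        else:
            self.bad_epochs += 1
            if self.bad_epochs > self.patience:
                self.lr = max(self.lr * self.factor, self.min_lr)
                self.bad_epochs = 0
        return self.lr
```

In a training loop you would call `step_decay(initial_lr, epoch)` or `scheduler.step(val_loss)` once per epoch and pass the result to the optimizer. Whether this changes the loss pattern described here is a separate question; the sketch only shows the mechanism.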
