
plocco-tocco t1_izj4iy8 wrote on December 9, 2022 at 2:17 PM

Reply to comment by CrazyCrab in [D] Did I overfit to val by choosing the best checkpoint? by CrazyCrab

I would take the best checkpoint for each network (i.e. the point where the validation loss starts diverging from the training loss). I wouldn't use the same number of steps for all of them, because the networks don't necessarily converge to a minimum at the same time; some may be stuck somewhere for longer.
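
Roughly what I mean, as a minimal sketch. It assumes you logged (step, train_loss, val_loss) at every checkpoint, and it treats "starts diverging" as simply the step with the lowest validation loss, which is a simplification. The `best_checkpoint` helper and all the loss numbers are made up for illustration.

```python
from typing import Dict, List, Tuple

# One record per saved checkpoint: (step, train_loss, val_loss).
History = List[Tuple[int, float, float]]


def best_checkpoint(history: History) -> int:
    """Return the step with the lowest validation loss.

    Assumption: the minimum of the validation loss is used as a proxy for
    the point where validation loss starts diverging from training loss.
    """
    step, _, _ = min(history, key=lambda record: record[2])
    return step


# Hypothetical loss curves for two networks (illustrative numbers only).
runs: Dict[str, History] = {
    "net_a": [(100, 0.90, 0.95), (200, 0.60, 0.70), (300, 0.40, 0.75), (400, 0.30, 0.85)],
    "net_b": [(100, 0.95, 1.00), (200, 0.85, 0.92), (300, 0.60, 0.72), (400, 0.45, 0.68)],
}

# Pick the checkpoint per network instead of one shared step count.
best_steps = {name: best_checkpoint(history) for name, history in runs.items()}
print(best_steps)
```

With these toy curves, net_a is at its best early while net_b is still improving at the last checkpoint, so cutting both at the same step would shortchange one of them.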