Dartagnjan OP t1_j103ef6 wrote
Reply to comment by dumbmachines in [D] Techniques to optimize a model when the loss over the training dataset has a Power Law type curve. by Dartagnjan
- I have already tried my own version of selective backprob, but thanks for the link. this is exactly what I was looking for. I want to know how other people implement it and if I did something wrong.
- Overfitting on the hard examples is a test that I carried out already multiple times but not yet on the latest experiments. Thanks for reminding me of this. I guess from this I can infer whether my complexity is definitely too low, if I cannot overfit. If I can overfit. If I can overfit on the hard examples it does not mean the model is able to handle easy and hard examples at the same time, still.
Viewing a single comment thread. View all comments