SatisfyingLatte

SatisfyingLatte t1_ittn52b wrote

Once all the useful representations from the training data has been extracted and learned. Beyond that, increasing model size will overfit the training data. Only language tasks might be solvable by naively scaling current techniques.

1