Viewing a single comment thread. View all comments

bladecg t1_j8bznlz wrote

Maybe their model is just overfitting a lot to the test data? That’s always a thing in ML

36

94746382926 t1_j8c1uju wrote

Yeah I feel like we need more benchmarks

20

FusionRocketsPlease t1_j8diuz2 wrote

This type of paper should not be published before passing all possible tests that can refute the claim in the title...

4