sam__izdat t1_ixnhbrh wrote

I'm just going by what I've seen people try to produce and say so far. I haven't done any extensive testing, partly because I'm using an ancient Tesla GPU and they broke FP32.

3

hadaev t1_ixnrhn4 wrote

Colab.

But yeah, models this big are usually tested at huge scale.

A few cherry-picked comparisons with tens of samples show nothing.

1