Viewing a single comment thread. View all comments

goj-145 t1_j83801h wrote

It would have been MUCH harder to prove if they spent a day preprocessing the images first!

3

currentscurrents t1_j85rpol wrote

They use the open LAION 50B dataset, everybody knows what's in there.

Still, some preprocessing and deduplication would have been a good idea just for output quality.

2