Submitted by jayalammar t3_xvje2n in MachineLearning
mrflatbush t1_is2cw12 wrote
Fantastic work. As a laymen I am almost starting to understand much of this. Almost.
In the section titled "How Clip is trained", are the captions correct? The first appears to have a typo and the FC caption seems jumbled.
jayalammar OP t1_is9vlpm wrote
Thank you!
This caption?
>Larger/better language models have a significant effect on the quality of image generation models. Source: Google Imagen paper by Saharia et. al.. Figure A.5.
What's the issue?
Viewing a single comment thread. View all comments