
JJP77 t1_ire3y9h wrote on October 7, 2022 at 11:08 AM

Where'd they get 3D training data from?

Smearle t1_iuhhgyz wrote on October 31, 2022 at 11:25 AM

They don't use any. Instead, they render images of the 3D object from various viewpoints, then feed them into CLIP to score how closely the object resembles the text prompt.
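To make the scoring idea concrete, here's a rough sketch of how views could be compared against a prompt. It assumes you already have one CLIP image embedding per rendered view and one CLIP text embedding for the prompt (the rendering and the CLIP encoders themselves are not shown). The function names, the evenly spaced camera angles, and the `1 - similarity` loss are my own illustrative choices, not necessarily what the authors do.

```python
import math

def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

def camera_azimuths(num_views):
    """Hypothetical helper: evenly spaced camera angles (degrees) around the object."""
    return [360.0 * i / num_views for i in range(num_views)]

def multiview_score(view_embeddings, text_embedding):
    """Average image-text similarity over all rendered views.

    view_embeddings: one vector per view (assumed to come from CLIP's image encoder).
    text_embedding: one vector for the prompt (assumed to come from CLIP's text encoder).
    """
    sims = [cosine_similarity(v, text_embedding) for v in view_embeddings]
    return sum(sims) / len(sims)

def multiview_loss(view_embeddings, text_embedding):
    """Assumed loss: lower when the views look more like the prompt."""
    return 1.0 - multiview_score(view_embeddings, text_embedding)
```

In an actual pipeline the renderer would be differentiable, so a loss like this could be backpropagated into the 3D representation. Scoring several viewpoints rather than one keeps the object from only matching the prompt from a single angle.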