Viewing a single comment thread. View all comments

MysteryInc152 t1_jcduvhn wrote

I'm sorry maybe I want clear but you obviously have API access to GPT-4 right ? Does this access include an API call to their Vision model ? Or are you sending the images straight to BLIP and the like.

2

Empty-Revolution7570 OP t1_jcdv1nt wrote

No, it understands image through other models on hugging face, and outputs image with diffusers or OpenAI dalle

1