shitasspetfuckers
shitasspetfuckers t1_jed796l wrote
Reply to comment by Qzx1 in [D] I just realised: GPT-4 with image input can interpret any computer screen, any userinterface and any combination of them. by Balance-
> Google's Spotlight paper
https://ai.googleblog.com/2023/02/a-vision-language-approach-for.html
shitasspetfuckers t1_je6p0z9 wrote
Reply to comment by detached-admin in [D] The best way to train an LLM on company data by jaxolingo
Why not other people's money?
shitasspetfuckers t1_je1v7pf wrote
Reply to comment by reditum in [D] I just realised: GPT-4 with image input can interpret any computer screen, any userinterface and any combination of them. by Balance-
Can you please clarify what specifically about their approach wasn't great?
shitasspetfuckers t1_iwzs5i8 wrote
Reply to comment by flapflip9 in [P]Modern open-source OCR capabilities and which model to choose by Rodny_
shitasspetfuckers t1_jed7vuu wrote
Reply to comment by SeymourBits in [D] I just realised: GPT-4 with image input can interpret any computer screen, any userinterface and any combination of them. by Balance-
Can you please clarify what specifically you have tried, and what was the outcome?