Viewing a single comment thread. View all comments

nielsrolf t1_j7twdn2 wrote

I thought about it again, and another candidate is all LLM capabilities: if you prompt it for "a screenshot of a python method that does xyz" the best solution would be an image that contains working code.

9

visarga t1_j7yftij wrote

There are language models without tokens. They use the raw pixels of an image with text. I can't find the link, Google is not helping me much.

2