Viewing a single comment thread. View all comments

visarga t1_j59tlpa wrote

> PaLM is already the benchmark for basically all LLM tests

I also made a time machine but nobody can see it. You got to trust me. My work is the benchmark in time travel, though.

1

TFenrir t1_j5a38bv wrote

Just because I don't physically have access to these models, doesn't mean they don't exist. Google regularly works with other institutions when running research with PaLM and their other advancements, and people frequently duplicate their findings.

Additionally, we have access to things like Flan-T5, tiny models fine tuned with their latest work that are about as powerful as gpt3, 5b vs 170b parameters.

3

visarga t1_j5luwtn wrote

I know Flan-T5, it is probably the best small model, but it only gets good scores for extractive and classification tasks, not for creative writing.

1