visarga t1_j59tlpa wrote on January 21, 2023 at 12:56 PM

Reply to comment by TFenrir in Google to relax AI safety rules to compete with OpenAI by Surur

> PaLM is already the benchmark for basically all LLM tests

I also made a time machine but nobody can see it. You got to trust me. My work is the benchmark in time travel, though.

TFenrir t1_j5a38bv wrote on January 21, 2023 at 2:23 PM

Just because I don't physically have access to these models, doesn't mean they don't exist. Google regularly works with other institutions when running research with PaLM and their other advancements, and people frequently duplicate their findings.

Additionally, we have access to things like Flan-T5, tiny models fine tuned with their latest work that are about as powerful as gpt3, 5b vs 170b parameters.

visarga t1_j5luwtn wrote on January 23, 2023 at 10:41 PM

I know Flan-T5, it is probably the best small model, but it only gets good scores for extractive and classification tasks, not for creative writing.