
Poseidon_22 t1_jdpyo9u wrote

Apparently, a linear improvement in accuracy requires exponentially more parameters. GPT-4, with more than 1 trillion parameters, would need to be trained on 6,700 GPUs for a whole year!
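To put rough numbers on that, here is a minimal sketch of the arithmetic. It assumes accuracy grows with the logarithm of the parameter count, so each equal step in accuracy multiplies the parameter count by a fixed factor. The factor of 10 and the 1-billion-parameter starting point are purely illustrative assumptions, not measured values, and the function names are made up for this example. The GPU figure is the one quoted above.

```python
def params_needed(base_params: float, gain_steps: int, factor: float = 10.0) -> float:
    """Parameter count after `gain_steps` equal accuracy gains.

    Assumes accuracy ~ log(parameters), so each step multiplies the
    parameter count by `factor` (hypothetical value, for illustration only).
    """
    return base_params * factor ** gain_steps


def gpu_hours(num_gpus: int, days: float) -> float:
    """Total GPU-hours for `num_gpus` running continuously for `days` days."""
    return num_gpus * days * 24


if __name__ == "__main__":
    # Three equal accuracy steps from a hypothetical 1e9-parameter model.
    for step in range(4):
        print(step, f"{params_needed(1e9, step):.0e}")

    # 6,700 GPUs for one year, as claimed in the comment.
    print(f"{gpu_hours(6700, 365):,.0f} GPU-hours")
```

Under these assumptions, three equal accuracy steps take you from 1 billion to 1 trillion parameters, and 6,700 GPUs running for a year comes to about 58.7 million GPU-hours.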
