Submitted by President_Xi_ t3_113tuwb in MachineLearning

Is there a blog post or a paper comparing open source / open weights models? I know Flan-T5 is really good at instruction following, but I am specifically referring to performance after finetuning. Preferably it would compare models from somewhere around 1B to 11B parameters.

12

Comments

borisfin t1_j8upatv wrote

There are some interesting comparisons in the Flan-T5 paper. Check out "Scaling Instruction-Finetuned Language Models". Hope this helps.

6

adt t1_j8v1vlp wrote

5

farmingvillein t1_j90m0ab wrote

> For models, see my up-to-date list of models:

Which tab is germane to OP's request?

> but I am specifically referring to performance after finetuning.

So far as I can tell, nothing here is responsive to OP's query. But there is a lot here--perhaps I read too quickly.

0