Viewing a single comment thread. View all comments

buggaby OP t1_jc3jw39 wrote

How do you find the model size? All those you listed appear to be based on GPT-3 or 3.5 which, according to my searching, are both 175B parameters. It looks to me like they are different only in the kind and amount of fine-tuning. What am I missing?

1