Submitted by buggaby t3_11qgasm in MachineLearning
buggaby OP t1_jc3jw39 wrote
Reply to comment by MysteryInc152 in [D] Are modern generative AI models on a path to significantly improved truthfulness? by buggaby
How do you find the model size? All those you listed appear to be based on GPT-3 or 3.5 which, according to my searching, are both 175B parameters. It looks to me like they are different only in the kind and amount of fine-tuning. What am I missing?
MysteryInc152 t1_jc3kb0x wrote
MysteryInc152 t1_jc3klp8 wrote
Claude is the informal name for Anthropic-LM v4-s3 (52B)
MysteryInc152 t1_jc3kufz wrote
Finally the instruct versions are prepended with "text-"
Viewing a single comment thread. View all comments