Submitted by mrx-ai t3_zgr7nr in MachineLearning
mrx-ai OP t1_izijamw wrote
Reply to comment by Acceptable-Cress-374 in [R] Large language models are not zero-shot communicators by mrx-ai
You might want to read at p.8 in the paper. The authors evaluate three different models (GPT-3-175B, InstructGPT-3-175B, and text-davinci-002) using different prompt templates, but none of the models show improved performance. The variance of the results for text-davinci-002 is particularly high, and the best prompt template only achieves a 74.5% accuracy rate.
Viewing a single comment thread. View all comments