PengsoonThePenguin t1_j16rrtg wrote
Reply to comment by [deleted] in [R] Nonparametric Masked Language Modeling - MetaAi 2022 - NPM - 500x fewer parameters than GPT-3 while outperforming it on zero-shot tasks by Singularian2501
I guess an easy explanation is that the model works solely from retrieval over the corpus. Every prediction has to be explained by the corpus.
Viewing a single comment thread. View all comments