Submitted by Secure-Technology-78 t3_10mdhxb in MachineLearning
CKtalon t1_j695owv wrote
Reply to comment by NoFairYouCheated in [R] SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot by Secure-Technology-78
No. There are blog posts about it performing quite badly: https://www.surgehq.ai/blog/how-good-is-hugging-faces-bloom-a-real-world-human-evaluation-of-language-models
Then based on the Chinchilla paper, you can kind of infer that it's a result of undertraining.
Viewing a single comment thread. View all comments