trashacount12345
trashacount12345 t1_j09v4tc wrote
Reply to comment by [deleted] in [P] Implemented Vision Transformers 🚀 from scratch using TensorFlow 2.x by TensorDudee
Onlyfans
trashacount12345 t1_ix5nkdm wrote
Reply to [R] Tips on training Transformers by parabellum630
Did you debug on a single sample or batch?
Have you double checked you don’t have something like applying two sigmoids and therefore getting tiny gradients? I make that mistake pretty much every time I set up a new model.
trashacount12345 t1_iufjx2j wrote
Reply to comment by 4Gotes in The Pillars of Creation by Webb’s mid-infrared instrument (MIRI) by MistWeaver80
I was seeing hands pointing at something up and to the right.
trashacount12345 t1_jeddoei wrote
Reply to comment by TitusPullo4 in [R] The Debate Over Understanding in AI’s Large Language Models by currentscurrents
This is the agreed upon definition in philosophy. I’m not sure what another definition would be besides “it’s not real”.