Submitted by michaelthwan_ai t3_121domd in MachineLearning
Veggies-are-okay t1_jdmopy0 wrote
Does anyone have a good resource/video on the overview of these implementations? I don’t work much with language models but figure it might be good to understand where this is but I’m just running into the buzz feed-esque surface level nonsense on YouTube.
tonicinhibition t1_jdn4v86 wrote
There's a YouTuber named Letitia, with a little Miss Coffee Bean character, who covers new models at a decent level.
CodeEmporium does a great job at introducing aspects of the GPT/ChatGPT architecture with increasing depth. Some of the videos have code.
Andrej Karpathy walks you through building GPT in code
As for the lesser known models, I just read the abstracts and skim the papers. It's a lot of the same stuff with slight variations.
michaelthwan_ai OP t1_jdpy5dy wrote
Thanks for the sharing above!
My choice is yk - Yannic Kilcher. Some "AI News" videos is a brief introduction and he sometimes go through certain papers in details. Very insightful!
Viewing a single comment thread. View all comments