Viewing a single comment thread. View all comments

PaulTheBully t1_ixbuwat wrote

Is it applicable for only LLM or any transformer architecture? (I’m sorry if my question is stupid, I’m new to the field)

2