[D] Trying to find paper about n-grams in early transformer layers
Submitted by soraki_soladead on December 15, 2022 at 4:13 PM in MachineLearning
soraki_soladead (OP) wrote on December 16, 2022 at 1:54 AM, in reply to Axel-Blaze:
Thanks, I’ll take a look!