[R] Tips on training Transformers Submitted by parabellum630 t3_z088fo on November 20, 2022 at 4:23 PM in MachineLearning 23 comments 78
hadaev t1_ix5eduw wrote on November 20, 2022 at 9:49 PM Reply to comment by yannbouteiller in [R] Tips on training Transformers by parabellum630 Just replace gru with transformer and keep cnn as positional encoding. Permalink Parent 5
Viewing a single comment thread. View all comments