[P] RWKV 14B Language Model & ChatRWKV: pure RNN (attention-free), scalable and parallelizable like Transformers
Submitted by bo_peng (t3_10eh2f3) on January 17, 2023 at 4:54 PM in MachineLearning · 19 comments · 110 points
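For context on the "pure RNN, attention-free" claim: below is a minimal NumPy sketch (not the project's code; the names `w`, `u`, `k`, `v` follow the RWKV paper's conventions) of the sequential WKV recurrence that replaces attention at inference time. Each channel keeps a decaying weighted average of past values, so generation needs only O(1) state per token; training uses a parallel formulation of the same operator, which is what makes it parallelizable like a Transformer.

```python
import numpy as np

def wkv_recurrent(w, u, k, v):
    """Sequential (RNN-mode) sketch of an RWKV-style WKV operator.

    w : (D,) per-channel decay rate (positive)
    u : (D,) per-channel bonus for the current token
    k : (T, D) keys
    v : (T, D) values
    Returns a (T, D) array of outputs.
    """
    T, D = k.shape
    out = np.zeros((T, D))
    # Running numerator (a) and denominator (b) of a decaying
    # exponential average, tracked with a max exponent (p) so the
    # exp() calls stay numerically stable.
    a = np.zeros(D)
    b = np.zeros(D)
    p = np.full(D, -1e38)

    for t in range(T):
        # Output for token t: blend the accumulated state with the
        # current token, which gets the extra bonus u.
        q = np.maximum(p, u + k[t])
        e1 = np.exp(p - q)
        e2 = np.exp(u + k[t] - q)
        out[t] = (e1 * a + e2 * v[t]) / (e1 * b + e2)

        # Update the state: decay past contributions by w, add token t.
        q = np.maximum(p - w, k[t])
        e1 = np.exp(p - w - q)
        e2 = np.exp(k[t] - q)
        a = e1 * a + e2 * v[t]
        b = e1 * b + e2
        p = q
    return out

# Toy usage: 8 tokens, 4 channels of hypothetical keys/values.
rng = np.random.default_rng(0)
y = wkv_recurrent(w=np.ones(4) * 0.5, u=np.zeros(4),
                  k=rng.normal(size=(8, 4)), v=rng.normal(size=(8, 4)))
print(y.shape)  # (8, 4)
```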
femboyxx98 (t1_j4vlsfj) wrote on January 18, 2023 at 3:58 PM · 5 points
Have you compared it against modern transformer implementations, e.g. with FlashAttention, which can provide a 3x-5x speedup by itself?