[D] PyTorch 2.0 Native Flash Attention 32k Context Window Submitted by super_deap t3_11tmpc5 on March 17, 2023 at 9:59 AM in MachineLearning 99 comments 345
BungaBunga6767 t1_jcl6vf9 wrote on March 17, 2023 at 5:07 PM Reply to comment by harharveryfunny in [D] PyTorch 2.0 Native Flash Attention 32k Context Window by super_deap LongFormer does it but not with FlashAttention Permalink Parent 3
Viewing a single comment thread. View all comments