Submitted by super_deap t3_11tmpc5 in MachineLearning
Nhabls t1_jck9a4c wrote
Reply to comment by kittenkrazy in [D] PyTorch 2.0 Native Flash Attention 32k Context Window by super_deap
Yeah, you just need enough training time and data to train those 32k context layers effectively...
fastinguy11 t1_jcle8cn wrote
When will the GPT-4 32k API be available?