Submitted by super_deap t3_11tmpc5 in MachineLearning
Spiritual-Reply5896 t1_jcsq4d9 wrote
Reply to comment by 127-0-0-1_1 in [D] PyTorch 2.0 Native Flash Attention 32k Context Window by super_deap
Exactly, I wanted to find out whether there is some research regarding these embeddings. I really think that by efficient pruning/organization of these "memories" its possible to generate quite advanced memory. Things like embedding consistency then becomes a big player - how much does length affect the embedding, what is the optimal information content vs string size...
Viewing a single comment thread. View all comments