
lukeiy t1_j2luz7z wrote

Use another model to reduce this context to a vector, then append it to each token. This was the approach used in Set Transformers (TSPN).
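
Roughly something like this minimal PyTorch sketch (the names `ContextPooler`, `append_context`, and the mean-pooling choice are just illustrative, not the reference implementation from the Set Transformer / TSPN papers):

```python
import torch
import torch.nn as nn

class ContextPooler(nn.Module):
    """Reduce a (batch, n_ctx, d_model) context to a single (batch, d_ctx) vector."""
    def __init__(self, d_model: int, d_ctx: int):
        super().__init__()
        self.proj = nn.Linear(d_model, d_ctx)

    def forward(self, ctx: torch.Tensor) -> torch.Tensor:
        # Mean-pool over the context dimension, then project down.
        return self.proj(ctx.mean(dim=1))            # (batch, d_ctx)

def append_context(tokens: torch.Tensor, ctx_vec: torch.Tensor) -> torch.Tensor:
    """Concatenate the pooled context vector onto every token embedding."""
    b, n, _ = tokens.shape
    ctx = ctx_vec.unsqueeze(1).expand(b, n, -1)       # broadcast to every token
    return torch.cat([tokens, ctx], dim=-1)           # (batch, n, d_model + d_ctx)

# Usage
tokens = torch.randn(4, 16, 64)                       # main sequence
context = torch.randn(4, 50, 64)                      # variable-length side context
pooler = ContextPooler(d_model=64, d_ctx=32)
augmented = append_context(tokens, pooler(context))  # (4, 16, 96)
```

The pooler can be any permutation-invariant encoder; mean pooling is just the simplest stand-in.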

2

kdqg t1_j2oo4rl wrote

Also have a look at the slot attention mechanism, which does something similar but arguably more elegantly.
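
A condensed sketch of the idea from Slot Attention (Locatello et al., 2020), simplified from the paper and not their exact code: a fixed set of slots competes for input features via attention that is normalized over the slot axis, then each slot is updated with a GRU.

```python
import torch
import torch.nn as nn

class SlotAttention(nn.Module):
    def __init__(self, num_slots: int, dim: int, iters: int = 3):
        super().__init__()
        self.num_slots, self.iters, self.scale = num_slots, iters, dim ** -0.5
        self.slots_mu = nn.Parameter(torch.randn(1, 1, dim))
        self.slots_logsigma = nn.Parameter(torch.zeros(1, 1, dim))
        self.to_q, self.to_k, self.to_v = (nn.Linear(dim, dim) for _ in range(3))
        self.gru = nn.GRUCell(dim, dim)
        self.norm_inputs, self.norm_slots = nn.LayerNorm(dim), nn.LayerNorm(dim)

    def forward(self, inputs: torch.Tensor) -> torch.Tensor:
        b, n, d = inputs.shape
        inputs = self.norm_inputs(inputs)
        k, v = self.to_k(inputs), self.to_v(inputs)
        # Initial slots sampled from a learned Gaussian.
        slots = self.slots_mu + self.slots_logsigma.exp() * torch.randn(
            b, self.num_slots, d, device=inputs.device)
        for _ in range(self.iters):
            slots_prev = slots
            q = self.to_q(self.norm_slots(slots))
            # Softmax over slots: inputs are divided among competing slots.
            attn = torch.einsum('bnd,bkd->bnk', k, q).mul(self.scale).softmax(dim=-1)
            attn = attn / attn.sum(dim=1, keepdim=True)   # weighted mean over inputs
            updates = torch.einsum('bnk,bnd->bkd', attn, v)
            slots = self.gru(updates.reshape(-1, d),
                             slots_prev.reshape(-1, d)).reshape(b, -1, d)
        return slots  # (batch, num_slots, dim)
```

The slots end up playing the same role as the pooled context vector above, except you get several of them and they specialize on different parts of the input.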

1