[D] What are good ways of incorporating non-sequential context into a transformer model? Submitted by abc220022 t3_100y331 on January 2, 2023 at 12:23 AM in MachineLearning 11 comments 27