Submitted by bo_peng t3_1135aew in MachineLearning
gwern t1_j8pg3g7 wrote
farmingvillein t1_j8pni5v wrote
Neither of these offers a comparison against transformers, although they are certainly a useful look at the limitations of your basic RNN/LSTM.