[R] RWKV-4 14B release (and ChatRWKV) - a surprisingly strong RNN Language Model Submitted by bo_peng t3_1135aew on February 15, 2023 at 6:44 PM in MachineLearning 37 comments 268
afireohno t1_j8qg9eq wrote on February 16, 2023 at 5:14 AM Reply to comment by maizeq in [R] RWKV-4 14B release (and ChatRWKV) - a surprisingly strong RNN Language Model by bo_peng There is some work on Frustratingly Short Attention Spans in Neural Language Modeling Permalink Parent 1
Viewing a single comment thread. View all comments