[P] RWKV 14B Language Model & ChatRWKV : pure RNN (attention-free), scalable and parallelizable like Transformers Submitted by bo_peng t3_10eh2f3 on January 17, 2023 at 4:54 PM in MachineLearning 19 comments 110
MaiconJLLE t1_j5d4x2x wrote on January 22, 2023 at 3:09 AM Reply to comment by SatoshiNotMe in [P] RWKV 14B Language Model & ChatRWKV : pure RNN (attention-free), scalable and parallelizable like Transformers by bo_peng https://pile.eleuther.ai/ Permalink Parent 2
Viewing a single comment thread. View all comments