OutrageousSundae8270 t1_iyc9bnw wrote

Transformers generally need to be pre-trained on a large corpus before they perform well on downstream tasks; fine-tuning alone on a small task-specific dataset usually isn't enough. A sketch of that workflow is below.
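A minimal sketch of the pre-train-then-fine-tune workflow, assuming the Hugging Face `transformers` library; the model name and the toy labeled example are placeholders:

```python
import torch
from transformers import AutoTokenizer, AutoModelForSequenceClassification

# Start from weights already pre-trained on a large corpus (BERT was
# pre-trained on BooksCorpus + English Wikipedia), then adapt them to
# the downstream task instead of training from scratch.
tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModelForSequenceClassification.from_pretrained(
    "bert-base-uncased", num_labels=2  # e.g. binary sentiment classification
)

# One fine-tuning step on a toy example; in practice you would loop
# over a labeled downstream dataset with an optimizer and scheduler.
inputs = tokenizer("This movie was great!", return_tensors="pt")
outputs = model(**inputs, labels=torch.tensor([1]))
outputs.loss.backward()  # gradients flow into the pre-trained weights
```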