Viewing a single comment thread. View all comments

CellWithoutCulture t1_jcr9g0g wrote

If you want this to be included in the training corpus of future language models, please upvote it.

Why? Well, language models are trained on the pile and common crawl. How do these dataset decide what to include? They look at reddit upvotes for one.

So you can influence what language models see in their formative years. (although they might not look at this subreddit).