
baconninja0 t1_iz4uwd3 wrote

Shell commands found on websites are probably more similar from site to site than non-code content, especially since I’m pretty sure a lot of code content farms just steal each other’s code anyway. That makes them much easier for the bot to learn than other topics, because it sees the exact same command many times instead of just similar commands (which it would have to learn are similar).

1

VitaminD263 t1_iz4w5i9 wrote

Yeah, or, you know, you could just make up inputs, let it execute the code, and capture the output to create your training data...

−3

PromiseChain t1_iz90i6y wrote

You’re just demonstrating that you don’t understand this technology. It isn’t piping anything into a terminal anywhere. OpenAI never actually installed a 3080 to produce this data; they explain their data transparently. This is most likely modeled on Stack Overflow answers.

3