Viewing a single comment thread. View all comments

blablanonymous t1_iswe0za wrote

Won’t you need a labeled training set to make that work?

2

stevewithaweave t1_isy1dji wrote

I think you generate your own fake papers as the label. And mix it in with real papers

1

the_mighty_skeetadon t1_iszprhg wrote

That can't be the only method, because if your model for generating fake papers differs significantly from somebody else's model, you will be both unable to detect those fake papers and unable to detect that you're failing.

Better is to have fake papers rejected from journals labeled thusly and to synthetically generate more fake papers with a wide variety of known approaches.

1

stevewithaweave t1_iszusz3 wrote

I think the original commenter was referring to an architecture similar to GANs. I agree that including examples of fake papers would improve the model but is not required

1