30katz
30katz t1_j2n9hpj wrote
Reply to comment by Terrible-List-1653 in [D] Data cleaning techniques for PDF documents with semantically meaningful parts by cm_34978
Dude, stop. No one needs more garbage information. We can use ChatGPT and Google without your help. You’re not being an AI entrepreneur by spamming ChatGPT responses.
30katz t1_j2lilwg wrote
Reply to comment by low_effort_shit-post in [D] Data cleaning techniques for PDF documents with semantically meaningful parts by cm_34978
Our company is stuck with PDFs, but they’re actually not too hard to work with using Amazon Textract or Adobe’s PDF Extract API. But maybe that’s a sign that it is hard, because the technology is owned by the two biggest tech giants in the space.
30katz t1_j11v4wa wrote
Reply to comment by vprokopev in [D] Why are we stuck with Python for something that require so much speed and parallelism (neural networks)? by vprokopev
Maybe you’d take all this free software and make it easier for others in the future?
30katz t1_j0m3iuj wrote
Reply to comment by CriticalTemperature1 in [D] ChatGPT, crowdsourcing and similar examples by mvujas
Just analyzing the questions and gleaning what could be going on would be a gold mine.
I’m sure Google can come up with a lot of very profitable metrics.
30katz t1_j6aeit8 wrote
Reply to comment by squareOfTwo in [N] OpenAI has 1000s of contractors to fine-tune codex by yazriel0
Open Aiiiiyaaaaaah