
Neurogence t1_j757fn0 wrote

So not all rumors are fake. People were saying that Semafor article was garbage.

This is interesting so far but not groundbreaking yet. I'm hoping the rumor that GPT-4 can code entire programs is also not fake.

48

tk854 OP t1_j75bo9e wrote

That rumor came from Connor Leahy, CEO of Conjecture, https://twitter.com/npcollapse/ . He is a serious person in the AI/ML world and has direct connections to Sam Altman and other big names, so it would be very unusual if the rumor were not true. When trying to interpret that rumor, though, you have to realize that being able to "code an entire program" could place GPT-4 anywhere on a scale from a high-school programmer who can write a basic CRUD app all the way to John Carmack, which actually doesn't tell us very much.

43

Neurogence t1_j75czbn wrote

Even if it had just the skill set of an entry-level programmer, it would result in massive societal effects and job displacement.

15

tk854 OP t1_j75e2fp wrote

I'm not so sure. 28 million programmers in the world means about 0.3% of all people on earth could be affected by job displacement, but only a small percentage of that 0.3% might lose their jobs, and the work that's being automated won't result in many visible changes except to the financial outlook of the companies that previously employed those workers. The programming abilities of LLMs like GPT-4 need to exceed human ability in a general way before the effects on society could be described as massive.
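The back-of-envelope math above can be sketched out quickly (the 28 million and 8 billion figures are the rough estimates used here, not exact data):

```python
# Rough share of the world population that works as a programmer.
programmers = 28_000_000
world_population = 8_000_000_000  # approximate

share = programmers / world_population
print(f"{share:.2%}")  # 0.35%, i.e. roughly the 0.3% quoted above
```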

3

Old-Owl-139 t1_j75feeu wrote

You are missing the point. The average "knowledge" worker doesn't go beyond Excel spreadsheets. Even an AI with the skills of the average high school graduate would cause massive disruption.

33

visarga t1_j764340 wrote

No, you're thinking AI can do this alone. Let me tell you - it can't. If it has a 1% error rate in information extraction from documents, you need to manually verify everything. Like Tesla's self-driving effort, being 99% of the way there is nothing groundbreaking.

I have been working on this very task for 5+ years. I know every paper and model there is. I tested all public APIs for this task. I extensively used GPT-3 for it, and that's my professional judgement.

As for validating AI output, it can be 10x more comfortable than manual information extraction, but it still requires about 50% of the manual effort. It is not suddenly making people 10x more effective.

Not even OCR is 100% accurate. The best systems have 95% accuracy on noisy document scans. One wrong digit or comma can make a whole transaction absurd; if you send that money without checking, you could go bankrupt.

The best models we have today are good at generating correct answers 90% of the time - code, factual questions, reasoning. They can do it all, but not perfectly. We don't know the risks and can't use this level of confidence without a human in the loop.
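To put that 90% in perspective: per-step accuracy compounds across a multi-step task (the step counts below are just illustrative):

```python
# If a model is right 90% of the time per step, the chance that a
# multi-step task completes with zero errors falls off quickly.
per_step_accuracy = 0.90

for steps in (1, 5, 10, 20):
    p_all_correct = per_step_accuracy ** steps
    print(f"{steps:2d} steps -> {p_all_correct:.1%} chance of no errors")
```

By 10 steps you're under 35%, which is why a human still has to check the output.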

13

X-msky t1_j765co5 wrote

You assume humans have 100% accuracy?

12

visarga t1_j76lslh wrote

Oh, I can tell you stories about human accuracy. At one point I re-labelled the same test set three times and was still finding errors. My models surpass untrained human accuracy, but they still need hand-holding; there's one error per page on average. Humans do more cross-checking and correlating, filling a gap in AI.

6

purepersistence t1_j7689y4 wrote

If you're debugging code, you don't have to be accurate until the problem is fixed; mistakes will be common. Accuracy is not absolutely necessary, but competence is. It will be a long damn time before something like ChatGPT can find and fix subtle bugs in a production system with many interacting services distributed across multiple computers, running software controlled by different corporations.

2

kai_luni t1_j76dmfm wrote

I agree with your point and think about it the same way. Even a great GPT-4 is useless if your Node.js app doesn't work and GPT-4 just gives up on it. A halfway decent software developer is capable of trying until it works. They'll sleep on it, keep trying, talk to other people about it, and learn along the way.


At some point your Node.js app will work and you'll be happy. The question is whether an AI will reach this level. Even an accuracy of 99.9% can still mean the app does not work. Can it fix the last ten bugs on its own? If not, you need to hire someone, a real person, to spend many days on this app.
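That 99.9% point can be made concrete with a quick estimate (the line count and error rate below are illustrative assumptions, not measurements):

```python
# Even a tiny per-line error rate leaves absolute bugs in a real codebase.
lines_of_code = 10_000   # a small-to-medium app, say
error_rate = 0.001       # 99.9% per-line accuracy

expected_bugs = lines_of_code * error_rate
print(f"~{expected_bugs:.0f} bugs left for someone to fix")
```

Ten remaining bugs is enough to keep the app broken until a human steps in.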


Maybe this new technology just leads to better code quality. It could streamline your spaghetti code and give it proper documentation. Maybe a salesperson could ask the AI, "Can our program do x and then y?" and the AI would answer, "Not yet, but with an estimated two weeks of development it could be possible." That would greatly improve information flow in companies.


So let's see if current machine learning can reach a level where it impacts the world. It's an exciting time to be alive.

2

tk854 OP t1_j76ks8y wrote

Your explanation is spot on. My one-line take is that a larger percentage of jobs are AGI-hard than most people assume. Take driving, for example.

I also think that a lot of people are underestimating how difficult most jobs are, even when it's a job that can be described as "just looking at a spreadsheet".

1

futebollounge t1_j76v5ov wrote

The context needed to understand the content of a spreadsheet versus a dynamic physical world (driving) is night and day in complexity.

3

Neurogence t1_j77a6ep wrote

It depends on how complex the program is. I think it will be much harder to have an AI that can code a program such as a browser entirely by itself versus a fully driverless car AI.

1

Stakbrok t1_j77dwl5 wrote

John Carmack? Meh, I'd put Fabrice Bellard at that end of the spectrum.

1

visarga t1_j763vpj wrote

That depends a lot on context window size: if it's 4K or 8K tokens like today, it won't cut it. For full-app coding you need to be able to load dozens of files.
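A rough estimate shows why 4K-8K tokens falls short for whole-app coding (the ~4 characters per token ratio is a common rule of thumb, and the file counts and sizes are made-up assumptions):

```python
# Estimate how many tokens a codebase needs versus the context window.
CHARS_PER_TOKEN = 4  # rough rule of thumb for code and English text

def estimated_tokens(file_sizes_chars):
    """Approximate token count for a list of file sizes (in characters)."""
    return sum(size // CHARS_PER_TOKEN for size in file_sizes_chars)

# Say an app has 30 source files averaging ~6,000 characters each.
files = [6_000] * 30
needed = estimated_tokens(files)
print(f"~{needed:,} tokens needed vs. an 8,192-token window")
```

Even a modest app overflows the window several times over, so the model can never "see" the whole codebase at once.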

Related to this - if we get, say, a 100K context size, we could just "talk to books" or "talk to scientific papers".

5