Now that GPT-4 can work with text and images and even use plugins, it seems like there's nothing left to add. In my opinion, the secret sauce that would put GPT-5 leaps and bounds ahead of its rivals in usefulness is the ability to hold real-time conversations. Imagine having your own personal Jarvis, or even being able to banter with a whole suite of "friends", each with their own unique personality?

I'm curious what kind of breakthroughs, if any, would be needed to make this possible. Maybe AI could be trained on conversations in TV/movies?

Comments

You must log in or register to comment.

ptxtra t1_jea2njq wrote on March 30, 2023 at 2:49 PM

The biggest leap for gpt-5 would be logical reasoning, and a functional working memory.

Veleric t1_jeagmfz wrote on March 30, 2023 at 4:24 PM

Saw a video today of a rather rudimentary display of a memory plugin. It took information from a onedrive doc, was given new info from a prompt that updated it's knowledge. They closed out and went back in and it seemed to provide the correct answer then. Whether that is fully capable or something else comes along, I can't imagine memory in some meaningful capacity is more than a few weeks away.

ingeniare t1_jealsr1 wrote on March 30, 2023 at 4:57 PM

They're probably just doing a vector embedding of the information and retrieving it using semantic search, this has been around for quite some time already

Veleric t1_jeblny0 wrote on March 30, 2023 at 8:44 PM

Who knows for sure, but I definitely take it with a grain of salt since the plugin was shown as unverified and it showed no real detail. I still think it's just around the corner, though!

[deleted] t1_jecpt56 wrote on March 31, 2023 at 1:28 AM

[deleted]

ptxtra t1_jedk9yh wrote on March 31, 2023 at 6:14 AM

That doesn't help with reasoning. It only connects multiple AIs with code. If the AI gives an unreasonable answer to a prompt and forgets the context, you can't help that with chaining it to other AIs.

mckirkus t1_jebqkfu wrote on March 30, 2023 at 9:16 PM

I think having an ability to communicate verbally is more important. Not just translating a sentence into sound, but storytelling, intonation, comedic timing, etc.

Akimbo333 t1_je9v7gd wrote on March 30, 2023 at 1:54 PM

Or have GPT 5 play audio or music and video

DonOfTheDarkNight t1_jebb49e wrote on March 30, 2023 at 7:38 PM

or generate worlds and games Lol

DonOfTheDarkNight t1_jebb55b wrote on March 30, 2023 at 7:38 PM

or generate worlds and games Lol

Akimbo333 t1_jebgubg wrote on March 30, 2023 at 8:14 PM

Yeah

DonOfTheDarkNight t1_jedshi0 wrote on March 31, 2023 at 8:08 AM

I don't know why my comment got posted two times XD

Akimbo333 t1_jedwvde wrote on March 31, 2023 at 9:13 AM

Lol, all good!

grimorg80 t1_jeagznm wrote on March 30, 2023 at 4:26 PM

External memory would be the biggest one still missing

megadonkeyx t1_jeb9msu wrote on March 30, 2023 at 7:29 PM

So much more to add..

Long term memory

Continuous associated thought

A body with eyes

alexiuss t1_je9sa57 wrote on March 30, 2023 at 1:32 PM

You don't need gpt5 for that. Open source movement already made this possible with gpt3.5 https://josephrocca.github.io/OpenCharacters/

Ishynethetruth t1_jeb4xfe wrote on March 30, 2023 at 6:58 PM

Shouldn’t the next time is to give ai vision to see the world.

Cr4zko t1_jecu9xl wrote on March 31, 2023 at 2:02 AM

I don't know but we're in changing times.

Borrowedshorts t1_jed5gja wrote on March 31, 2023 at 3:36 AM

I think multistep tasks and the ability to understand context across applications is the next paradigm to solve. GPT 4 with plug-ins can do this in a rudimentary sense, but I think it will take specific training and architecture within the generative model itself to start to replace FTE workers.