turnip_burrito t1_ja2f02y wrote on February 26, 2023 at 9:33 AM

Sure, you can do it if you have enough data, and a powerful enough computer.

Idk how you're going to do reinforcement learning to update the transformer weights though (I assume you want to use a transformer?). That's a lot of computation. The bigger your model is, the slower this update step will be.

Are you separating hearing and speaking/moving in time? Like are they separate steps that can't happen at the same time? My question then is why not make them simultaneous?

visarga t1_ja3637d wrote on February 26, 2023 at 2:35 PM

A recent approach saves past experience data and loads it back for in-context-learning. The model itself can be task generic. So it learns by collecting new data.