Viewing a single comment thread. View all comments

turnip_burrito t1_ja2f02y wrote

Sure, you can do it if you have enough data, and a powerful enough computer.

Idk how you're going to do reinforcement learning to update the transformer weights though (I assume you want to use a transformer?). That's a lot of computation. The bigger your model is, the slower this update step will be.

Are you separating hearing and speaking/moving in time? Like are they separate steps that can't happen at the same time? My question then is why not make them simultaneous?

5

visarga t1_ja3637d wrote

A recent approach saves past experience data and loads it back for in-context-learning. The model itself can be task generic. So it learns by collecting new data.

6