
Kitchen_Tower2800 t1_jakjrxr wrote

I've never worked directly with either, but aren't RL agent-competition approaches (i.e. simulating games between agents with different parameter values and iterating on those agents) a form of genetic algorithm?

It's also worth noting that this is exactly the type of problem that genetic algorithms were made for: no gradients, highly multimodal.
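
To make the "no gradients, highly multimodal" point concrete, here is a minimal sketch of a genetic algorithm that only ever calls the objective and never asks for a gradient. The Rastrigin function is a standard multimodal test function; `run_ga` and all of its parameters (`pop_size`, `sigma`, and so on) are hypothetical names and values chosen for illustration, not a tuned or canonical implementation.

```python
import math
import random

def rastrigin(x):
    """Highly multimodal test function; global minimum of 0 at the origin."""
    return 10 * len(x) + sum(xi * xi - 10 * math.cos(2 * math.pi * xi) for xi in x)

def run_ga(objective, dim=2, pop_size=40, generations=60, sigma=0.3, seed=0):
    """Minimize `objective` using only function evaluations (illustrative sketch)."""
    rng = random.Random(seed)
    pop = [[rng.uniform(-5.12, 5.12) for _ in range(dim)] for _ in range(pop_size)]
    history = []  # best objective value seen in each generation
    for _ in range(generations):
        pop.sort(key=objective)              # lower is better
        history.append(objective(pop[0]))
        parents = pop[: pop_size // 2]       # truncation selection
        children = [pop[0][:]]               # elitism: keep the current best unchanged
        while len(children) < pop_size:
            a, b = rng.sample(parents, 2)
            child = [rng.choice(pair) for pair in zip(a, b)]   # uniform crossover
            child = [g + rng.gauss(0, sigma) for g in child]   # Gaussian mutation
            children.append(child)
        pop = children
    best = min(pop, key=objective)
    history.append(objective(best))
    return best, history

best, history = run_ga(rastrigin)
```

Because the best individual is carried over unchanged each generation, `history` can never get worse, even though the search has no idea which direction is downhill.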

8

TobusFire OP t1_janyxnp wrote

> aren't RL agent-competition approaches (i.e. simulating games between agents with different parameter values and iterating on those agents) a form of genetic algorithm?

Hmm, I hadn't thought about RL like that. I guess the signal from a reward function based on competition could be considered "fitness", and then perhaps some form of crossover is done in the way we iterate on and update the agents. Interesting thought.
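
A rough sketch of what that analogy could look like, assuming a toy Colonel Blotto-style game I made up for illustration: each agent is just a parameter vector (how it splits its resources across battlefields), "fitness" is its score in a round-robin tournament against the rest of the population, and the next generation comes from crossover plus mutation of the top half. The names `play`, `tournament_fitness`, and `evolve` are hypothetical, and this is not meant to describe how any particular RL system actually updates its agents.

```python
import random

def normalize(w):
    """Scale weights so they sum to 1 (a fixed resource budget)."""
    total = sum(w)
    return [x / total for x in w]

def play(a, b):
    """Toy Blotto game: 1 if a wins more battlefields, -1 if b does, 0 on a tie."""
    a_wins = sum(x > y for x, y in zip(a, b))
    b_wins = sum(y > x for x, y in zip(a, b))
    return (a_wins > b_wins) - (a_wins < b_wins)

def tournament_fitness(pop):
    """Round-robin competition; an agent's fitness is its net wins."""
    scores = [0] * len(pop)
    for i in range(len(pop)):
        for j in range(i + 1, len(pop)):
            result = play(pop[i], pop[j])
            scores[i] += result
            scores[j] -= result
    return scores

def evolve(pop_size=20, fields=3, generations=30, sigma=0.05, seed=0):
    rng = random.Random(seed)
    pop = [normalize([rng.random() + 1e-9 for _ in range(fields)])
           for _ in range(pop_size)]
    for _ in range(generations):
        scores = tournament_fitness(pop)
        ranked = [agent for _, agent in
                  sorted(zip(scores, pop), key=lambda t: t[0], reverse=True)]
        parents = ranked[: pop_size // 2]      # selection by competitive score
        children = parents[:]                  # survivors carry over
        while len(children) < pop_size:
            a, b = rng.sample(parents, 2)
            child = [rng.choice(pair) for pair in zip(a, b)]             # crossover
            child = [max(g + rng.gauss(0, sigma), 1e-9) for g in child]  # mutation
            children.append(normalize(child))
        pop = children
    return pop

final_pop = evolve()
```

One thing this makes visible: fitness here is relative to the current population rather than a fixed objective, so the "landscape" shifts every generation as the opponents change.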

1