f10101 t1_ivj1s5q wrote on November 8, 2022 at 9:31 AM

To take an example where it's a fair fight, and the computer doesn't win by virtue of having more input bandwidth: RL models applied to narrow physical tasks.

These will often exceed human ability after just a couple of hundred attempts - Cart Pole would be an example.

blimpyway t1_ivjly2a wrote on November 8, 2022 at 1:29 PM

This indeed could be one case. However a couple hundred attempts is not the limit - a kid would get it in less than a couple dozen trials or she will get bored.

However I found that some models can do it even faster. Like under 5 failures or less on 50% trials, including only 2 failures in 5% of trials.