f10101 t1_ivj1s5q wrote

To take an example where it's a fair fight, and the computer doesn't win by virtue of having more input bandwidth: RL models applied to narrow physical tasks.

These will often exceed human ability after just a couple of hundred attempts - Cart Pole would be an example.

2

blimpyway t1_ivjly2a wrote

This could indeed be one such case. However, a couple hundred attempts is not the limit - a kid would get it in fewer than a couple dozen trials, or she would get bored.

However, I found that some models can do it even faster: 5 or fewer failures in 50% of trials, including only 2 failures in 5% of trials.

1