abstractcontrol
abstractcontrol t1_ivjjbfq wrote
Reply to comment by flapflip9 in [Project] Rebel Poker AI by Character_Bluejay601
Poker really brings out all the weaknesses of deep learning, it is hardly a solved thing. For example, if you log into Stars and do a HU SNG, you'll see that you start with 1,000 stacks and 10/20 blinds. That means you have 960 different raises + call + fold different actions to account for just in that small game. You also have large reward variance that deep RL algorithms can't deal with properly. Some algos like categorical DRL are just too memory inefficient to be used even on moderately large games. You'd be amazed at how much memory having around 1,000 different actions takes up once you start using mini-batches.
The academic SOTA is to just stick a tabular algorithm on top of some deep net, which is hardly elegant. All these algorithms are just hacks and I wouldn't use them for real money play.
abstractcontrol t1_j915epu wrote
Reply to comment by I_will_delete_myself in [N] Google is increasing the price of every Colab Pro tier by 10X! Pro is 95 Euro and Pro+ is 433 Euro per month! Without notifying users! by FreePenalties
It is best to use a script in order to get a Paperspace notebook. Otherwise, yeah, you are going to have a hard time sometimes. The availability does depend on the timezone from what I've heard.