abstractcontrol t1_j915epu wrote on February 18, 2023 at 1:07 PM

Reply to comment by I_will_delete_myself in [N] Google is increasing the price of every Colab Pro tier by 10X! Pro is 95 Euro and Pro+ is 433 Euro per month! Without notifying users! by FreePenalties

It is best to use a script in order to get a Paperspace notebook. Otherwise, yeah, you are going to have a hard time sometimes. The availability does depend on the timezone from what I've heard.

abstractcontrol t1_ivjjbfq wrote on November 8, 2022 at 1:06 PM

Reply to comment by flapflip9 in [Project] Rebel Poker AI by Character_Bluejay601

Poker really brings out all the weaknesses of deep learning, it is hardly a solved thing. For example, if you log into Stars and do a HU SNG, you'll see that you start with 1,000 stacks and 10/20 blinds. That means you have 960 different raises + call + fold different actions to account for just in that small game. You also have large reward variance that deep RL algorithms can't deal with properly. Some algos like categorical DRL are just too memory inefficient to be used even on moderately large games. You'd be amazed at how much memory having around 1,000 different actions takes up once you start using mini-batches.

The academic SOTA is to just stick a tabular algorithm on top of some deep net, which is hardly elegant. All these algorithms are just hacks and I wouldn't use them for real money play.