
MetaAI_Official OP t1_izfpjl9 wrote

One of the key challenges of Diplomacy is modeling how people might respond to your actions. We found that the pure self-play approaches behind prior game AI breakthroughs, in games like Go and poker, were not able to anticipate "human" behaviors like retaliation. For that reason, a big contribution of our research is a way to incorporate human data into self-play, which lets us find strong policies that also reflect how people approach the game. -NB


MetaAI_Official OP t1_izfq18c wrote

As someone who isn't an AI specialist, I found this research a fascinating read. The problem matters even to people outside the field, and the research is worth reading if you get the chance! -AG
