AI beats professionals at six-player Texas Hold ’Em poker

Technology

AI beats professionals at six-player Texas Hold ’Em poker

11 July 2019

A person playing online poker — cyano66/Getty

Artificial intelligence has finally cracked the biggest challenge in poker: beating top professionals in six-player, no-limit Texas Hold ’Em, the most popular variant of the game.

Over 20,000 hands of online poker, the AI beat 15 of the world’s top poker players, each of whom has won more than $1 million playing the game professionally.

The AI, called Pluribus, was tested in 10,000 games against five human players, as well as in 10,000 rounds where five copies of Pluribus played against one professional – and did better than the pros in both.

Pluribus was developed by Noam Brown of Facebook AI Research and Tuomas Sandholm at Carnegie Mellon University in Pennsylvania. It is an improvement on their previous poker-playing AI, called Libratus, which in 2017 outplayed professionals at Heads-Up Texas Hold ’Em, a variant of the game that pits two players head to head.

Plays like a bot

In games against five human professionals, Pluribus won by an average of 48 milli-big blinds per game – a measure of how many big blinds were won on average per thousand hands of poker.

Each human player was given an alias for the duration of the tournament, to deter people who knew each other from potentially teaming up against Pluribus.

“We made no effort to hide who the bot was,” says Brown, partially because its play style was obvious – Pluribus plays the first few actions in a round instantaneously because it has already prepared its strategy for those moves, while a human player typically takes a few seconds to decide.

Knowing which player was Pluribus meant the human player could attempt to trick the AI, says Jason Les, a professional poker player who was involved in the tournament. He played in the rounds that pitted five humans against Pluribus, playing an estimated 2000 hands over 12 days.

No guarantees

“We actually use very few computing resources to produce this AI,” says Brown. Training Pluribus required less than 512 gigabytes of memory, which would cost less than $150 using cloud computing services.

While Pluribus played better than human poker players, according to a game theory principle called the Nash equilibrium there was no theoretical guarantee it would always win, says Cazenave.

A Nash equilibrium occurs in non-cooperative games where each player has a list of strategies and no player can improve on their performance by changing to a different strategy. While a Nash equilibrium strategy is unbeatable in Heads Up Texas Hold ’Em, we still have no way of finding one for the six-player variant of the game.

Sign up to our weekly newsletter

Receive a weekly dose of discovery in your inbox. We'll also keep you up to date with New Ů��С��Ƶ events and special offers.

Ů��С��Ƶ

Technology