Alberta-developed computer program wins at no-limit Texas Hold ’Em

2017-03-03 - Bob Weber THE CANADIAN PRESS — EDMONTON

Acomputer program has learned to win at one of the most complex poker games by copying a very human impulse — trusting its gut. “I think there’s a lot of similarities to real human intuition,” said Michael Bowling of the University of Alberta’s Computer Poker Research Group.

Teaching poker to computers has been a popular tool in the artificial intelligence community for years.

Unlike games such as chess, no poker player knows what cards other players hold. Having to deal with incomplete information makes poker programs useful in everything from improving public security to helping doctors treat patients with diabetes.

Bowling’s lab has long worked on poker and attracted worldwide attention in 2015 for developing Cepheus, a program that was unbeatable in two-handed, fixedbet Texas Hold ‘Em. The lab’s latest achievement, revealed Thursday in the journal Science, went after the much more complex version of Texas Hold ‘Em in which there is no limit on bets.

Cepheus worked by allowing the computer to learn from mistakes. After billions of hands, it developed a 10-terabyte table of probabilities that made it unbeatable.

But that wasn’t going to work for no-limit Hold ‘Em.

The fixed-bet version of the game has decision points equal to the number 10 with 14 zeros after it. The no-limit equivalent is 10 with 160 zeros after it.

“That’s more than there are atoms in the universe,” said Bowling — way too many to simply crunch through probabilities.

Bowling’s team found the answer with a program called Deep Stack.

“Deep Stack doesn’t compute the whole strategy beforehand,” Bowling said.

“It’s going to compute how it’s going to play on line, as it’s playing. It’s going to only worry about the decision points it reaches while it plays, and figure out how to play those on the fly, in the middle of the game, much more like a human player.”

The key is in developing what Bowling calls intuition. Like a human player, Deep Stack trains its instincts through repetition — in this case, 10 million poker hands played against itself.

“Deep Stack will play against itself over and over again until it figures out, ‘I think this is how much it’s worth, being in this poker situation,’” Bowling said.

That data gets fed into a program

Alberta-developed computer program wins at no-limit Texas Hold ’Em

Newspapers in English

Newspapers from Canada