Premium Only Content
Player of Games: All the games, one algorithm! (w/ author Martin Schmid)
#playerofgames #deepmind #alphazero
Special Guest: First author Martin Schmid (https://twitter.com/Lifrordi)
Games have been used throughout research as testbeds for AI algorithms, such as reinforcement learning agents. However, different types of games usually require different solution approaches, such as AlphaZero for Go or Chess, and Counterfactual Regret Minimization (CFR) for Poker. Player of Games bridges this gap between perfect and imperfect information games and delivers a single algorithm that uses tree search over public information states, and is trained via self-play. The resulting algorithm can play Go, Chess, Poker, Scotland Yard, and many more games, as well as non-game environments.
OUTLINE:
0:00 - Introduction
2:50 - What games can Player of Games be trained on?
4:00 - Tree search algorithms (AlphaZero)
8:00 - What is different in imperfect information games?
15:40 - Counterfactual Value- and Policy-Networks
18:50 - The Player of Games search procedure
28:30 - How to train the network?
34:40 - Experimental Results
47:20 - Discussion & Outlook
Paper: https://arxiv.org/abs/2112.03178
Abstract:
Games have a long history of serving as a benchmark for progress in artificial intelligence. Recently, approaches using search and learning have shown strong performance across a set of perfect information games, and approaches using game-theoretic reasoning and learning have shown strong performance for specific imperfect information poker variants. We introduce Player of Games, a general-purpose algorithm that unifies previous approaches, combining guided search, self-play learning, and game-theoretic reasoning. Player of Games is the first algorithm to achieve strong empirical performance in large perfect and imperfect information games -- an important step towards truly general algorithms for arbitrary environments. We prove that Player of Games is sound, converging to perfect play as available computation time and approximation capacity increases. Player of Games reaches strong performance in chess and Go, beats the strongest openly available agent in heads-up no-limit Texas hold'em poker (Slumbot), and defeats the state-of-the-art agent in Scotland Yard, an imperfect information game that illustrates the value of guided search, learning, and game-theoretic reasoning.
Authors: Martin Schmid, Matej Moravcik, Neil Burch, Rudolf Kadlec, Josh Davidson, Kevin Waugh, Nolan Bard, Finbarr Timbers, Marc Lanctot, Zach Holland, Elnaz Davoodi, Alden Christianson, Michael Bowling
Links:
TabNine Code Completion (Referral): http://bit.ly/tabnine-yannick
YouTube: https://www.youtube.com/c/yannickilcher
Twitter: https://twitter.com/ykilcher
Discord: https://discord.gg/4H8xxDF
BitChute: https://www.bitchute.com/channel/yann...
LinkedIn: https://www.linkedin.com/in/ykilcher
BiliBili: https://space.bilibili.com/2017636191
If you want to support me, the best thing to do is to share out the content :)
If you want to support me financially (completely optional and voluntary, but a lot of people have asked for this):
SubscribeStar: https://www.subscribestar.com/yannick...
Patreon: https://www.patreon.com/yannickilcher
Bitcoin (BTC): bc1q49lsw3q325tr58ygf8sudx2dqfguclvngvy2cq
Ethereum (ETH): 0x7ad3513E3B8f66799f507Aa7874b1B0eBC7F85e2
Litecoin (LTC): LQW2TRyKYetVC8WjFkhpPhtpbDM4Vw7r9m
Monero (XMR): 4ACL8AGrEo5hAir8A9CeVrW8pEauWvnp1WnSDZxW7tziCDLhZAGsgzhRQABDnFy8yuM9fWJDviJPHKRjV4FWt19CJZN9D4n
-
LIVE
Kim Iversen
4 hours agoTerrorism, Act of God or “Newscum” Incompetence: What REALLY Fueled The California Wildfires
3,429 watching -
2:16:33
Tucker Carlson
2 hours agoTucker Carlson and Michael Shellenberger Break Down the California Fires
110K123 -
58:50
Laura Loomer
1 hour agoThe Great Replacement (Full-Length Documentary)
9 -
LIVE
Razeo
1 hour agoEp 31: Finishing March Ridge & onto Muldraugh tonight
330 watching -
LIVE
Adam Does Movies
18 minutes agoBatman II Update + Flash Director Fails + Movie Bombs! - LIVE!
46 watching -
LIVE
Flyover Conservatives
20 hours agoJack Hibbs Blasts California Leaders: Must-Watch!; Can Trump Fix the Mess? How Long will it Take? - Dr. Kirk Elliott | FOC Show
440 watching -
LIVE
DillyDillerson
1 hour agoTalking to the moon!! Just some live views of the FULL MOON!!
257 watching -
1:29:29
Glenn Greenwald
5 hours agoWith Biden Out, U.S. Finally Admits Harms of His Israel / Gaza Policy; Biden Pays Homage To George W. Bush; Insane Women’s Tennis Scandal: An “Abusive” Coach | SYSTEM UPDATE #388
24.2K29 -
LIVE
Danny Polishchuk
7 hours agoWho's To Blame For LA Fires, Jewish Tunnels Update + Forbidden Anthropology
324 watching -
1:08:10
Donald Trump Jr.
7 hours agoOne Week Until Inauguration, Live with Rep Anna Paulina Luna & Sen Tommy Tuberville
91.3K105