A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play
David Silver
,
Thomas Hubert
,
Julian Schrittwieser
,
David Silver
,
Thomas Hubert
,
Julian Schrittwieser
,
Ioannis Antonoglou
,
Matthew Lai
,
Arthur Guez
,
Marc Lanctot
,
Laurent Sifre
,
Dharshan Kumaran
,
Thore Graepel
,
Timothy Lillicrap
,
Karen Simonyan
,
Demis Hassabis
2018
Science
3,322 citations