Learning Policies from Self-Play with Policy Gradients and MCTS Value Estimates

Summary

Authors: Soemers, Dennis J. N. J.; Piette, Éric; Stephenson, Matthew; Browne, Cameron

Journal title: Proceedings of the IEEE Conference on Games

Journal number: 2019

Journal publisher: IEEE Press

Published year: 2019