Loaded DiCE: Trading off Bias and Variance in Any-Order Score Function Estimators for Reinforcement Learning

Summary

This is a publication. If there is no link to the publication on this page, you can try the pre-formated search via the search engines listed on this page.

Authors: Farquhar, Gregory; Whiteson, Shimon; Foerster, Jakob

Journal number: 1

Journal publisher: NeurIPS

Published year: 2019