Average-Reward Off-Policy Policy Evaluation with Function Approximation

Summary

This is a publication. If there is no link to the publication on this page, you can try the pre-formated search via the search engines listed on this page.

Authors: Zhang, Shangtong; Wan, Yi; Sutton, Richard S.; Whiteson, Shimon

Journal number: 1

Journal publisher: ICML

Published year: 2021