Difference rewards policy gradients

Summary

This is a publication. If there is no link to the publication on this page, you can try the pre-formated search via the search engines listed on this page.

Authors: Jacopo Castellini; Frans Oliehoek; Sam Devlin; Rahul Savani

Journal title: Neural Computing and Applications

Journal publisher: Springer Verlag

Published year: 2022

DOI identifier: 10.48550/arxiv.2012.11258

ISSN: 0941-0643