DLT-1041554 Solving infinite-horizon POMDPs with memoryless stochastic policies in state-action space

Summary

This is a publication. If this page does not link to the publication, you can try the pre-formatted search via the search engines listed on this page.

Authors: Müller, Johannes; Montúfar, Guido

Journal title: RLDM: The Multi-disciplinary Conference on Reinforcement Learning and Decision Making

Journal number: 2022

Journal publisher: RLDM

Published year: 2022

Published pages: 435-439

DOI identifier: 10.48550/arxiv.2205.14098

Associated projects

DLT - Deep Learning Theory: Geometric Analysis of Capacity, Optimization, and Generalization for Improving Learning in Deep Neural Networks

Organisations

Not specified