Safe Policy Optimization with Local Generalized Linear Function ApproximationsAkifumi WachiYunyue Weiet al.2021NeurIPS 2021
Safe reinforcement learning in constrained markov decision processesAkifumi WachiYanan Sui2020ICML 2020