Conference paperFinite-Time Convergence and Sample Complexity of Multi-Agent Actor-Critic Reinforcement Learning with Average Reward
Workshop paperThe Combinatorial Generation of Objective Targets and Constraints for Large-scale Testing of Optimization Routines