Conference paper: Finite-Time Convergence and Sample Complexity of Multi-Agent Actor-Critic Reinforcement Learning with Average Reward
Poster: Towards Understanding Convergence of Simultaneous Gradient Descent-Ascent in Minimax Optimization
Conference paper: Taming Communication and Sample Complexities in Decentralized Policy Evaluation for Cooperative Multi-Agent Reinforcement Learning