Finite-Time Convergence and Sample Complexity of Multi-Agent Actor-Critic Reinforcement Learning with Average Reward. Hairi FNU, Jia Liu, et al. 2022. ICLR 2022.
Look at What I’m Doing: Self-Supervised Spatial Grounding of Narrations in Instructional Videos. Reuben Tan, Bryan Plummer, et al. 2021. NeurIPS 2021.
Does enforcing fairness mitigate biases caused by subpopulation shift? Subha Maity, Debarghya Mukherjee, et al. 2021. NeurIPS 2021.
Safe Policy Optimization with Local Generalized Linear Function Approximations. Akifumi Wachi, Yunyue Wei, et al. 2021. NeurIPS 2021.
CodeNet: A Large-Scale AI for Code Dataset for Learning a Diversity of Coding Tasks. Ruchir Puri, David Kung, et al. 2021. NeurIPS 2021.
A Bayesian-Symbolic Approach to Reasoning and Learning in Intuitive Physics. Kai Xu, Akash Srivastava, et al. 2021. NeurIPS 2021.