Dynamic particle allocation to solve interactive pomdp models for social decision making

Rohith Dwarakanath Vallam; Sarthak Ahuja; Surya Shravan Kumar Sajja; Ritwik Chaudhuri; Rakesh Pimplikar; Kushal Mukherjee; Ramasuri Narayanam; Gyana R. Parija

AAMAS 2019

Conference paper

13 May 2019

Dynamic particle allocation to solve interactive pomdp models for social decision making

Abstract

In social dilemma settings, such as repeated Public Goods Games (PGGs), humans often come across a dilemma whether to contribute or not based on past contributions from others In such settings, the decision taken by an agent/human actually depends not only on the belief the agent has about other agents and the environment, but also on their beliefs about others' beliefs To factor in these aspects, we propose a novel formulation of computational theory of mind (ToM) to model human behavior in a repeated PGG using interactive partially observable Markov decision processes (I-POMDPs) Interactive particle filter (IPf) is a well-known algorithm used to approximately solve I-POMDP models for the agents to find their optimal contributions Number of particles assigned to an agent in IPF can be translated into time and computational resources Solving I-POMDPs in a time-memory efficient manner even in the case of small state spaces is a largely intractable problem Also, maintaining a fixed number of particles assigned to each agent, over time, will be highly inefficient in terms of resource utilization To address this problem, we propose a dynamic particle allocation algorithm for different agents based on how well they could predict We validate our proposed algorithm through real experiments involving human agents Our results suggest that dynamic particle allocation based IPF for I-POMDPs is effective in modelling human behaviours in repeated social dilemma setting while utilizing computational resources in an effective manner.

Conference paper